Gene Gobs_2697 details

Gene Information

Locus tag: Gobs_2697
Symbol:
ID: 8754369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 2806480
End bp: 2808627
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Hedgehog/intein hint domain protein
Protein accession: YP_003409701
Protein GI: 284991147
COG category:
COG ID:
TIGRFAM ID:
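
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2806480
end_bp = 2808627
gene_length_bp = 2148
protein_length_aa = 715

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

coords_match_length = span_bp == gene_length_bp
length_is_whole_codons = span_bp % 3 == 0
protein_matches = expected_protein_aa == protein_length_aa

print(span_bp, codons, expected_protein_aa)
print(coords_match_length, length_is_whole_codons, protein_matches)
```

Under these assumptions the span is 2148 bp, which is 716 codons, giving 715 residues once the stop codon is dropped. All three agree with the listed values.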


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.585756
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGTGGG ACGCCCAGCG GCTGGACGCC GAGGAACCGG CGACGCTGCC GGGCATGCCG 
TCGATGCGCG GCCTGCTGCG CAGCGTGCAG GTGCCGGAGT TCCCCGGGTT GACCTTCCAC
GAGGTGCGCA GCAAGAGCGC GCTCAACGAG GTCCCCGGCG ACTCGCCCAT GCCGTTCCGC
TGGACGATCA ACCCCTATCG GGGCTGCTCC CATGCGTGTG TGTACTGCCT GCGGGGAGAC
ACCCGTGTGC TGATGGCCGA CGGCCGGCAG AAGGCGATCG CCGATCTCCG GGTGGGGGAC
CGGATCGTCG GCACTCAGAA GCGCCGGACG TACCGCCACT ACGTCACCAC CGAGGTGCTG
GCGCACTGGT CGACGGTCAA GCGGGCGTAC CGCGTGCGTC TGGCCGACGG CACGGAGCTC
GTCGCCAGCG GCGACCACCG GTTCCTGACG GGCAGGGGCT GGAAGCACGT CACCGGCGCG
ATGGCCGGTC GCGACCGTCG CCCGTACCTG ACGACCGACG ACGAGCTGCT CGGTTTCGGT
CGGTCGGCAG CAGCACTGGA CGTCTGCGCG GACTACCGCC GGGGCTACCT CGCCGGGATG
ATCCGCGGAG ACGGACACCT GAAGATGTAC CGGTACGAGC GAGCCGGCCG CAGGCACGGC
GGCGTCCACC GGTTTCGTCT GGCGCTGGCC GACGTGGAGG CACTGGACCG CGCACAGGCC
TACCTCGCCG CGGAGGGCGT GCGCACGGAC CGCTCCAGCT TCTCGCCCGC GTCGGTGGAG
CGACGGGCCA TGTCGGCCAT CCGGACCTCC ACCGCCGCGG GGGTGGCCCG CATCTCGGAG
CTCGAGGAGT GGCCCAGCGC CCCCACGGCT GCATGGCAGC GTGGGTTCCT GGCAGGGGTG
TTCGATGCCG GAGGCAGCCG CAGCCAGCAC GTCCTGCGCG TCACGAACAC CGATCCGGAG
ATCTTGAGCC ACACCCGCGA GGCGTTGGAG TGGTTCGGGT TCGACGCGGT GCTCGAGGAT
CGCAAGCGGG CGAACGGGCT CGCCTGCGTA CGGGTCCGCG GCGGGCTCGG TGAACACATC
AGGTTCATCC ACCTCGTCGA CCCGGCGATC CGGCGCACGT GCTCACTCGA CGGGACAGCA
GTGAAGAGCG ACGCCGACCT GCGAGTCGTC GAGGTGGAGG ACCTCCGCCT CGAGATGCCG
ATGTACGACA TCACCACTGG CACCGGGGAC TTCCTCGCCA ACGGCGTGGT CAGCCACAAC
TGCTTCGCTC GCGCGACACA CCAATGGCTG GAGCTGGACA CCGGCCGGGA CTTCGACAGC
CAGGTCGTCG TGAAGACCAA CCTCGTCGAC GTCCTGCGCC AGGAGCTGGC CCGGCCCTCG
TGGACGCGCG AGCACGTGGC GCTGGGCACC AACACCGACC CCTACCAGCG GGCCGAGGGG
CGCTACCGGC TGATGCCCGG GGTGATCTCG GTGCTGGCCG GCTCCGGCAC GCCGTTCTCG
ATCCTGACGA AGGGCACGCT GCTGCGGCGG GACCTGCCGG TGCTGGCGGC CGCCGCCGGC
GACGTGCCGA TCGGACTGGG CGTTTCCATG GCCATCTGGG ACGACGCCCT GCACGCCTCG
CTCGAACCCG GGGTGCCCAG CCCGCGCGCC CGCCTGGAGC TGGTGCGGGC GATCGCCGAC
GCCGGGCTGT CCTGCGGTGT GTTCCTGGCC CCGGTGCTGC CCGGCCTCAC CGACCGGCTG
GCCGACCTGG ACGCCGCGCT GCGGGCGATC GCGGAGGCGG GTGCCGACGG CGTCACCGTC
GTCCCGCTGC ACCTGCGCCC CGGCGCTCGT GAGTGGTTCT CCGCCTGGCT GGCCCGCGAG
CACCCACAGC TGGTGCCGCG CTACCAGCAG CTCTACCGCC GCGGCGCGGC GGTGGCACCG
GAGTACCGCA GCTGGCTGGC CGGGCGGGTC GCCCCGCTGC TGGCGCGCTA CGGACTGGAC
CGGCAGGCCG GGGGAGCGGC CCGGGGTGTG GACGCGCCGG CCGGCATCCC CGGGGACGGG
GACAGCCGGT TCCCGGCCGG CAGCCTGCCG GCGACCCGGC CGAGCGCCCG GCCGGCTCCG
GCACGCCACG CTTCGGCCCC GGCGGGCGAG CAGCTCACGC TGCTCTGA
 
Protein sequence
MRWDAQRLDA EEPATLPGMP SMRGLLRSVQ VPEFPGLTFH EVRSKSALNE VPGDSPMPFR 
WTINPYRGCS HACVYCLRGD TRVLMADGRQ KAIADLRVGD RIVGTQKRRT YRHYVTTEVL
AHWSTVKRAY RVRLADGTEL VASGDHRFLT GRGWKHVTGA MAGRDRRPYL TTDDELLGFG
RSAAALDVCA DYRRGYLAGM IRGDGHLKMY RYERAGRRHG GVHRFRLALA DVEALDRAQA
YLAAEGVRTD RSSFSPASVE RRAMSAIRTS TAAGVARISE LEEWPSAPTA AWQRGFLAGV
FDAGGSRSQH VLRVTNTDPE ILSHTREALE WFGFDAVLED RKRANGLACV RVRGGLGEHI
RFIHLVDPAI RRTCSLDGTA VKSDADLRVV EVEDLRLEMP MYDITTGTGD FLANGVVSHN
CFARATHQWL ELDTGRDFDS QVVVKTNLVD VLRQELARPS WTREHVALGT NTDPYQRAEG
RYRLMPGVIS VLAGSGTPFS ILTKGTLLRR DLPVLAAAAG DVPIGLGVSM AIWDDALHAS
LEPGVPSPRA RLELVRAIAD AGLSCGVFLA PVLPGLTDRL ADLDAALRAI AEAGADGVTV
VPLHLRPGAR EWFSAWLARE HPQLVPRYQQ LYRRGAAVAP EYRSWLAGRV APLLARYGLD
RQAGGAARGV DAPAGIPGDG DSRFPAGSLP ATRPSARPAP ARHASAPAGE QLTLL
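
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be checked, the Python below strips the whitespace, translates the DNA codon by codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon assignments are the standard ones, which translation table 11 shares for sense and stop codons; alternative start codons, which table 11 also allows, are not handled here.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G with the third base
# varying fastest. Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and the first block of the protein.
gene_start = "ATGCGGTGGG ACGCCCAGCG GCTGGACGCC"
protein_start = "MRWDAQRLDA"
print(translate(gene_start) == protein_start)  # True
```

Pasting the full gene sequence into `translate` and `gc_fraction` in the same way would allow a comparison with the listed protein sequence and the stated GC content of 73%.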