Gene Gobs_1079 details

Gene Information

Locus tag: Gobs_1079
Symbol:
ID: 8752740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 1139341
End bp: 1140999
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003408209
Protein GI: 284989655
COG category:
COG ID:
TIGRFAM ID:
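
The lengths in this record follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 1139341
end_bp = 1140999
codon_size = 3  # nucleotides per codon

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // codon_size - 1

print(gene_length, "bp")   # gene length derived from the coordinates
print(protein_length, "aa")  # protein length derived from the gene length
```

Under these assumptions the derived values agree with the Gene Length and Protein Length fields of the record.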


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.133059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGTTGA GCAAGCGCTC TGCTGCGCTC ATCGCGACCG GTCTGACCGG CGCGATGGTG 
CTGTCCGCCT GTGGCGGTGG CGGCGACGAG GACGGCGGAT CGGGGGCTGC CGCCGCGGAC
GGCGGCAGTT TCACCGTCTA CATCGCCGAG CCCGAGAACC CGCTCATCCC CGGCCAGACC
ACCGAGACCG AGGGTGCCCA GGTCCTCTAC TCCCTCTTCA CCGGACTCGT CCAGTACGAC
TCGGAGACCA ACGAGGCCGT CTACACCGGG GTCGCCGACT CGATCGAGTC CGCGGACCAG
ACCACCTGGA CCGTCAAGCT CAAGGACGGC TGGACCTTCC ACGACGGCAG CCCGGTGCGC
GCCCAGTCGT TCGTCGACGC CTGGAACTGG ACCGCCTACA GCCCCAATGC GGCCAACAGC
TCGTACTTCT TCGCCAACAT CGCCGGCTAC GACCAGCTGC AGGCCCCGAC CGACGACGCC
GGCAACGTGA CCGGTGACCC GGCCGCCACG GCGATGAGCG GTCTGCGGGT GGTCGACGAC
CTGACCTTCG AGGTCACCCT GTCCGCCCCG TACGCCCAGT GGCCCACCAC GGTCGGTTAC
AGCGCCTTCT ACCCGCTGCC CCCGGCCTTC TTCGAGGACT CGGCGGCCTT CGGCGAGCAG
CCGATCGGCA ACGGCCCGTT CCGTGCCGAT GAGCCCTTCG TCCCCGGTAC CGGCGTCACG
CTGACCCGCT ACGACGAGTA CGGCGGCGAC AAGCCGGCCA ACGCCGCGTC TGTGCAGTAC
GTGGTCTACG CCGAGCAGGC GACCGCGTAC CGGGACCTGC AGGCCGGGAA CCTCGACATC
ATGGACGAGC TCCCGCCGGA CGCCCTCGCT TCCGCCGAGG CCGAGCTGGG CGACCGCCTC
CTGCAGGTCG AGCAGGGGGA CATCACCTCG CTGGGCTTCC CGACCTACGA CGAGCGCTTC
GCCGACCCGA ACGTGCGGCG TGCCTTTTCG ATGGCGATCG ACCGCGAGTC GATCACCGAG
GCGATCTTCC AGGGCACCCG CATCCCGGCG ACGTCCTTCA TCAACCCGGT GGTCGACGGC
TACCGCGAGG GCGCCTGTGA CGTCTGCGAG CTGAACGTCG AGGAGGCCAA CCGGCTCCTG
GACGAGGCGG GCTTCGACCG CAGCCAGCCG GTCGACCTGT GGTTCAACGC CGGCGCCGGC
CACGAGATCT GGATGGAGGC CGTCGGCAAC CAGATCCGGG AGGGCCTCGG CGTCGACTAC
ACGCTCCAGG GCAACCTCGA CTTCGCCGAG TACCTGCCGC TGCAGGACGA GCAGGGCATG
ACCGGCCCGT TCCGGTCCGG CTGGATCATG GACTACCCGG TCGCCGAGAA CTTCCTCGGC
CCGCTGTACT CCAGCACCGC GCTGCCGCCC GGCGGGTCGA ACGTGACCTT CTACAGCAAC
CCCGAGTTCG ACGCGCTGCT GCAGCAGGGC AACTCGGCCG ACACCAACGA GGCGGCGATC
CAGGCCTACC AGGCCGCGGA GGACCTCCTG CTCCGCGACA TGCCGGCCAC CCCGCTGTTC
TACCGGGTCA ACCAGAGCGC CCACTCGGAG AACGTCGACA ACGTGGTCGT CGACGCCTTC
AACCGGATCG ACACGGCAGC CGTCCAGGTC GTCGGCTGA
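
The GC content field of the record can in principle be recomputed from the gene sequence. The following is a minimal sketch of that calculation, assuming GC content means the fraction of G and C bases over all bases. The function name gc_fraction is hypothetical, and the example runs on the first sequence line only, so its result need not equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base column spacing and line breaks used in
    the record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = (
    "GTGAGGTTGA GCAAGCGCTC TGCTGCGCTC ATCGCGACCG GTCTGACCGG CGCGATGGTG"
)
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence to the same function gives the whole-gene value to compare with the record.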
 
Protein sequence
MRLSKRSAAL IATGLTGAMV LSACGGGGDE DGGSGAAAAD GGSFTVYIAE PENPLIPGQT 
TETEGAQVLY SLFTGLVQYD SETNEAVYTG VADSIESADQ TTWTVKLKDG WTFHDGSPVR
AQSFVDAWNW TAYSPNAANS SYFFANIAGY DQLQAPTDDA GNVTGDPAAT AMSGLRVVDD
LTFEVTLSAP YAQWPTTVGY SAFYPLPPAF FEDSAAFGEQ PIGNGPFRAD EPFVPGTGVT
LTRYDEYGGD KPANAASVQY VVYAEQATAY RDLQAGNLDI MDELPPDALA SAEAELGDRL
LQVEQGDITS LGFPTYDERF ADPNVRRAFS MAIDRESITE AIFQGTRIPA TSFINPVVDG
YREGACDVCE LNVEEANRLL DEAGFDRSQP VDLWFNAGAG HEIWMEAVGN QIREGLGVDY
TLQGNLDFAE YLPLQDEQGM TGPFRSGWIM DYPVAENFLG PLYSSTALPP GGSNVTFYSN
PEFDALLQQG NSADTNEAAI QAYQAAEDLL LRDMPATPLF YRVNQSAHSE NVDNVVVDAF
NRIDTAAVQV VG
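
The protein sequence corresponds to the gene sequence read in codons under translation table 11. The sketch below illustrates this on the first and last lines of the gene sequence. It is not the database's own method: the names CODON_TABLE, START_CODONS and translate are hypothetical, the codon-to-residue mapping is the standard one (which table 11 shares), and only ATG, GTG and TTG are treated as start codons, which is a subset of the starts table 11 allows.

```python
# Standard codon-to-residue mapping, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified set of bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = (
    "GTGAGGTTGA GCAAGCGCTC TGCTGCGCTC ATCGCGACCG GTCTGACCGG CGCGATGGTG"
)
last_line = "AACCGGATCG ACACGGCAGC CGTCCAGGTC GTCGGCTGA"

print(translate(first_line))
print(translate(last_line))
```

The first call shows why the protein begins with M although the gene begins with GTG, and the second shows that the final TGA codon ends translation without adding a residue.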