Gene Gobs_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_2049 
Symbol 
ID8753720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2129364 
End bp2131151 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content76% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003409108 
Protein GI284990554 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.352532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACAGCA CGCTGGAGCG GGGCCCTGCC TGGGCGCGGG GCACCCGCGG CCGGGTGGCC 
GGCATCGCCG CCGTCGTCCT GCTGGTCGTC GGGGTGGCGG TCAGCTGCGC CAAGGCCGGC
GAGGACCGCG CCACCCTCCC CGTCTTCGGC GGCGGTCCGG ACGCCGGGGT CACCGGGGTC
CGGGCGCCGT CCGAGGCCTC CGGGGGCACG CTGCGGGTGG TGACCGGCGA GGTCGACAAC
CTCGACCCGC AACGCTCCTA CCTCCCCGGA GTGTGGAACC TCATGCGGCT CTACACCCGC
ACCCTGGTCA GCTACTCCTC CGAGCCCGGC CGCACCGCCG AGCTGGTGCC CGACCTCGCG
ACCGACCTGG GGACCACGCC CGACGGCGGG GCCACCTGGA CCTTCACCCT CCGCGAGGGG
GTGCGGTTCG AGACCGGCCA GCCGATCACC TCGCGCGACG TCAAGTACGG CATCGAGCGC
TCGTTCGCCT CGGACGTGGT CGTCGGCGGC CCGACCCGTG TGGTCGAGCT GCTCGACGAC
CCGGGCAACC CCTACGCCGG CCCGTACCAG GACGAGACGC CCGGCCGGCT GGGCCTGGCC
TCGGTCGAGA CGCCCGACGA CCGCACCATC ACCTTCCGGC TGCGCGCGCC CCAGCCGGAC
TTCCCGTACG TCATGGCGCT GCCCTCGAGC AGCCCGGTCC CGGCGGACCA CGACACCGGC
GCCGACTACG GCCTCGACCC GGTCTCCTCC GGCCCCTACC TGGTCGCCAC GCGGGACGAC
GTGACGGGCA TCGTGCTCGA GCGCAACCCG CAGTGGGACC CGGCCACCGA TGACGTCCGC
ACCGCGCTGC CCGACCGGGT CGTCGTGCGC ACCGGGCTGA CCGGTGTGCA GCGCGACCAG
GCGCTGCTGG CCGGGTCCGC TGACGTCGAC ATCTCCGGCG CCGGCATCCA GGCGCCGACC
ACCGCCCGGC TGGGCGAGGA CACCGGCGGC GAGGTCCGGC TGGTCGACCG GATCGACGAG
GTGACCAACA ACGTCGTCCG GCTGCTCGCG CTGCCGACGG ACGTCGCGCC GTTCGACGAC
CCGGCCTGCC GGGCCGCGGT GGCCGCGGTC GTGGACCGCG CGGCGGTGCA GGAGGTGCTC
GGCGGGCCCG CGGAGGCCGT GCGCACCTCC CAGCTGTGGC CGCGCGGCGT GGACGGCGGG
CCGGAGGAGC CCGACCCGAC CGCCGACCCC GCCACCGCCG AGGAGTCCCT GGCCGCCTGC
GGCCGGCCGG AGGGCTTCGG CACCGTGCTC GCCGTCCCGG ACGCCCCGAC CGGCGTGGCC
GTGGCCGAGG AGGTGGCCGG GCAACTCTCC GGCATCGGCA TCGAGGCCGA GGTACGGGCG
CTGGACGCAC CGAGCTTCTA CGCGACCGAG GTCGGCAACC CCGACCGGGT CCGCGACAAC
GGGATCGGCA TCGTGCTGGC CACCTGGACC GCCGACTTCC CGACCCCCGG CTCGTTCCTC
GTGCCGCTGG TCGACGGCCG GTCAGTCAGC ACCGTCGGCA ACACCAACTT CGCCCGCCTG
CGGGACCCCG GCATCGACGG GCTCATCGAC GCCGCCCGCG CGGCCGCCGG CGACCCCGAG
GCGGCCCGCG CGGCCTGGCG CGAGGTGGCG ACCGCGGCCA CCGCGACCGG CGCGTACGTG
CCGCTGGCCG AGTCGCGGGT GCAGCTGCTG GCCGGCCAGC GGCTGCGCAA CGGCGTGGTG
ATGGGCCCGT ATACGAGCTA CGACCTGGCG ACCGCCGGCG TCCGCTGA
 
Protein sequence
MDSTLERGPA WARGTRGRVA GIAAVVLLVV GVAVSCAKAG EDRATLPVFG GGPDAGVTGV 
RAPSEASGGT LRVVTGEVDN LDPQRSYLPG VWNLMRLYTR TLVSYSSEPG RTAELVPDLA
TDLGTTPDGG ATWTFTLREG VRFETGQPIT SRDVKYGIER SFASDVVVGG PTRVVELLDD
PGNPYAGPYQ DETPGRLGLA SVETPDDRTI TFRLRAPQPD FPYVMALPSS SPVPADHDTG
ADYGLDPVSS GPYLVATRDD VTGIVLERNP QWDPATDDVR TALPDRVVVR TGLTGVQRDQ
ALLAGSADVD ISGAGIQAPT TARLGEDTGG EVRLVDRIDE VTNNVVRLLA LPTDVAPFDD
PACRAAVAAV VDRAAVQEVL GGPAEAVRTS QLWPRGVDGG PEEPDPTADP ATAEESLAAC
GRPEGFGTVL AVPDAPTGVA VAEEVAGQLS GIGIEAEVRA LDAPSFYATE VGNPDRVRDN
GIGIVLATWT ADFPTPGSFL VPLVDGRSVS TVGNTNFARL RDPGIDGLID AARAAAGDPE
AARAAWREVA TAATATGAYV PLAESRVQLL AGQRLRNGVV MGPYTSYDLA TAGVR