Gene Gobs_0480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_0480 
Symbol 
ID8752134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp524598 
End bp526307 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content70% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003407633 
Protein GI284989079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.105843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTC GTCCCCAACG CGCGCTCGCC GCCTCGGCGG CCGCGCTCAC CGCAGTCACC 
CTGGCTGCCT GTGCCTCGAG CGACCGGGAC ACCGGTGGTG GCGGGGAGGG CGGTGGTGAG
GCCCGGTCCG GCGGCACCAT GGTCTTCGGC GCCACCGGCG ACCCCGCGAT GCTCGACCCG
GCGTTCGGCA GCGACGGCGA GACCTTCCGC GTCTCCCGCC AGGTCTTCGA GGGCCTGCTC
GGCAACGAGC TCGGTGGCAC CGACCCCGTC CCCGAGCTGG CCGAGGACTG GGAGGTCAGC
GAGGACGGGC TGGAGTACAC CTTCAACCTG CAGCAGGGCG TCCAGTTCCA CGACGGCACA
AACTTCAACG CCGAGGCGGT CTGCTTCAAC TTCGACCGCT GGTACAACTT CGAGGGCCTG
GCCACCAGCC CGAGCGCCTC GTACTACTAC CAGGCGGTCT TCGGCGGCTT CGCCAGCACG
CCGGACACCC CGAGCATCTA CGAGAGCTGC GAGGCGACCG ACGAGAACAC CGCGGTCATC
CGGCTCACCC AGGTCACCTC GAAGTTCCCC GCCGCGCTGG CGCTGCCGGC GTTCTCCATC
CAGAGCCCGA CGGCCCTGCA GGAGTACGAC GCCGACAACC TGTCCGGTGC CGAGGACGCG
CTGGCCTACC CCGAGTACGC CCTGGAGCAC CCGACCGGCA CCGGGCCCTT CCGGTTCGAG
AGCTGGGACC GCGGCAACGG TCAGGTCACC CTGGTCCGCA ACGAGGACTA CTGGGGCGAG
CCGGCGCTGC TCGACGAGCT CATCATCCGC ACCATCCCGG ACGGCAACAC CCGCCGCCAG
GAGCTGCAGG CCGGCTCGAT CGACGGCTAC GACTTCGTGG CGCCGGCGGA CTACCAGTCG
CTGCAGGACG AGGGTCACCA GGTCCTGGTC CGCGACCCGT TCAACATCCT CTACCTGGGC
TTCAACGGCG GGAACGTCCC CGGCACCAGC GCCAACCCGG CGCTGCAGGA CCCGCGGGTG
CGGCAGGCCA TCGCGCACGC GATCGATCGC GACACGATCG TCAGCTCGCT GCTCCCCGAG
GGCGCCGAGG CGGCCATCGA GTTCATGCCG CCGACGGTCG ACGGCTACGC CGAGGACGTC
ACGACCTACG ACCACGACCC GAACCGGGCG CGGCAGCTGC TGCAGGAGGC CGGGGCCGAG
GGCACGACGC TGCGGTTCTA CTACCCGACC GAGGTCAGCC GCCCCTACCT GCCGGACCCG
GCCGCGATGT TCCAGGTCAT CAGCCAGGAC CTGACCGACG CCGGCTTCAC CATCGAGCCG
GTCGCGCTGC CGTGGAACCC GGACTACCTG AACGCCGTCC AGTCCGGTCA GGCTGACATC
CACCTGCTCG GGTGGACCGG TGACTACAAC GACGCCTACA ACTTCATCGG CACCTTCTTC
GCCGAGGCGT CGAACAACCA GGCGTCGGCG GAGTTCGGTG CGTTCAGCGC GCCGGAGATC
TTCCAGGCGC TCGCCCGTGC CGACCAGGAG CCGGACCCGG CCGCCCGTAC CGCGCTCTAC
CAGGAGGCGA ACCGGCTGAT CATGGACTAC CTGCCCGGTG TCCCGATCTC CCACTCGCCG
CCGGCTCTGG TCGTCGCGGA GAACGTCACG GGTCTGGAGC CCAGCCCGCT GACCGCCGAG
GTCTTCTCCA CGGTCTCCAT CAGCGAGTGA
 
Protein sequence
MQRRPQRALA ASAAALTAVT LAACASSDRD TGGGGEGGGE ARSGGTMVFG ATGDPAMLDP 
AFGSDGETFR VSRQVFEGLL GNELGGTDPV PELAEDWEVS EDGLEYTFNL QQGVQFHDGT
NFNAEAVCFN FDRWYNFEGL ATSPSASYYY QAVFGGFAST PDTPSIYESC EATDENTAVI
RLTQVTSKFP AALALPAFSI QSPTALQEYD ADNLSGAEDA LAYPEYALEH PTGTGPFRFE
SWDRGNGQVT LVRNEDYWGE PALLDELIIR TIPDGNTRRQ ELQAGSIDGY DFVAPADYQS
LQDEGHQVLV RDPFNILYLG FNGGNVPGTS ANPALQDPRV RQAIAHAIDR DTIVSSLLPE
GAEAAIEFMP PTVDGYAEDV TTYDHDPNRA RQLLQEAGAE GTTLRFYYPT EVSRPYLPDP
AAMFQVISQD LTDAGFTIEP VALPWNPDYL NAVQSGQADI HLLGWTGDYN DAYNFIGTFF
AEASNNQASA EFGAFSAPEI FQALARADQE PDPAARTALY QEANRLIMDY LPGVPISHSP
PALVVAENVT GLEPSPLTAE VFSTVSISE