Gene Gobs_3934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3934 
Symbol 
ID8755619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4124660 
End bp4126003 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content72% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003410873 
Protein GI284992319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.486973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTGT CGAGGAACGG CAAGGGCAGA TCGTCCGGCA GAGCCGCGAC GCGGGGGCGC 
AGACGTCAGC TGGGCGGTGC CGCAGCCGCG GCGGTCCTGG CCTCGACGCT CGCCGCGTGC
GGGGGTGACT CGGGCGCGGC GAACGCGCTC ACCTGGTACA TCAACCCCGA CTCCGGCGGG
CAGGCCGAGA TCGCCGCGCG GTGCAGCCAG GAGTCCGGCG GGGCGTACAC GATCGAGGTC
GCGCAGCTGC CCCGGGAGGC CTCGGCGCAG CGCGAGCAGC TGATCCGGCG CCTGGCGGCC
AATGACGCGT CGATCGACCT GATGAGCCTC GACCCGCCCT TCATCCCCGA GTTCGCCGAG
GCCGGCTTCC TGGCCCCGGT GCCCGAGGAC GTCGCGCAAC GGGTCAGCGA GGACGTCGTG
CAGAGCGCCG TCGCGGGTGC CACCTGGGAC GGCGAGCTGG TGACCGTCCC GTTCTGGGCG
AACACCCAGC TGCTCTGGTA TCGCGAGTCC GTGGCCGAGG AGGCCGGCCT GGACATGAGC
CAGCCGGTCA CCTGGGACCA GATCCTCGAG GCCGCGCAGC AGACCGACAC GCTGATCGGA
GCCCAGGGCG CCCGCGCCGA GTCGCTGACG GTGTGGCTCA ACGCGCTCAT CGAGTCCGCC
GGTGGGTCGA TCATCACCGA GAACGCCGAG GACCCCGGGG ACATCCAGCT CGGTCTCGAG
TCCGAGCAGG CCGCCCGTGC GGCCGAGGTG ATGCGTGCGG TCGCCGACAG CGGGGTCGCC
GGCGCGGCCT TCTCCACGGA GAACGAGGAC GCCTCGGCCA CCGAGTTCGA AGGTCCGAAC
GGCGGCTTCA TGGTCAACTG GCCGTTTGTC TACGGACGGG CGCTGAGCGC CGCGGAGGCC
GGCACCCTCG ATCCGTCGGT GCCGGAGGAC TACGGCTGGG CGGTCTACCC GCGGGTCAAC
CCCAACGACG AGGCTGCCCC GCCCTACGGG GGGATCAACC TCGGGGTCGG CGCGTTCAGC
GCGGCCCCCG AGCTGGCCTA CCAGGCCGCC GAGTGCATCG TCTCGGACCA GAACCAGGCC
TACTACTTCA CGACCAACGG CAACCCGGCC TCCTCCATCC CGGTCTACGA CGACCCCGAG
GTCCTCGAGG TCTTCCCGAT GGCCCCCGAG ATCCGGGAGT CGCTGGAGAT CGCCGCCCCG
CGGCCGCAGA CCGTCTACTA CAACGAGGTC TCGGCCGCCA TCCAGCGGAC CTACCACCCG
CCGGGCTCGG TCGTCCCCGG GGTGACCGGT CCCACCGCGG CCGAGCTGAT CCGAGCCGTC
CTCGCAGGGG AGCAGCTGCT GTGA
 
Protein sequence
MTVSRNGKGR SSGRAATRGR RRQLGGAAAA AVLASTLAAC GGDSGAANAL TWYINPDSGG 
QAEIAARCSQ ESGGAYTIEV AQLPREASAQ REQLIRRLAA NDASIDLMSL DPPFIPEFAE
AGFLAPVPED VAQRVSEDVV QSAVAGATWD GELVTVPFWA NTQLLWYRES VAEEAGLDMS
QPVTWDQILE AAQQTDTLIG AQGARAESLT VWLNALIESA GGSIITENAE DPGDIQLGLE
SEQAARAAEV MRAVADSGVA GAAFSTENED ASATEFEGPN GGFMVNWPFV YGRALSAAEA
GTLDPSVPED YGWAVYPRVN PNDEAAPPYG GINLGVGAFS AAPELAYQAA ECIVSDQNQA
YYFTTNGNPA SSIPVYDDPE VLEVFPMAPE IRESLEIAAP RPQTVYYNEV SAAIQRTYHP
PGSVVPGVTG PTAAELIRAV LAGEQLL