Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3934 |
Symbol | |
ID | 8755619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4124660 |
End bp | 4126003 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003410873 |
Protein GI | 284992319 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.486973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTGT CGAGGAACGG CAAGGGCAGA TCGTCCGGCA GAGCCGCGAC GCGGGGGCGC AGACGTCAGC TGGGCGGTGC CGCAGCCGCG GCGGTCCTGG CCTCGACGCT CGCCGCGTGC GGGGGTGACT CGGGCGCGGC GAACGCGCTC ACCTGGTACA TCAACCCCGA CTCCGGCGGG CAGGCCGAGA TCGCCGCGCG GTGCAGCCAG GAGTCCGGCG GGGCGTACAC GATCGAGGTC GCGCAGCTGC CCCGGGAGGC CTCGGCGCAG CGCGAGCAGC TGATCCGGCG CCTGGCGGCC AATGACGCGT CGATCGACCT GATGAGCCTC GACCCGCCCT TCATCCCCGA GTTCGCCGAG GCCGGCTTCC TGGCCCCGGT GCCCGAGGAC GTCGCGCAAC GGGTCAGCGA GGACGTCGTG CAGAGCGCCG TCGCGGGTGC CACCTGGGAC GGCGAGCTGG TGACCGTCCC GTTCTGGGCG AACACCCAGC TGCTCTGGTA TCGCGAGTCC GTGGCCGAGG AGGCCGGCCT GGACATGAGC CAGCCGGTCA CCTGGGACCA GATCCTCGAG GCCGCGCAGC AGACCGACAC GCTGATCGGA GCCCAGGGCG CCCGCGCCGA GTCGCTGACG GTGTGGCTCA ACGCGCTCAT CGAGTCCGCC GGTGGGTCGA TCATCACCGA GAACGCCGAG GACCCCGGGG ACATCCAGCT CGGTCTCGAG TCCGAGCAGG CCGCCCGTGC GGCCGAGGTG ATGCGTGCGG TCGCCGACAG CGGGGTCGCC GGCGCGGCCT TCTCCACGGA GAACGAGGAC GCCTCGGCCA CCGAGTTCGA AGGTCCGAAC GGCGGCTTCA TGGTCAACTG GCCGTTTGTC TACGGACGGG CGCTGAGCGC CGCGGAGGCC GGCACCCTCG ATCCGTCGGT GCCGGAGGAC TACGGCTGGG CGGTCTACCC GCGGGTCAAC CCCAACGACG AGGCTGCCCC GCCCTACGGG GGGATCAACC TCGGGGTCGG CGCGTTCAGC GCGGCCCCCG AGCTGGCCTA CCAGGCCGCC GAGTGCATCG TCTCGGACCA GAACCAGGCC TACTACTTCA CGACCAACGG CAACCCGGCC TCCTCCATCC CGGTCTACGA CGACCCCGAG GTCCTCGAGG TCTTCCCGAT GGCCCCCGAG ATCCGGGAGT CGCTGGAGAT CGCCGCCCCG CGGCCGCAGA CCGTCTACTA CAACGAGGTC TCGGCCGCCA TCCAGCGGAC CTACCACCCG CCGGGCTCGG TCGTCCCCGG GGTGACCGGT CCCACCGCGG CCGAGCTGAT CCGAGCCGTC CTCGCAGGGG AGCAGCTGCT GTGA
|
Protein sequence | MTVSRNGKGR SSGRAATRGR RRQLGGAAAA AVLASTLAAC GGDSGAANAL TWYINPDSGG QAEIAARCSQ ESGGAYTIEV AQLPREASAQ REQLIRRLAA NDASIDLMSL DPPFIPEFAE AGFLAPVPED VAQRVSEDVV QSAVAGATWD GELVTVPFWA NTQLLWYRES VAEEAGLDMS QPVTWDQILE AAQQTDTLIG AQGARAESLT VWLNALIESA GGSIITENAE DPGDIQLGLE SEQAARAAEV MRAVADSGVA GAAFSTENED ASATEFEGPN GGFMVNWPFV YGRALSAAEA GTLDPSVPED YGWAVYPRVN PNDEAAPPYG GINLGVGAFS AAPELAYQAA ECIVSDQNQA YYFTTNGNPA SSIPVYDDPE VLEVFPMAPE IRESLEIAAP RPQTVYYNEV SAAIQRTYHP PGSVVPGVTG PTAAELIRAV LAGEQLL
|
| |