Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0945 |
Symbol | |
ID | 9155085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 969967 |
End bp | 971196 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003645917 |
Protein GI | 296138674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGATCGG GAACCAAAGC GCCCGCCGCG GCGTTACACC GCCGTGTGAA CTCATCCACC CGCACTAGTC TTGCCGTCGG CGCGATGGCC CTCGGTGGCT TCGGCATCGG CACTACCGAG TTCGCGGCGA TGGGTCTGCT GCCCGATATC GCGAGCGACC TCGGCGTCTC GGAGCCCGTC GCCGGCCATG TGATCGCCGC GTACGCGCTC GGCGTGGTGG TAGGTGCCCC GCTCATCGCC GCGGCGTTCG CGCGAGTCCA GCGCCGGACC CTGCTCATCG CACTCATGGT GGCCTTCACC CTGGGCAACA CGCTCTCGGT GATCGCGCCG AGCTACCAGA CCCTCGTGGC CGCGCGATTC ATCGCAGGCC TGCCGCACGG CGCCTTCTTC GGCGTGGCCG CGCTCGTCGC CGCGCACCTC GCGGGACCAG CGGGCCGCGG GCGCGCGGTC GGCCAGGTAC TCATGGGCCT GTCCGTGGCG AACGTGATCG GCGTCCCCCT CACCACCTGG CTCGGCGACG CGTTCGGCTG GCGCTCTGCA CTCTCGGTGG TCGTGGTGAT CGGCGCCGCC ACGGTGATAG CGCTCCTGGT CTGGCTGCCC GCAGTGGACA TCCCCGTCAC CAACCCACTT ACCGAGCTCG GTGCCCTGCG CCGCCCGCAG GTGTGGTTCG CGCTGCTCAC CGGTGTCGTC GGTTTCGGTG GCATGTTCGC CGTCTACACC TACATCAGCA CCACTCTGAC CAGTGTCTCC GGCCTGGAGA AGTCCGCGGT GCCGTTCGTG CTCGCCGTGT ACGGCGTGGG CATGGTGATC GGCAACGTGG TGGGCGGTCG CGCCGCCGAC CACTCCGTGA CCCGCTCGAT CATCGCAACG CTCGCGCTGC TCGTCGTCCT GCAGGCGGTG TTCTCCGCGT TCGCTCCGGA GCCGGTCGCC GCGGTCTCCC TGTTCTTCCT GATCGGTCTC ACCGCCTCGG CGTTGGTGCC TGCGTTGCAG ACTCGACTGA TGGACGTCGC CGGTGAGGCA CAGACCCTCG CGGCCACGCT GAACCACTCC GCGCTCAACA TCGCCAACGC GCTCGGCGCC TTCCTCGGCG GCGCGGTGAT CACTGCGGGT TACGGCTACA CGGCGCCCGC GCTGGTGGGA AGCGGCCTCG CCGTCGCGGG CCTGGCGGTG TTCGGCATCG GACTGCTCGC GGCGCGGCGC GCTCCGGAAC CTGTCGGGGC CCGCCGGTAG
|
Protein sequence | MRSGTKAPAA ALHRRVNSST RTSLAVGAMA LGGFGIGTTE FAAMGLLPDI ASDLGVSEPV AGHVIAAYAL GVVVGAPLIA AAFARVQRRT LLIALMVAFT LGNTLSVIAP SYQTLVAARF IAGLPHGAFF GVAALVAAHL AGPAGRGRAV GQVLMGLSVA NVIGVPLTTW LGDAFGWRSA LSVVVVIGAA TVIALLVWLP AVDIPVTNPL TELGALRRPQ VWFALLTGVV GFGGMFAVYT YISTTLTSVS GLEKSAVPFV LAVYGVGMVI GNVVGGRAAD HSVTRSIIAT LALLVVLQAV FSAFAPEPVA AVSLFFLIGL TASALVPALQ TRLMDVAGEA QTLAATLNHS ALNIANALGA FLGGAVITAG YGYTAPALVG SGLAVAGLAV FGIGLLAARR APEPVGARR
|
| |