Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0169 |
Symbol | |
ID | 7085266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 196539 |
End bp | 197732 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643697211 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002353860 |
Protein GI | 217968626 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTACC AGATCCAGCG TCTGAAGGTG CTCGGTGCGG GCATCTTCAG CCTGATGCTC GCGCTCGGCG TGGCGCGCTT CGCCTATACC CCGCTGCTGC CGCTGATGCA GGCCCAGGCC GGGCTCGGTT TGGCCGAGGG CGGCTGGCTC GCGGCAATCA ATTACACCGG TTACCTGAGC GGCGCGCTGC TCGCCGCCTC GATCAGCGAC CTCGTGCTCA AGGACCGCCT CTACCGCATC GGCATGGTGG TCGCGGTGCT GACCACGCTG ATGATGGGGC TCACCACCGA TTTCACGGTG TGGGCGGTGT CGCGCTACCT GGCCGGACTT TCGAGTGCGG CGGCGATGTT GCTCGGCACG GGCCTGATCC TGAACTGGCT GATCCGTCAC AACCATCGCC ACGAGCTCGG CATCCACTTC GCCGGCATCG GTCTGGGCAT CGCCGGCTGC TCGGTGGCCG TGGCGCTGAT GAGCCTGTGG CTGGACTGGC GTGCGCAGTG GTTCGTGTTC ACCGCGATTG CCTGCGTGCT GCTGGTGCCG GCGCTGCGCT GGCTGCCGGC ACCCGACACC AGCGGGCTGA CGCGCAGCGG CGCGCCGATG CACGACGACC CGCCCAGCCC GCTCTTCCTG CGCATCTTCA TGGCGTCCTA CTTCTGTGCC GGCGTCGGCT TCGTGGTCAG CGCGACCTTC ATCGTCGCCA TCGTCAATCG CCTGCCCGGG TTGGAGGGCC AGGGGAGCTG GAGCTTTCTC GCCATCGGCC TGGCGGCGAT GCCGGCCTGC ATCGTGTGGG ATTTCATCGC CCGCCGTACC GGCGCGCTGA ACGCGCTGAT CCTCGCCGCG GTGCTGCAGA TCGTCGGCAT CCTGCTGCCG GTGGTGGTCG GTGGCAGCCT GGGGGCGATC GCCGGCGCCC TGCTCTTCGG CGGCACCTTC GTCGGCATGG TCAGCCTGGT GCTGACCATG GCCGGGCGTT ACTACCCGAC GCGGCCGGCC AAGATGATGG GCAAGATGAC CATCTCCTAT GGCGTCGCGC AGATCCTCGG CCCGGCGGTG ACCGGCTGGC TGGGCGAGAC CTTCGGCAGC TACGCAGGCG GGCTGTGGTT CGCCGCGGCG ATGATGGGCG TGGGCACCGT GTTGCTGGTG CTGCTGAAGA TCGTGGACCG GCGCGACGCT CAGGCCGCGG CAGGCGTCGC CTGA
|
Protein sequence | MDYQIQRLKV LGAGIFSLML ALGVARFAYT PLLPLMQAQA GLGLAEGGWL AAINYTGYLS GALLAASISD LVLKDRLYRI GMVVAVLTTL MMGLTTDFTV WAVSRYLAGL SSAAAMLLGT GLILNWLIRH NHRHELGIHF AGIGLGIAGC SVAVALMSLW LDWRAQWFVF TAIACVLLVP ALRWLPAPDT SGLTRSGAPM HDDPPSPLFL RIFMASYFCA GVGFVVSATF IVAIVNRLPG LEGQGSWSFL AIGLAAMPAC IVWDFIARRT GALNALILAA VLQIVGILLP VVVGGSLGAI AGALLFGGTF VGMVSLVLTM AGRYYPTRPA KMMGKMTISY GVAQILGPAV TGWLGETFGS YAGGLWFAAA MMGVGTVLLV LLKIVDRRDA QAAAGVA
|
| |