Gene Tmz1t_0169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0169 
Symbol 
ID7085266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp196539 
End bp197732 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content69% 
IMG OID643697211 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002353860 
Protein GI217968626 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTACC AGATCCAGCG TCTGAAGGTG CTCGGTGCGG GCATCTTCAG CCTGATGCTC 
GCGCTCGGCG TGGCGCGCTT CGCCTATACC CCGCTGCTGC CGCTGATGCA GGCCCAGGCC
GGGCTCGGTT TGGCCGAGGG CGGCTGGCTC GCGGCAATCA ATTACACCGG TTACCTGAGC
GGCGCGCTGC TCGCCGCCTC GATCAGCGAC CTCGTGCTCA AGGACCGCCT CTACCGCATC
GGCATGGTGG TCGCGGTGCT GACCACGCTG ATGATGGGGC TCACCACCGA TTTCACGGTG
TGGGCGGTGT CGCGCTACCT GGCCGGACTT TCGAGTGCGG CGGCGATGTT GCTCGGCACG
GGCCTGATCC TGAACTGGCT GATCCGTCAC AACCATCGCC ACGAGCTCGG CATCCACTTC
GCCGGCATCG GTCTGGGCAT CGCCGGCTGC TCGGTGGCCG TGGCGCTGAT GAGCCTGTGG
CTGGACTGGC GTGCGCAGTG GTTCGTGTTC ACCGCGATTG CCTGCGTGCT GCTGGTGCCG
GCGCTGCGCT GGCTGCCGGC ACCCGACACC AGCGGGCTGA CGCGCAGCGG CGCGCCGATG
CACGACGACC CGCCCAGCCC GCTCTTCCTG CGCATCTTCA TGGCGTCCTA CTTCTGTGCC
GGCGTCGGCT TCGTGGTCAG CGCGACCTTC ATCGTCGCCA TCGTCAATCG CCTGCCCGGG
TTGGAGGGCC AGGGGAGCTG GAGCTTTCTC GCCATCGGCC TGGCGGCGAT GCCGGCCTGC
ATCGTGTGGG ATTTCATCGC CCGCCGTACC GGCGCGCTGA ACGCGCTGAT CCTCGCCGCG
GTGCTGCAGA TCGTCGGCAT CCTGCTGCCG GTGGTGGTCG GTGGCAGCCT GGGGGCGATC
GCCGGCGCCC TGCTCTTCGG CGGCACCTTC GTCGGCATGG TCAGCCTGGT GCTGACCATG
GCCGGGCGTT ACTACCCGAC GCGGCCGGCC AAGATGATGG GCAAGATGAC CATCTCCTAT
GGCGTCGCGC AGATCCTCGG CCCGGCGGTG ACCGGCTGGC TGGGCGAGAC CTTCGGCAGC
TACGCAGGCG GGCTGTGGTT CGCCGCGGCG ATGATGGGCG TGGGCACCGT GTTGCTGGTG
CTGCTGAAGA TCGTGGACCG GCGCGACGCT CAGGCCGCGG CAGGCGTCGC CTGA
 
Protein sequence
MDYQIQRLKV LGAGIFSLML ALGVARFAYT PLLPLMQAQA GLGLAEGGWL AAINYTGYLS 
GALLAASISD LVLKDRLYRI GMVVAVLTTL MMGLTTDFTV WAVSRYLAGL SSAAAMLLGT
GLILNWLIRH NHRHELGIHF AGIGLGIAGC SVAVALMSLW LDWRAQWFVF TAIACVLLVP
ALRWLPAPDT SGLTRSGAPM HDDPPSPLFL RIFMASYFCA GVGFVVSATF IVAIVNRLPG
LEGQGSWSFL AIGLAAMPAC IVWDFIARRT GALNALILAA VLQIVGILLP VVVGGSLGAI
AGALLFGGTF VGMVSLVLTM AGRYYPTRPA KMMGKMTISY GVAQILGPAV TGWLGETFGS
YAGGLWFAAA MMGVGTVLLV LLKIVDRRDA QAAAGVA