Gene Tmz1t_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1810 
Symbol 
ID7084232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2032887 
End bp2034089 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content72% 
IMG OID643698832 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002355458 
Protein GI217970224 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.390541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGAC CTCATCTCCT GCCGCTGCTG CTGCTCTACG CCCTGATGCT GCCGGTCACC 
GGCATGGTGC CGGTGCTGCC CGAGTTCACC GCGCAGCGCT TTCCCGGTCT GGGCCAGTTC
GCCAGCCACT TCTTCATGTC GATCAACATG ATCGGCGCGC TGCTCGGCGC GCCGATCGCC
GGCCTGCTCT CCGACCGCCT CGGCAAGCGC CGCCTGCTCG CGGTAGGCGC GCTCGCCGTG
AACGGCATCG CGCTGCTGGG CATCGCCTGG GCCTGGCGCA GCACGGAGAG CTACGCGCTG
CTCCTCGCGC TGCGCCTCGT CGAGGGCTTC GCCCACATGT CGGCGCTGTC GCTGCTGATG
GCGCTCGCCG CCGACCACGC CGGCAAGGCC GGGCTGGGCG CGCGCATGGG CGCGGTGGGC
GCCTCGATCA GCCTGGGGGT GGCCACCGGC GCGCCGCTCG GGGGCATCAT CGGCGACATC
GACCCCTTCT GGGTGCCCCT CGGCGGCGGC CTGCTCTCGC TGGCGATGGC CGCGCTCGGC
TTCGTCCGCC TCGCCGAGGG CGGCGGCACG CGGCCGCGCA TGACGGCATC CGAGATCGTC
GACACCCTGC GCAACCGTCG CCAGTTGCTG ATCCCGCTCG CGTTCTCCTT CGCCGACCGC
CTCACCGTAG GCTTCATCGT GTCGACGCTG TCGCTCTACC TCGGCCTGGT GATCGGTTTC
GATGCGCGCC AGATCGGCAT CGCGATGGCG GCCTTCCTGA TCCCCTTCTC GGTGCTCACC
TGGCCCGCCG GCCACCTGTC GCGGCATTGG GATCCGTTGT GGATGATGGT GATCGGCAGC
GTGCTCTACG GCGTCTTCCT CGCCGTGCTC GGCTTCGTCC CGGGCGATCG GGTGGTGGCG
ACGATGGCCG CGGGCGGCGT GATCGCGGCG CTGATGTATG CGCCCTCACT GGTGCTGGCC
GCGCAATATG GCGGCAGCGA CTGCCGCGCC AGCGCGCTGG CCGCCTTCAA CATGGCGGGC
TCGCTCGGCT TTGCCGCCGG CCCGCTGCTC AGCAGCGCGC TGCTCGCCTT CTTCGGCCTG
GTGCTGGAAC GCCCCTACCC GCCAGTCTTC GTGGCGATCG GGCTGATCGA GGTCGTGCTC
GCCCTCGCGG TGCTGCTGCT GGTGCGCCGC GGCCGGCTGC AGGCCGGCGC CGCGACGGCC
TGA
 
Protein sequence
MTGPHLLPLL LLYALMLPVT GMVPVLPEFT AQRFPGLGQF ASHFFMSINM IGALLGAPIA 
GLLSDRLGKR RLLAVGALAV NGIALLGIAW AWRSTESYAL LLALRLVEGF AHMSALSLLM
ALAADHAGKA GLGARMGAVG ASISLGVATG APLGGIIGDI DPFWVPLGGG LLSLAMAALG
FVRLAEGGGT RPRMTASEIV DTLRNRRQLL IPLAFSFADR LTVGFIVSTL SLYLGLVIGF
DARQIGIAMA AFLIPFSVLT WPAGHLSRHW DPLWMMVIGS VLYGVFLAVL GFVPGDRVVA
TMAAGGVIAA LMYAPSLVLA AQYGGSDCRA SALAAFNMAG SLGFAAGPLL SSALLAFFGL
VLERPYPPVF VAIGLIEVVL ALAVLLLVRR GRLQAGAATA