Gene Tmz1t_1609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1609 
Symbol 
ID7084819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1804110 
End bp1805327 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content70% 
IMG OID643698629 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002355260 
Protein GI217970026 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0283587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTCTTCT CGTCTTCCTC CCTTCCCGTC TTCCTGCGCC GCCCCGCGGT CGTCATGGCC 
TGCGGCGCGC TGATCCTCAC GCTGGCCATG GGCGTGCGCC ACACCGGCGG CCTGTTCCTG
CAGCCGATGA CGCTCGACCA CGGCTGGTCG CGCGAGCTGT TCTCCTTCTC GATCGCGCTG
CAGAACCTGC TGTGGGGCGT GTTCCAGCCC TTCGCCGGCG CCTTCGCCGA CCGCCACGGC
GCCGGCCGCA CCCTGGTGGG CGGGGCGCTG CTCTACATCC TGGGCCTGGT GATCATGGCT
CACGCCGACA GCGCGCTCGG CCTCAACCTC GGCGCCGGCC TGCTGATCGG CATGGGCCTC
TCGGGCACCA CCTTCAGCGT GGTGCTGGGC GTGATCGGGC GCATGGCCGC GCCGGAGAAG
CGCAGCCTCG CGCTCGGCAT CGCCTCGGCC GGCGGCAGCT TCGGCCAGTT CGCGGTGCTG
CCGGTGGGAC AGGGGCTGAT CACCGCCTTC GGCTGGCAGG ACGCGCTGCT GTGGATGGCG
GTGGGCATCG CCTTCATCAT CCCGCTCGCC GCCGCGGTCA CCGGCACCAG CGAGCGCGGC
GGCGGTGTCG AACAGTCGCT GCGCCAGGCG CTCGCCGAGG CGATGCGCAC GCCGAGCTTC
CACTTCCTGT TCTGGAGCTT CTTCGTGTGC GGCTTCCAGA CCGCCTTCGT CATGCTGCAT
CTGCCCGCCT ACGTGGTCGA CATGGGTCTG TCGGCCAACA TCGGCATGAG CGCGGTGGCG
ATGATCGGGC TGTTCAACAT CTTCGGCTCC TTCCTGTCGG GCTGGCTGGG CGGCCTCTAC
AGCAAGAAGT GGCTGCTGGC GTGGATCTAC GCGTTGCGTA TCGTGGCCAT CCTGGCGCTG
ATGCTGTTCC CGCTCAGCCC GCTCACGCTC TACGTCTTCG CCGCGGTGAT GGGCCTGCTG
TGGCTGGGCA CGGTGCCGCT CACCAGCGGC CTGGTCGGCC ACATCTTCGG CCTGCGCTAC
GTCGGCATGC TGTACGGCAT CGTCTTCCTC GGCCACCAGA TCGGCGGCTT CCTGGGGGCC
TGGCTGGGCG GGCGCATCTT CGACCTCAGC GGCTCGTACG AGATGGCGTG GTGGCTGTCG
ATCGCGCTCT CGGTGATGGC GGCGGCGCTG TCGCTGCCGG TGCGCGAGGC GCCGCTCGCA
AGGCTGGCGG CGCGATGA
 
Protein sequence
MLFSSSSLPV FLRRPAVVMA CGALILTLAM GVRHTGGLFL QPMTLDHGWS RELFSFSIAL 
QNLLWGVFQP FAGAFADRHG AGRTLVGGAL LYILGLVIMA HADSALGLNL GAGLLIGMGL
SGTTFSVVLG VIGRMAAPEK RSLALGIASA GGSFGQFAVL PVGQGLITAF GWQDALLWMA
VGIAFIIPLA AAVTGTSERG GGVEQSLRQA LAEAMRTPSF HFLFWSFFVC GFQTAFVMLH
LPAYVVDMGL SANIGMSAVA MIGLFNIFGS FLSGWLGGLY SKKWLLAWIY ALRIVAILAL
MLFPLSPLTL YVFAAVMGLL WLGTVPLTSG LVGHIFGLRY VGMLYGIVFL GHQIGGFLGA
WLGGRIFDLS GSYEMAWWLS IALSVMAAAL SLPVREAPLA RLAAR