Gene Tmz1t_3578 details


Gene Information

Locus tag: Tmz1t_3578
Symbol:
ID: 7873083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3922171
End bp: 3923469
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 72%
IMG OID: 643700518
Product: major facilitator superfamily MFS_1
Protein accession: YP_002890548
Protein GI: 237654234
COG category:
COG ID:
TIGRFAM ID:
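
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is only an illustration, assuming that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 3922171
end_bp = 3923469
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match Gene Length
print(span % 3 == 0)                                 # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match Protein Length
```

Under these assumptions all three checks print True for this record.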


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0372089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCC AGTTCCGCCT GCTGCGGGAG CGCCGCTTCC TGCCCTTCTT CCTGACCCAG 
TTCCTCGGCG CCTTCAACGA CAACGTCTAC AAGAACGCGC TGGTGGTGCT GATCACCTTC
CAGGCCGCGC GTCTGGGGGC GGACTCGCCC GGGGTGCTGG TGAACCTCGC CGCGGGTGTC
TTCATCCTGC CCTTCTTCCT GTTCTCGGCC ACCGCCGGCC AGCTCGCCGA CAAGTACGAG
AAGAGCCGCC TGATCCGCGC CACCAAGCTG CTCGAGATCG CCGTGATGGC GCTTGCGGTG
GTCGGCTTCG CCCTGATGTC GCTGCCGCTG CTGCTCGTCG TGCTGTTCCT GATGGGGGCG
CAGTCGGCAC TGTTCGGGCC GGTCAAGTAC GCGATCATCC CGCAGCAGCT CGCCGACGAC
GAGCTGGTGG GCGGCAACGC GCTGGTGGAG GCGGCGACCT TCGTCGCCAT CCTCGCCGGC
ACCATCGTCG GCGGCCTGCT GGTGGCGGGG GATGCCGGGC CCGGCCGTGT GGCGGTCGCG
GTGCTCGCGA TCGCGCTGCT CGGCTGGTGG TCCAGCCGCG CCATCCCGCC CGCGGCGGCG
GCCGACCCCG GGCTGCGGGT GAACTGGAAC CCGGTGACGC AGACGCGCGA GATGCTGCGC
TTCATGCTCG AGGCGCGCGC GGTCTTCGTC GCCATCGTCG GCATCTCGTG GTTCTGGTTC
TACGGTGCCG TGTTCCTGTC GCAGTTTCCC GGCTTCGCCG CGGATCATCT CCGCGGCGAC
GAGCGCGCGG TGACGCTGCT GCTGGCGCTG TTCTCGGTCG GCATCGGCGC GGGCTCGCTG
TTGTGCGGGC GCCTGTCGCG CGGGCGGGTG GAGCCGCGCA TGGTGCTGCC GGGGGCGATC
GGGCTGAGCC TGTTCGCGCT CGACCTGTGG TGGGCGAGCC CGGCGGCGGG CGCCTTCCCG
CCCGGCCAGG GGCTGGATAC GCTGTTTGCC CGCGCCGAGG TGTGGCGGGT GGTGTTCGAC
CTCGTGATGA TCGGGGTGTG CGGCGGCTTC TTCATCGTGC CGCTGTATGC GCTCGTGCAG
CAGCGCTCCG CGCCGGCGCA CCGCGCGCGC GTGATCGCCG GCAACAACAT CCTCAACGCG
CTCTTCATGG TTGCCGCCGC GGCGATGGGC ATCGGGCTGC TGGCCGCGGG CTTCGCGGTG
CCGCAGCTCT TCCTGGCCAC GGCCTTGCTC AACGCGCTGG TCGCCGCGCT GCTGTTCGCC
CGCGAGCCCG CCTTCCGCCA GCGCGGCGGC CCCCCGTAG
 
Protein sequence
MSGQFRLLRE RRFLPFFLTQ FLGAFNDNVY KNALVVLITF QAARLGADSP GVLVNLAAGV 
FILPFFLFSA TAGQLADKYE KSRLIRATKL LEIAVMALAV VGFALMSLPL LLVVLFLMGA
QSALFGPVKY AIIPQQLADD ELVGGNALVE AATFVAILAG TIVGGLLVAG DAGPGRVAVA
VLAIALLGWW SSRAIPPAAA ADPGLRVNWN PVTQTREMLR FMLEARAVFV AIVGISWFWF
YGAVFLSQFP GFAADHLRGD ERAVTLLLAL FSVGIGAGSL LCGRLSRGRV EPRMVLPGAI
GLSLFALDLW WASPAAGAFP PGQGLDTLFA RAEVWRVVFD LVMIGVCGGF FIVPLYALVQ
QRSAPAHRAR VIAGNNILNA LFMVAAAAMG IGLLAAGFAV PQLFLATALL NALVAALLFA
REPAFRQRGG PP
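
The sequences above are printed in blocks of 10 characters, 60 per line. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the record does not state how its own GC content figure was calculated, so this is one plausible definition (G plus C over total length), not the database's method.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGAGCGGCC AGTTCCGCCT GCTGCGGGAG CGCCGCTTCC TGCCCTTCTT CCTGACCCAG"
seq = join_blocks(first_line)

print(len(seq))          # number of bases on this line after joining
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

To check the whole gene, the full listing would be passed to `join_blocks` in place of `first_line`.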