Gene Tmz1t_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3964 
Symbol 
ID7873610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4363261 
End bp4364304 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content66% 
IMG OID643700901 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_002890924 
Protein GI237654610 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACA TCCACGTGAT CCATGCACTG CTCGCGGCGG CCTTGTTCGG CGCGAGCACC 
CCGTTTGCCA AATTGCTGGT CGGCGAGATG TCGCCCTGGC TGCTGGCCGG GCTGCTCTAC
CTTGGAAGCG GGCTCGGGCT GGCTGTGGCG CGCTTAATCC GCGATCGCAG CTGGACGCCC
TCCGGCCTGG GCAAGCGGGA ATGGCCGTGG CTGCTGGGGG CGATCTTCTT CGGCGGCGTG
CTCGGCCCGC TCGCACTGAT GTTCGGCCTC ACGCGCACCA GCGGCTCGAC CGCGTCGCTG
CTGCTCAACC TCGAGGCGGT ACTGACCGCC GTGATCGCGT GGGTCGTATT CAGGGAGAAC
GCCGACCGCC GTATCGTGCT CGGCATGCTC GCGATCGTCG CTGGCGGGGT CGTGCTGTCC
TGGTCGGGTA GTGAAGAGAG CACGAACGAT TGGATCGGCC CGCTCGCCAT CGCCGCCGGC
TGTATGTGCT GGGCAATCGA CAACAACCTG ACGCGGCGCG TGTCGGCCTC GGATGCGCTC
TTCATCGCGG CCACGAAAGG TGCGGTGGCG GGCACGGTCA ACGTCGGACT GGCGTTTGCG
CTCGGCGCGA GCCTGCCGGA CGGTGCGGTT CTGCTCGGCA CCCTGGTCGT CGGGTTGTTC
GGCTATGGCA TCAGCCTGGT CCTCTTCGTG CTTGCGCTGC GCGGACTGGG GACGGCGCGC
ACCGGCGCCT ACTTCTCGAC TGCGCCGTTC ATCGGCGCGG CAGTGTCGCT GGCCCTGCTG
GGGGAGTCGA CCTCGATCTC ATTCTGGATT GCGGCAGCCC TGATGGGCTG GGGGGTGTGG
TTACACCTCA CCGAGCATCA CGAGCACGAG CACGTGCATG AGCCGATGGA GCATGGCCAT
CGGCACACCC ATGACGAACA CCACCAGCAC GAACACGACT TCGCCTGGAA CAGTGACGAG
TCACATGAAC ATTGGCACCG TCACGAGGCG CTGGTTCACA AGCACCCGCA CTTTCCAGAC
ATCCACCATC GGCACTCACA TTGA
 
Protein sequence
MNNIHVIHAL LAAALFGAST PFAKLLVGEM SPWLLAGLLY LGSGLGLAVA RLIRDRSWTP 
SGLGKREWPW LLGAIFFGGV LGPLALMFGL TRTSGSTASL LLNLEAVLTA VIAWVVFREN
ADRRIVLGML AIVAGGVVLS WSGSEESTND WIGPLAIAAG CMCWAIDNNL TRRVSASDAL
FIAATKGAVA GTVNVGLAFA LGASLPDGAV LLGTLVVGLF GYGISLVLFV LALRGLGTAR
TGAYFSTAPF IGAAVSLALL GESTSISFWI AAALMGWGVW LHLTEHHEHE HVHEPMEHGH
RHTHDEHHQH EHDFAWNSDE SHEHWHRHEA LVHKHPHFPD IHHRHSH