Gene Information |
Locus tag | Tmz1t_3964 |
Symbol | |
ID | 7873610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4363261 |
End bp | 4364304 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700901 |
Product | protein of unknown function DUF6 transmembrane |
Protein accession | YP_002890924 |
Protein GI | 237654610 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
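The coordinate and length fields in this record agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4363261
end_bp = 4364304
reported_gene_length = 1044     # bp
reported_protein_length = 347   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # gene span matches Gene Length
print(remainder == 0)                             # length is a whole number of codons
print(protein_length == reported_protein_length)  # codons minus stop matches Protein Length
```

Under those assumptions, 1044 bp corresponds to 348 codons, one of which is the stop codon, leaving the 347 residues reported as Protein Length.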
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAACA TCCACGTGAT CCATGCACTG CTCGCGGCGG CCTTGTTCGG CGCGAGCACC CCGTTTGCCA AATTGCTGGT CGGCGAGATG TCGCCCTGGC TGCTGGCCGG GCTGCTCTAC CTTGGAAGCG GGCTCGGGCT GGCTGTGGCG CGCTTAATCC GCGATCGCAG CTGGACGCCC TCCGGCCTGG GCAAGCGGGA ATGGCCGTGG CTGCTGGGGG CGATCTTCTT CGGCGGCGTG CTCGGCCCGC TCGCACTGAT GTTCGGCCTC ACGCGCACCA GCGGCTCGAC CGCGTCGCTG CTGCTCAACC TCGAGGCGGT ACTGACCGCC GTGATCGCGT GGGTCGTATT CAGGGAGAAC GCCGACCGCC GTATCGTGCT CGGCATGCTC GCGATCGTCG CTGGCGGGGT CGTGCTGTCC TGGTCGGGTA GTGAAGAGAG CACGAACGAT TGGATCGGCC CGCTCGCCAT CGCCGCCGGC TGTATGTGCT GGGCAATCGA CAACAACCTG ACGCGGCGCG TGTCGGCCTC GGATGCGCTC TTCATCGCGG CCACGAAAGG TGCGGTGGCG GGCACGGTCA ACGTCGGACT GGCGTTTGCG CTCGGCGCGA GCCTGCCGGA CGGTGCGGTT CTGCTCGGCA CCCTGGTCGT CGGGTTGTTC GGCTATGGCA TCAGCCTGGT CCTCTTCGTG CTTGCGCTGC GCGGACTGGG GACGGCGCGC ACCGGCGCCT ACTTCTCGAC TGCGCCGTTC ATCGGCGCGG CAGTGTCGCT GGCCCTGCTG GGGGAGTCGA CCTCGATCTC ATTCTGGATT GCGGCAGCCC TGATGGGCTG GGGGGTGTGG TTACACCTCA CCGAGCATCA CGAGCACGAG CACGTGCATG AGCCGATGGA GCATGGCCAT CGGCACACCC ATGACGAACA CCACCAGCAC GAACACGACT TCGCCTGGAA CAGTGACGAG TCACATGAAC ATTGGCACCG TCACGAGGCG CTGGTTCACA AGCACCCGCA CTTTCCAGAC ATCCACCATC GGCACTCACA TTGA
|
Protein sequence | MNNIHVIHAL LAAALFGAST PFAKLLVGEM SPWLLAGLLY LGSGLGLAVA RLIRDRSWTP SGLGKREWPW LLGAIFFGGV LGPLALMFGL TRTSGSTASL LLNLEAVLTA VIAWVVFREN ADRRIVLGML AIVAGGVVLS WSGSEESTND WIGPLAIAAG CMCWAIDNNL TRRVSASDAL FIAATKGAVA GTVNVGLAFA LGASLPDGAV LLGTLVVGLF GYGISLVLFV LALRGLGTAR TGAYFSTAPF IGAAVSLALL GESTSISFWI AAALMGWGVW LHLTEHHEHE HVHEPMEHGH RHTHDEHHQH EHDFAWNSDE SHEHWHRHEA LVHKHPHFPD IHHRHSH
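The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as alternative starts; since this gene begins with ATG, no special start handling is attempted. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence listed above.
first_blocks = "ATGAACAACA TCCACGTGAT CCATGCACTG"
print(translate(first_blocks))  # MNNIHVIHAL
```

Passing the full Gene sequence value to `translate` and comparing the result with the Protein sequence value (after removing spaces) gives an end-to-end check; `gc_percent` on the same input can be compared with the reported GC content.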