Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1057 |
Symbol | |
ID | 7084041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1158649 |
End bp | 1160193 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 643698075 |
Product | putative symporter protein |
Protein accession | YP_002354715 |
Protein GI | 217969481 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.872646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC GCGCACGCGC CGCGCCGGAT CGCCACCACG GGGGCGGAGC ATGTGCCCGC GCGGCCTGCG CGCCTGGCGC CGGGCGTGCT CCGGATGCCG ACCGCGTCGA GTGGGGTGCG CTGCTCGCCT ACGGCGCCCT CGGCCTGCCG CTCGCCTTCG CGGCGCTGCC GATCTACGTG CATGTGCCGC GCCTGTATGC GGAAGGCCTG GGCCTGTCGC TCGCGCTGGT CGGTGCCGTG CTGCTCGCGG CGCGCGTCGT CGACGCCATC ACCGACCCGC TCATCGGCTG GGCCAGCGAC CGCCTGCCGC GGCGCCGGCT GTGGATCGCG CTGGCGCTGC CCGCGCTCGG GGCCGGCATG CTCGGTCTGC TCGTGCCGCC CGCGGGGGCC GGGGCGGGCT GGCTGTTCGC GCTGCTGGTC GCGGTGTCGC TGGCCTACTC GGTGGCGAGC ATCGCCTACA ACGCCTGGGG CGCCGAGGTC GCCTCCACCC CCGCGGCGCG CACGCGCTTC GTCGCCAGCC GCGAGGCCTT CGCGCTCGCC GGCGTGGTGC TGGCGGCGGC GCTGCCGGGG CTGCTGGGCG AGGGGGTGAT CGGTGGCACG GCTGGCGTCG GCCAGTCGGC GGTCGCCACT GAAGGCGGCG CCGCGGGCGG TGTCGGTGCG TCAGGGGGCG GTGGCGACGG CGGCGCGGCG GGGCTCGCAC GGCTCGCCTG GCTTTTCCTT CCGCTGCTCG TGGTCTTCGG CCTGTGGACG CTGTGGCGTG CGCCCGCTCC GCCGCGGCTC GCTGCGACGC ACGCGCCGGT GTGGCGGGGC CTGCGCGCGG CGCTCGCCGA CGCGGCCTTC CTCCGCCTGC TCGCGGTGTT CGCGGTCAAT GGCATCGCCG CCGCGATCCC CTCGGCGACG GTGCTCTTCT TCGTCGCCGA CGTGCTGCAG GCCGAGGCGC TTGCCGGCGC CTTCCTTGCC CTGTATTTCC TCGCCGCCGC GGCCGGGCTG CCGCTGTGGA CGCGGCTGTC GCAGCGCATC GGCAAGCTGC GCGCCTGGCT CGCCGGCATG GCGCTCGCCG TGGCGGTGTT CGCCTGGGCA GGCCTGCTCG GCAGCGGCGA TCTGTTCGCC TATGCCTGCA TCTGCGCGCT CTCGGGCCTG GCGCTCGGCG CCGACCTCAC CCTGCCGCCT TCCATGCTCG CGGACCTGCT CGCGCGCGGG CGGGACGGGC GTGCGCCGGC CCGCTCGCGC GCGCTCGAGG CGGGTGCCTG CTTCGGCTGG TGGAGCTTCG TCACCAAGGC CAACCTGGCG CTCGCCGCCG GCCTGGCGCT GCCCCTGCTC GCGCTGCTCG GCTACGCCCC GGGCGCGCGC GAGCCCACTG CGGTCGCCGC GCTCGGGGCG GTGTACGGCT TCGCACCGGT CGCTCTCAAG CTCGCCGCCA TCGCCCTGCT CTGGCACGGG CGAGCCGTGC TCGATCCCGG CACGGGACAC GACCCGCCCG GGATCGCGAC TTGCGTCGCT CCCACAAGCG TCGACCGGCG CCGCACTGGA AACGACGGCG GGTGA
|
Protein sequence | MSDRARAAPD RHHGGGACAR AACAPGAGRA PDADRVEWGA LLAYGALGLP LAFAALPIYV HVPRLYAEGL GLSLALVGAV LLAARVVDAI TDPLIGWASD RLPRRRLWIA LALPALGAGM LGLLVPPAGA GAGWLFALLV AVSLAYSVAS IAYNAWGAEV ASTPAARTRF VASREAFALA GVVLAAALPG LLGEGVIGGT AGVGQSAVAT EGGAAGGVGA SGGGGDGGAA GLARLAWLFL PLLVVFGLWT LWRAPAPPRL AATHAPVWRG LRAALADAAF LRLLAVFAVN GIAAAIPSAT VLFFVADVLQ AEALAGAFLA LYFLAAAAGL PLWTRLSQRI GKLRAWLAGM ALAVAVFAWA GLLGSGDLFA YACICALSGL ALGADLTLPP SMLADLLARG RDGRAPARSR ALEAGACFGW WSFVTKANLA LAAGLALPLL ALLGYAPGAR EPTAVAALGA VYGFAPVALK LAAIALLWHG RAVLDPGTGH DPPGIATCVA PTSVDRRRTG NDGG
|
| |