Gene Information |
Locus tag | Tmz1t_1114 |
Symbol | |
ID | 7084643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1216364 |
End bp | 1217659 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698129 |
Product | polysaccharide export protein |
Protein accession | YP_002354769 |
Protein GI | 217969535 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
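
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tmz1t_1114.
length_bp = gene_length(1216364, 1217659)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the sketch gives 1296 bp and 431 aa, matching the Gene Length and Protein Length fields.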
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0366205 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGCT TCACCACCGA CCGGATCGCC CACTGCGGCG CATCCAATGG CGCCGTGCGC GCCACCGACG CCAGTGGCCT TGACGCCCGC GCGGCCGGTC CCGCCACCCG CGCGGACGCC ATCCGCAGCG GCGGTCTCGT CCGCACGGCC GTGATCGCCT TCGCCGCCGC AGCCATCGGC GGCTGCACCC TCATCCCCGG CACCAGCGGC AGCCTCATGC GCGAGGAGTC CAGCGTGCCG CTGCCGGTCA CCGAGGGCAC CGACACCGTC CCGGCCAACG TCAAGCTCAA GCCCATCACC GCCGAGCTGA TCATCGAACA GCACCAGGCC AAGGAGTCGC GCTTCCGCAG CGGCAAGGCC GGCGGCCAGA CCGCCACCCG CGCGAGCGAT CCCAAGAAGA TCCCCGGCTT CGACTACCAG GACTACAAGC TCGGCCCCGG CGACATCATC AACGTCATCG TCTGGGACCA CCCCGAGATC ACCATCCCGG CCGGCTCCTA CCGCTCGGCC GAGCAGTCCG GCACCCTGGT CGCCGAAGAC GGCACCATCT TCTTCCCCTT CGCCGGCGTG GTGAAGGTCG CCGGCCTCAC CACCCGCGAG GTGCGCCAGG TCCTCGCCAA GCGCATGGCC AGCGTCATCG AAAACGTCCA GCTCGACGTG CGCATCGTCT CCTTCCGCAG CAAGCGCGTG TATGTCGTCG GCGAAGTCGC CAAGCCCGGC CTGCAGCCGA TCGACGACAT CCGCATGACC CTGGTCGAGG CCATCAACCG CGCCGGCAAC ATCACCGAAG AGGCCGACCA CGGCAACGTG CTGCTCACCC GCAACGGGCA GACCTGGCGG GTCGACCTGC AGGCGCTGTA CGAAGAGGGC GACGTCAGCC AGAACGTGCT GCTGCAGCCG GGCGACATCA TCAACGTGCC CGACCGCCAG CTCAACAAGG TCTTCGTGCT CGGCGAGGTG CGCAACCCGG GCTCGTTCGT CATGAACAAG CGCCGCACCA CGCTCGCCGA GGCGCTGTCC GACGCCGGCT TCGTCAACCA GAGCACGTCC GACCCGGCCT GGGTGTATGT GATGCGCAGC GACAACGGCA GCGCCGAGCT CTTCCACCTC AACGCCCGCT CCCCCGACGC GCTGCTGCTG GCCGAGCGCT TCCCGCTGCT GCCGCGGGAC GTGGTGTATG TCGATGTGGC GGCGATCGCG CGCTGGAACC GCGTGGTCAG CAACATCCTG CCGACCTCGC AGATGCTGCA GCTCACCAGC GAGACGCGGT ATCCGCTGTT CGGCGGCCGG CAGTAA
|
Protein sequence | MSRFTTDRIA HCGASNGAVR ATDASGLDAR AAGPATRADA IRSGGLVRTA VIAFAAAAIG GCTLIPGTSG SLMREESSVP LPVTEGTDTV PANVKLKPIT AELIIEQHQA KESRFRSGKA GGQTATRASD PKKIPGFDYQ DYKLGPGDII NVIVWDHPEI TIPAGSYRSA EQSGTLVAED GTIFFPFAGV VKVAGLTTRE VRQVLAKRMA SVIENVQLDV RIVSFRSKRV YVVGEVAKPG LQPIDDIRMT LVEAINRAGN ITEEADHGNV LLTRNGQTWR VDLQALYEEG DVSQNVLLQP GDIINVPDRQ LNKVFVLGEV RNPGSFVMNK RRTTLAEALS DAGFVNQSTS DPAWVYVMRS DNGSAELFHL NARSPDALLL AERFPLLPRD VVYVDVAAIA RWNRVVSNIL PTSQMLQLTS ETRYPLFGGR Q
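
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, assuming plain Python with no external libraries, of how the blocks could be joined and how the gene sequence could be checked against the GC content and protein sequence fields. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not model.

```python
# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the block spacing used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    s = ungroup(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = ungroup(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
first_blocks = "ATGAGTCGCT TCACCACCGA"
print(ungroup(first_blocks))
print(translate(first_blocks))
print(gc_fraction(first_blocks))
```

Applied to the first two blocks, the translation begins MSRFTT, which agrees with the start of the listed protein sequence. Running the same helpers on the full gene sequence would allow the 69% GC content and the 431-residue protein to be checked in the same way.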