Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3281 |
Symbol | |
ID | 7874179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3596072 |
End bp | 3597616 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700215 |
Product | polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
Protein accession | YP_002890253 |
Protein GI | 237653939 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.308326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAC TCGTAACACA GATCATCGGC TACCTGCGCG GCATGTGGCG GTTCCGCTGG TGGGGGCTTG CGCTCGCGTG GGTCGCAGGT ATCGCCGGTA GCCTGGCGAT CTACTTCATG CCGGACCACT ACGAGTCGTC CGCACGCATC CACGTGGATA CCCAGTCGGT GCTGCGCCCG CTGATGTCCG GGCTCGCCGT CCAGCCCAAC ATCAGCCAGC AGATCGACTT GCTCAGCCGC ACGCTGATCA GCCGGCCCAA CGTCGAGAAG CTGATCACCA TGGCCGACCT CGACCTCACG GTGCAGAACG CACAGCAGCG CGAGGCCCTG ATCACCTCGG TGACCAGCCG GCTGCGCATC CAGTCCGGCC GCGGCGACAA CCTGTTCACG CTCTCGTACG AAGACGTCGA GCCGCATCGT GCGCAGCGCG TCGTGCAGTC GCTGCTGTCC CTGTTCGTCG AATCCGGTCT CGGCGGCAAG CGTCAGGATA CCGACGCGGC ACGGCGCTTC ATCGAGGACC AGATCCGCAG CTACGAGCAG AAGCTGACCG ACGCCGAGAA CCGGCTCAAG GACTTCCGCC TGCGCAACAT GGCGCTGCTG GGTACCGGCG CGCGCGATTA CGTCACCCAG ATCGCCGAGA CCAACGAGCA ATTGCGCGAA GCACGCCTGG AACTGCGCGA GGCCGAGAAC TCGCGCGACG CGCTCCGCCA GCAACTCTCC GCAGCGCCGT CCGATGCGGG CATCGCGCCG CCGCCGATGG TGGCGACGCC CGAGATCGAT GCCCGCATCG ACCTGCTCAA GAGAAACCTC GACGAGATGT TGCAGCGCTA CACCGAGATG CACCCGGACG TGGTCGGTGC ACGCCGCGTC ATAGAGGATC TCGAGAAGCA GAAGAAGGCG CAGCTGGCCG AGCTGAGCGC GAGCATGGCG GGCAGTACCT TCGGTGCTCC CGGCGCCATG AGTGCGGGGG TCGAGCAGAC CCGGCTCGCG GTGGCGCAGG CCGAGGCCCG GGTCGCGTCG CTGCGTGCGC GGGTGGCCGA GTTCGAGAGC CGCCTGGCGA GCCTCTCGGA GGACGCCAAG CGGATCCCCG AGCTCGAGAC CGAGATGGCG CAGCTCAACC GCGATTACAG CGTCCACAAG AGCAACTACG ACCAGCTGGT ATCGCGGCGG GAGTCGGCGA ACATCGCCGC CGAGATGAGC ACCCAGTCGG GCATCGCCGA CTTCCGCATC ATCGATCCGC CCACCCTGCC CACCAAACCC TCGGCTCCCA ACCGGCTGCT GCTGCTGCCG CTCGCCGCGC TGGTGGGTCT GGGGGCGGGG TTCGCGCTCA CCTTCCTGAT CAGCCAGCTG CGACCGGCGT TCAGCGATGC GCGCCAGCTG CGCGAGATCA CCGGCCTGCC GGTGCTCGGC ACCGTCTCCA TGCTGACCAC TTCGGAGCGT CGTCGTCGTC GCCGCAACGG GCTGTTCGCC TTCGGCAGCG GTCTGGTGGC GTACGCCGGG GTGTTCGCCG CGGTTACGGT CGCGATCGTG TTCCTGCAGA GTTGA
|
Protein sequence | MEELVTQIIG YLRGMWRFRW WGLALAWVAG IAGSLAIYFM PDHYESSARI HVDTQSVLRP LMSGLAVQPN ISQQIDLLSR TLISRPNVEK LITMADLDLT VQNAQQREAL ITSVTSRLRI QSGRGDNLFT LSYEDVEPHR AQRVVQSLLS LFVESGLGGK RQDTDAARRF IEDQIRSYEQ KLTDAENRLK DFRLRNMALL GTGARDYVTQ IAETNEQLRE ARLELREAEN SRDALRQQLS AAPSDAGIAP PPMVATPEID ARIDLLKRNL DEMLQRYTEM HPDVVGARRV IEDLEKQKKA QLAELSASMA GSTFGAPGAM SAGVEQTRLA VAQAEARVAS LRARVAEFES RLASLSEDAK RIPELETEMA QLNRDYSVHK SNYDQLVSRR ESANIAAEMS TQSGIADFRI IDPPTLPTKP SAPNRLLLLP LAALVGLGAG FALTFLISQL RPAFSDARQL REITGLPVLG TVSMLTTSER RRRRRNGLFA FGSGLVAYAG VFAAVTVAIV FLQS
|
| |