Gene Information |
Locus tag | Tmz1t_3790 |
Symbol | |
ID | 7874032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4176691 |
End bp | 4179102 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700732 |
Product | polysaccharide export protein |
Protein accession | YP_002890756 |
Protein GI | 237654442 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
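
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(4176691, 4179102)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2412 bp) and Protein Length (803 aa) rows.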
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACCA CCGAACATTG CAACCGCCTG CGGAGCGCTG CGAAGGCGCT GATCCTGGGC TTGGCACTGG CTTGCTCGTC CGTGACGATG GCGCAGAACG CCCCCGCTGC AGCGGGCTTC GCGGCGGCGG GCGGTGCTCC GGCTGCCGTG GGGAGCGGCG GGGTGGGCAT GGGGGCCCCT ACCTTGTCCG GTGGCGCGCC GGGTGCGGCC GTCGGTGCCG CCAGCGGTGC TGCGGCATCT GCCGGTGCCG CGGACGGATT CGAGACCCAG GCTGCTCCGA CCGTGATCGG CGGTGAAGGC CTGAGCCCCT CCCAGGATGC CGCGCTCGAC CAGGCGGTGA CGGTCGACGC CGACGGCAAC GACCGCGTGG GCGGCACGGT GCGCGCCGCG CGGGTGCTCA ACGACTTCCA GCAATTCGTG GCGCGCAGCA CCGGGCAGGT GCTGCCGCTG TACGGTTCGG GCTTCTTCGC GGGCTCGCGA GTGTTCAACA GCCCCACCGC GCCGGTCGCC GACGACTACG TGCTCGGCCC GGGCGACCAG GTGCTGGTGC GGATCTGGGG CGCCTTCGAG TCGCAGACCC GGGCGCAGAT CGACCGCAGC GGCATGATCA CGCTGCCGAC CATCGGCCCG GTGAGCCTGG CCGGGGTGCG GATCGCCAAC GCGGTGCAGG TGATCGAGAA TCAGGTGGGC CGCATCTACC GCGACGTGAG CGTCAGCGTG AGCCTGGACC GGGTGCGCGG CATCACCGTG TTCGTGGTGG GCCAGGCGCG GCGTCCGGGC ACCTACACGG TGTCGGGCAA TTCGACCCTG ATCGGCGCGC TGTTCCAGAG TGGCGGCCCG GGGGCCAATG GTTCGCTGCG CCGCGTGCAG GTCAAGCGCG ACAACCAGGT GATCACCGAG ATCGACCTGT ACCGCTTCCT CGCCAACGGC GACACCTCGG CCGACATCCG CCTGGTCGAT GGCGACGTGA TCGTGATCCC CCCGGCGCAC GGCTACGTGG CGCTGACCGG GCAGGTGAAG GCGCCGGCGA TCTACGAGCT CAAGGACCGT ACGGACACGC TGCGCAGCGT GCTGACGGTG TCGGGCGGGC TGCCCGTCGT GGCCGACCCG CGCCTGGCCT TCGTCGAGCG CCTGGACCCG AGCGCGGACC AGCCGCGCTC GGTGTTCGAG GTGTCGCTGC AGCCGGGGCA GCCCGACTTC GTGCTGAAAT CCGGCGACCT GGTCGCGGTG CAGCCCATCC TGGCCGAGTT CGCCAATGCG GTGACGCTGC GCGGTGGGGT GAGCGCACCG GTGCGCCTGC CCTACCGCGC CGGCATGCGG ATCTCGGACC TGATTCCGGA CAAGGCCACC CTGATCAACC GCTACGTGGT GGACAACCAG AACCGCAGCC TGCTCGATCG CGGCAGCTTC GTCGGCGATG TCGGCAACCT CTTCGTCGAC ATCAACATGG ATTACGCGGT GGTCGAGCGG CTCGAGCGCC CGCAGATGGC GCTCAAGCTG ATCCCGTTCA GCCTCAACGG GCTGTTCGCG GACCCCAACG GCCCGGACAA CCTGCGCCTG CAGGCGGGCG ACACGATCTC GATCTTCACC GCGGGCGACG TGCGCGTGCC GGTGAGCCGC CGCCGCGTGG TGATGCGGGT GGAGGGCGAG GTGAACCGCC CCGGGGTGTA TGTGGCCGAG CCGGGCGAGA CCCTGGTGAA CATCATCGAG AAGGCGGGCG GGCCGACCGC GGACGCCTAC CTGTTCGGCG CCGAGTTCTA CCGCGAGTCG GTGCGCAGGT CGCAGCAGGC CAACCTCGAC 
AAGCTGGTGC AGCGCCTGGA GCAGCAGGCA GTTGCCGAGT CGGCGCGGGT TTCCGCCAAC ATCATCGGGG ACGCCCAGGC CGTGGCCCAG GCGCAGGCGC AGCTGCGCGC CGAGCGCGAG GCGCGTGGGC GTTTCCTCGC CCGCATGCGC ACGCTGAAGT CCTCGGGCCG GATGTCGCTC GGCCTGCCGG CGGACGAGCC GAGCTTCGCG CAGATCCCCG GCTTCCGGCT GGAGAACGGC GATCGCCTGG TGATCCCGAA TCGGCCCGAT TTCGTGCAGG TGTTCGGTGC GGTCAATACC GAGTCTGCCC TGCTGTGGCG GCCCAGCCGC ACCGTTTCCG ACTACCTGGA GCAGGCCGGC ATGAGCCGCG AGGGCGACCG CAGCGCGGCC TTCGTGCTGC GCGCCGACGG CACGGTGGTG GCCGAAACGG GCAGTTGGTT CAGCAGCGTG ATGGGGACGA CGGTGCTGCC GGGCGACATC ATCGTGATTC CGGAGCTGAT CGATCGCGAG TCGGGCTGGA CGGCCTTCGC GCGCATTGCC AAGGACTGGA CGCAGATCTT CGCCAACCTC GGCCTCGGTG TGGCCGCGGT GCGCTCGATC GAGAACGACT GA
Protein sequence | MNTTEHCNRL RSAAKALILG LALACSSVTM AQNAPAAAGF AAAGGAPAAV GSGGVGMGAP TLSGGAPGAA VGAASGAAAS AGAADGFETQ AAPTVIGGEG LSPSQDAALD QAVTVDADGN DRVGGTVRAA RVLNDFQQFV ARSTGQVLPL YGSGFFAGSR VFNSPTAPVA DDYVLGPGDQ VLVRIWGAFE SQTRAQIDRS GMITLPTIGP VSLAGVRIAN AVQVIENQVG RIYRDVSVSV SLDRVRGITV FVVGQARRPG TYTVSGNSTL IGALFQSGGP GANGSLRRVQ VKRDNQVITE IDLYRFLANG DTSADIRLVD GDVIVIPPAH GYVALTGQVK APAIYELKDR TDTLRSVLTV SGGLPVVADP RLAFVERLDP SADQPRSVFE VSLQPGQPDF VLKSGDLVAV QPILAEFANA VTLRGGVSAP VRLPYRAGMR ISDLIPDKAT LINRYVVDNQ NRSLLDRGSF VGDVGNLFVD INMDYAVVER LERPQMALKL IPFSLNGLFA DPNGPDNLRL QAGDTISIFT AGDVRVPVSR RRVVMRVEGE VNRPGVYVAE PGETLVNIIE KAGGPTADAY LFGAEFYRES VRRSQQANLD KLVQRLEQQA VAESARVSAN IIGDAQAVAQ AQAQLRAERE ARGRFLARMR TLKSSGRMSL GLPADEPSFA QIPGFRLENG DRLVIPNRPD FVQVFGAVNT ESALLWRPSR TVSDYLEQAG MSREGDRSAA FVLRADGTVV AETGSWFSSV MGTTVLPGDI IVIPELIDRE SGWTAFARIA KDWTQIFANL GLGVAAVRSI END
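
The relationship between the two sequences above can be sketched in code. The example assumes the gene sequence is listed in coding orientation (it begins with ATG even though the Strand row is "-") and uses the amino acid assignments of translation table 11, which match the standard code. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers written for this illustration. Only the first and last few codons of the record are translated here.

```python
from itertools import product

# Standard codon order (T, C, A, G) used in NCBI genetic code listings.
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAATACCA CCGAACATTG C"))  # MNTTEHC
print(translate("GAGAACGACT GA"))            # END
```

The second call shows that the trailing "END" in the protein sequence is three residues (Glu, Asn, Asp) followed by the TGA stop codon, not an end-of-record marker.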