Gene Information |
Locus tag | Moth_0215 |
Symbol | |
ID | 3831366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 211573 |
End bp | 212673 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828151 |
Product | polysaccharide pyruvyl transferase |
Protein accession | YP_429093 |
Protein GI | 83589084 |
COG category | [S] Function unknown |
COG ID | [COG2327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03609] polysaccharide pyruvyl transferase CsaB |
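
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon. The record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 211573
end_bp = 212673
gene_length_bp = 1101
protein_length_aa = 366


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints: 1101 366
```

Both computed values agree with the Gene Length and Protein Length fields.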
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCAGGG TAGTAATCTC CGGTTATTAC GGTTTTCAGA ACGCCGGGGA CGAGGCGGTC CTCTACAGCA TCGTTAAAGC CTTGCGCTCC CTGGAACCGG ACATAGAGAT CACTGTCCTT TCCCGGCGTC CGGAGCAAAC GGCGGCCTGT TTAAAGGTCC GGGCAGTAGA CCGCTGGCAC CCGGTCCGGG TGGCCGGGGC TATCCGCCGG GCCGACCTGG TTATCAGCGG TGGGGGTAGT TTGTTCCAGG ATGTTACCGG GCCTAAAAGC CTGCTCTATT ACCTGGGTAT AGTGTTGCTG GCCCGGTTAT TGCGGAAGCC GGTAATCGTC TATGCCCAGG GACTGGGCCC TTTGAAACGC CACTGGAGCC GTTGGCTGAC GGGCCGGGTA TTAAACCGCG TCCAGCTTAT CTCCCTGCGG GACAGCGAGT CACGGCGGCT GCTGGAAGAG CTGGGGGTTA CCCGGCCGCC GGTCTATGTA ACGGCCGACC CGGTCCTGGG CCTGGAGCCT GAAAATATGG ACCTGCGGCC CGGCCAGGAT AAATGGGAGC AACTGGAGCT TTCCGGGCCG GTGATCGGGA TTTCAGTGCG CTCCTGGCCG GGGTATGAGG AGTGCTGGCC GTCCCTGGCC AGGGTAGCTG ACGAGCTCGT GGCCGGGGGG TGGCAGGTTT TATTTTTGCC TTTTCACTTT CCCGCTGATG TCGACGCCTG CCGCCAGGTA GCCCGCCTGA TGCACAGTCC GGCAGTTGTT CTACGGGAAA ACCTGGACCT GCCGGCCCTG ATGGGCCTGA TGGGCCGGTT GCAGTTTTTG ATCGGCATGC GTCTCCACGC CCTGATCCTG GCTTCTTTGA TGGGGGTTCC TTTCCTGGCC CTGCCCTATG ACCCCAAAGT GACGGCCCTG GCCAGGATGA TGGAGCAGCC GGTCGCCGGC TTTCTGGCGA GTGTCAGTTA TACCGGCCTG GAAGCAGCCG TCAAACAGGC CCTGGCCGAA CGTGAGGAAA ACGCCCGGCG GGTACAGGCC GCGGTGGCCG AACTGCGGCC TCTGGCCCTG GACACCGCCC GGCTGGTAAT AGAGTATTTG CGGAAAGGAG CAAGGGGCTG A
|
Protein sequence | MARVVISGYY GFQNAGDEAV LYSIVKALRS LEPDIEITVL SRRPEQTAAC LKVRAVDRWH PVRVAGAIRR ADLVISGGGS LFQDVTGPKS LLYYLGIVLL ARLLRKPVIV YAQGLGPLKR HWSRWLTGRV LNRVQLISLR DSESRRLLEE LGVTRPPVYV TADPVLGLEP ENMDLRPGQD KWEQLELSGP VIGISVRSWP GYEECWPSLA RVADELVAGG WQVLFLPFHF PADVDACRQV ARLMHSPAVV LRENLDLPAL MGLMGRLQFL IGMRLHALIL ASLMGVPFLA LPYDPKVTAL ARMMEQPVAG FLASVSYTGL EAAVKQALAE REENARRVQA AVAELRPLAL DTARLVIEYL RKGARG
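
The gene sequence begins with TTG, while the protein begins with M. This is expected under translation table 11, where a few alternative codons can act as initiators and are read as methionine in the first position. The sketch below is a minimal illustration, not a full implementation of the table. It assumes the sense-codon assignments of the standard code (which table 11 shares) and restricts the start set to ATG, GTG and TTG for simplicity. `translate_cds` and its parameters are hypothetical names.

```python
# Standard-code amino acids in TCAG order; table 11 uses the same sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplifying assumption: only these three initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, at_cds_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and at_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
head = translate_cds("TTGGCCAGGG TAGTAATCTC CGGTTATTAC")
tail = translate_cds("CGGAAAGGAG CAAGGGGCTG A", at_cds_start=False)
print(head, tail)  # prints: MARVVISGYY RKGARG
```

The two fragments reproduce the first ten and last six residues of the protein sequence given above.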