Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0294 |
Symbol | |
ID | 4786903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 319055 |
End bp | 319999 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640088846 |
Product | putative thioredoxin protein |
Protein accession | YP_001019491 |
Protein GI | 124265487 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | [TIGR01068] thioredoxin |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.918989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0369237 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACA TCACCCTCCA GAACTTCGAA GCCGAGCTGA TCCAAGCGTC GATGCAGACC CCCGTGCTGC TCGACATCTG GGCGCCGTGG TGCGGACCGT GCAAGTCGCT CGGCCCGGTG CTGGAGAAGC TGGAAGCGGA TTACGCGGGT CGCTTCGCGC TGGCCAAGCT CAACAGCGAC GACCAGCCCG ACATCGCAGG CCAGTTGAGC CAGGCTTTCG GCGTGCGCTC GATCCCGTTC TGCGTGATGT TCGTCGGCGG CCAGCCGGTC GACGGCTTCG TGGGCGCGCT GCCGGAGGCG CAGATCCGCA GTTTCCTCGA CAAGCATGTG CCGAGCGAAG ACGCACTGGC GGCGGAAGAG GAGGCCCTGG AGGCCGAGCA ACTGGCCGCC GAGGGCGACA ACGATGCCGC GCTCGCCAAG CTGTCCGACG CCCTGGCGAT CGCGCCGGGC GACGACGCGA TCCGCGCTGA CTACGTGAAA CGCCTGCTGG AGGCCGGCCG CACCGCCGAC GCGCGCCGCG TGTACGAGCC GCTGGCGCCG AAGGCGATCG TCGACGCACG CGCCAGCGCG CTGGGCCTTT GGCTCGACGC CTGCGAGGCA GCCGAGCGGG CCCGTTCGCC GGAGGCGCTG GCCGCGGCGA TCGGTGCCGA CAGGCGCGAC TTCGCGGCGC GCTTCGAGCT GGCGCAGACG CTGCTCGCCG CCCAGCGGCC GACCGAAGCG ATGGACGAAC TGCTCGAGAT CCTGATGCGC GACAAGGCCT GGTCCGACGA GCGTGCGCGC AAGCTCTATG TCGCCATCCT CGAGCTGCTG AGCAAGCCTC CGCCGAAGGT CGCCTCGCCT GCCGAGGCCA AGGGAACACT GGAGATCGCC GGCAAGGCCG CCGCCGTGGC CAGCGACCCG GTGATCGACG GCTACCGCCG CAAGCTCAGC ATGGTGCTGT TCTGA
|
Protein sequence | MIDITLQNFE AELIQASMQT PVLLDIWAPW CGPCKSLGPV LEKLEADYAG RFALAKLNSD DQPDIAGQLS QAFGVRSIPF CVMFVGGQPV DGFVGALPEA QIRSFLDKHV PSEDALAAEE EALEAEQLAA EGDNDAALAK LSDALAIAPG DDAIRADYVK RLLEAGRTAD ARRVYEPLAP KAIVDARASA LGLWLDACEA AERARSPEAL AAAIGADRRD FAARFELAQT LLAAQRPTEA MDELLEILMR DKAWSDERAR KLYVAILELL SKPPPKVASP AEAKGTLEIA GKAAAVASDP VIDGYRRKLS MVLF
|
| |