Gene Mpe_A0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0294 
Symbol 
ID4786903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp319055 
End bp319999 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content70% 
IMG OID640088846 
Productputative thioredoxin protein 
Protein accessionYP_001019491 
Protein GI124265487 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01068] thioredoxin 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.918989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0369237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACA TCACCCTCCA GAACTTCGAA GCCGAGCTGA TCCAAGCGTC GATGCAGACC 
CCCGTGCTGC TCGACATCTG GGCGCCGTGG TGCGGACCGT GCAAGTCGCT CGGCCCGGTG
CTGGAGAAGC TGGAAGCGGA TTACGCGGGT CGCTTCGCGC TGGCCAAGCT CAACAGCGAC
GACCAGCCCG ACATCGCAGG CCAGTTGAGC CAGGCTTTCG GCGTGCGCTC GATCCCGTTC
TGCGTGATGT TCGTCGGCGG CCAGCCGGTC GACGGCTTCG TGGGCGCGCT GCCGGAGGCG
CAGATCCGCA GTTTCCTCGA CAAGCATGTG CCGAGCGAAG ACGCACTGGC GGCGGAAGAG
GAGGCCCTGG AGGCCGAGCA ACTGGCCGCC GAGGGCGACA ACGATGCCGC GCTCGCCAAG
CTGTCCGACG CCCTGGCGAT CGCGCCGGGC GACGACGCGA TCCGCGCTGA CTACGTGAAA
CGCCTGCTGG AGGCCGGCCG CACCGCCGAC GCGCGCCGCG TGTACGAGCC GCTGGCGCCG
AAGGCGATCG TCGACGCACG CGCCAGCGCG CTGGGCCTTT GGCTCGACGC CTGCGAGGCA
GCCGAGCGGG CCCGTTCGCC GGAGGCGCTG GCCGCGGCGA TCGGTGCCGA CAGGCGCGAC
TTCGCGGCGC GCTTCGAGCT GGCGCAGACG CTGCTCGCCG CCCAGCGGCC GACCGAAGCG
ATGGACGAAC TGCTCGAGAT CCTGATGCGC GACAAGGCCT GGTCCGACGA GCGTGCGCGC
AAGCTCTATG TCGCCATCCT CGAGCTGCTG AGCAAGCCTC CGCCGAAGGT CGCCTCGCCT
GCCGAGGCCA AGGGAACACT GGAGATCGCC GGCAAGGCCG CCGCCGTGGC CAGCGACCCG
GTGATCGACG GCTACCGCCG CAAGCTCAGC ATGGTGCTGT TCTGA
 
Protein sequence
MIDITLQNFE AELIQASMQT PVLLDIWAPW CGPCKSLGPV LEKLEADYAG RFALAKLNSD 
DQPDIAGQLS QAFGVRSIPF CVMFVGGQPV DGFVGALPEA QIRSFLDKHV PSEDALAAEE
EALEAEQLAA EGDNDAALAK LSDALAIAPG DDAIRADYVK RLLEAGRTAD ARRVYEPLAP
KAIVDARASA LGLWLDACEA AERARSPEAL AAAIGADRRD FAARFELAQT LLAAQRPTEA
MDELLEILMR DKAWSDERAR KLYVAILELL SKPPPKVASP AEAKGTLEIA GKAAAVASDP
VIDGYRRKLS MVLF