Gene Mpe_A3771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3771 
Symbol 
ID4786000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3989574 
End bp3990698 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content69% 
IMG OID640092354 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001022959 
Protein GI124268955 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0244727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCC TGTCCGCCAA CGTTCCCGAG GGTTGGGCCG CGCCCGCCGA CAAGACCAGC 
CAGACCGATG ACGAACGGAT CGAGGACGTG ATGCCACTGC CGCCGCCCGA GCACCTGATC
CGCTTCTTCC CGGTGCGCGG CACGCCCGTC GAGGAGCTGG TGTCCAGCAC CCGCCGCCGC
ATCCGCGACA TCATGCGCGG CAATGACGAC CGCCTGCTGG TCATCATGGG CCCGTGCTCG
ATCCACGACC CGGTGGCGGC GGTGGACTAC GCGCGCAAGC TCAAGGCCCA GCGCGACAAG
TACGCCGACA CGCTGGAGAT CGTGATGCGC GTGTACTTCG AGAAGCCGCG CACCACCGTC
GGCTGGAAGG GCCTGATCAA TGACCCCTAC CTCGACGAGA GCTTCCGCAT CGACGAAGGC
CTGCGCATTG CGCGCCAGCT GCTGCTGGAG ATCGGCCGCC TCGGGCTGCC GGCCGGCAGC
GAGTTCCTCG ACGTGATCTC GCCGCAGTAC ATCGGCGACC TGATCTCCTG GGGCGCCATC
GGTGCGCGCA CCACCGAAAG CCAGGTCCAC CGCGAGCTCG CCTCGGGCAT CAGCGCGCCG
ATCGGCTTCA AGAACGGCAC CGACGGCAAC ATCAAGATCG CCACGGACGC CATCCAGTCC
GCCAGCCGGC CGCACCACTT CCTGTCGGTG CACAAGAACG GCCAGGTCGC GATCGTCGAG
ACCCGCGGCA ACGCCGATTG CCACGTCATC CTGCGCGGCG GCAAGACGCC GAACTACGAC
GCCAGCAGTG TCGGGGCGGC CTGCGCCGAA CTCGGCAAGG CCGGGCTGCC GGCCTCGCTG
ATGGTCGACT GCTCCCACGC CAACAGCAGC AAGCAGCACC AGAAGCAGAT CGACGTGGCG
CGCGACGTCG CCGACCAGTT GGCCGGCGGC AGCCGCCAGG TCTTCGGCGT GATGGTCGAG
AGCCACCTGA GCGCCGGCGC CCAGAAGTTC AGCGCGGGCA AGGACGACCC GGCGAAGCTC
GCCTACGGCC AGAGCATCAC GGACGCCTGC ATCGGCTGGG ACGATTCACT GGAGGTGCTG
GGCGTGCTCA GCGCCGCGGT GGCGGCCCGG CGCGGACGGG GCTGA
 
Protein sequence
MNALSANVPE GWAAPADKTS QTDDERIEDV MPLPPPEHLI RFFPVRGTPV EELVSSTRRR 
IRDIMRGNDD RLLVIMGPCS IHDPVAAVDY ARKLKAQRDK YADTLEIVMR VYFEKPRTTV
GWKGLINDPY LDESFRIDEG LRIARQLLLE IGRLGLPAGS EFLDVISPQY IGDLISWGAI
GARTTESQVH RELASGISAP IGFKNGTDGN IKIATDAIQS ASRPHHFLSV HKNGQVAIVE
TRGNADCHVI LRGGKTPNYD ASSVGAACAE LGKAGLPASL MVDCSHANSS KQHQKQIDVA
RDVADQLAGG SRQVFGVMVE SHLSAGAQKF SAGKDDPAKL AYGQSITDAC IGWDDSLEVL
GVLSAAVAAR RGRG