Gene Mpe_A2974 details


Gene Information

Locus tag: Mpe_A2974
Symbol:
ID: 4783578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3162745
End bp: 3163953
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 71%
IMG OID: 640091545
Product: NADH dehydrogenase
Protein accession: YP_001022162
Protein GI: 124268158
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
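
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3162745, 3163953)
protein = protein_length_aa(length)
print(length, protein)
```

Under these assumptions, the computed values agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields.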


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA AGATCGTGAT CGCAGGCTCG GGCTTCGCCG GGACGTGGGC GGCGCTGTCG 
GCCGCTCGGG CCGTGTCGCT GGCCGGTCGG GAGCGCGACG TGGAGATCCT CGTCGTGTCG
CCCACGCCCC ATCTGCACAT CCGCCCGCGC CTCTACGAGA CCGCGTTCGA GGAGATGGCG
CCGGACCTTG CGCCGTTGTT CGCGGCGGTC GGTGTCCGCC ACCTTGCCGG GGTGGTGGAC
GCCGTCCACA CGGAGCGGCA CGAGGTGGAC GTGACCGGCG CCGACGGCCG GCGCAGCACG
CTGCCCTACG AGCGCTTCGT GCTGGCGACC GGCAGCCGGC TGTTCCTGCC TGACGTGCCC
GGTCTCGCCG AGCACAGCTT CAATGTCGAT CAACTGGCCA GCGCGATGAC CCTCGACGCC
CATCTGCGCT CGCTGGCCGA GCGGCCGGAG ACGGCCGCGC GCAACACGGT CGTGGTGGCC
GGTGGCGGCT TCACCGGCAT CGAGACCGTG ACCGAGATGC CGCGGCGCTT GCGCGACATC
CTCGGTCCGG GCGCGAAGAT CCGTGTCGTG GTGGTGGAGC AGGCACCGGT CATCGGTCCG
GATCTCGGGC CGGTGCCTCG CCCGGTGATC GAGTCGGCCC TGGCCGAGTG CGGCGTGGAG
GTGAGGACCT CCGCGGGGGT CGTGGCGATC GACACCGACG GCGTGACCCT CTCCAGCGGC
GAACGCATCG CGACCCACAC CGTGGTGTGG ACCGCCGGTG CCCGTGCCCA CCCGCTGGCC
GCGCAGATCG AGGGCGAACA CGATCGCTAC GGGCGCGTGC ACGCCGACCC GCAGTTGCGG
GCTCGCAGCG TGCCGGACGT GTTCGTCACC GGCGATGTCG CGCTGGCGGC CACCGACGAC
GAGGGCCATC ACGCGGCGAT GTCCTGCCAG CACGCGCTGA GCCTGGGCCG CGTGGCGGGC
CACAACGCTG CCGCCGAACT GGTCGGCCTG CCCACCCACC CCTACAGCCA GCCGAAGTAC
GTGACCTGTC TCGATCTGGG GCCCTGGGGC GCCCTCTTCA CCGAAGGCTG GGACCGCCAG
ATCAAGCTGA CGCGCGAGGT CGGCAAGGCC ACGAAGCAGA CGATCAACAC GCAGTGGATC
TATCCGCCCC AGGCGGATCG CGAGGCCGCC TTCGCGATTG CCAACCCCGA CCACGTCATC
GTGCCGTGA
 
Protein sequence
MTQKIVIAGS GFAGTWAALS AARAVSLAGR ERDVEILVVS PTPHLHIRPR LYETAFEEMA 
PDLAPLFAAV GVRHLAGVVD AVHTERHEVD VTGADGRRST LPYERFVLAT GSRLFLPDVP
GLAEHSFNVD QLASAMTLDA HLRSLAERPE TAARNTVVVA GGGFTGIETV TEMPRRLRDI
LGPGAKIRVV VVEQAPVIGP DLGPVPRPVI ESALAECGVE VRTSAGVVAI DTDGVTLSSG
ERIATHTVVW TAGARAHPLA AQIEGEHDRY GRVHADPQLR ARSVPDVFVT GDVALAATDD
EGHHAAMSCQ HALSLGRVAG HNAAAELVGL PTHPYSQPKY VTCLDLGPWG ALFTEGWDRQ
IKLTREVGKA TKQTINTQWI YPPQADREAA FAIANPDHVI VP
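
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a stop-codon check; the function names are hypothetical, and the stop codons listed are those of translation table 11. Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence above (illustrative fragment only).
first_line = "ATGACCCAGA AGATCGTGAT CGCAGGCTCG GGCTTCGCCG GGACGTGGGC GGCGCTGTCG"
last_line = "GTGCCGTGA"
fragment = clean_sequence(first_line + "\n" + last_line)

print(len(fragment), ends_with_stop(fragment), round(gc_fraction(fragment), 2))
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field in the record.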