Gene Mpe_A3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3803 
Symbol 
ID4785972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp4021802 
End bp4023448 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content71% 
IMG OID640092386 
Productputative choline dehydrogenase lipoprotein oxidoreductase 
Protein accessionYP_001022991 
Protein GI124268987 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0363471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG ACGTCACCCC CACCTTCGAT CACGTGATCA TCGGCGGCGG CACGGCCGGC 
TGCCTGCTGG CCAACCGCCT GAGCGCCGAT CCGGCCAAGC GCGTGCTGCT GCTCGAGGCC
GGTGGCCGCG ACGACTACCA CTGGATCCAC ATCCCGGTCG GCTACCTGCA CTGCATCGGC
AACCCGCGCA CCGACTGGCT CTACCAGACC GAGCCCGATC CGGGGCTGAA CGGCCGCTCG
CTGCGCTACC CGCGCGGCAA GGTGCTGGGC GGCTGCTCGA GCATCAACGG CATGATCTAC
ATGCGCGGCC AGTCGCGCGA CTACGACCAC TGGGCGGCGG TGACCGGCGA CGATGGCTGG
CGCTGGGACG CCTGCCTGCC GGTCTTCAAG CAGCACGAGG ACCACCACGG TGGCGCCGAC
GAGATGCATG GCGCCGGCGG CGAATGGCGG GTCGAGAAGC AGCGCCTGCG CTGGGACGTG
CTGGAGGCGT TCGCGTTGGC CGCGCAGCAG GCCGGCATCC CGGCGAGCGC CGACTTCAAC
CGCGGCAACA ACGAGGGTGT GGGCTACTTC GAGGTCAATC AGCGCGCCGG CTGGCGCTGG
AACACGGCCA AGGCCTTCCT CCGACCGACC TGCTATGGCC GGCCCAACTT CGAGATGTGG
ACCGGCGTGC AGGTGACCCG GCTGCTGATC GAGCGCGGTG CCGACGGCGC GCTGCGCTGC
ACCGGCTGCG ATGTGCAGAC GCCCCACGGC CCCGAGACCG TCCGGGCCAG TGCCGAGGTG
CTGCTCAGCG CCGGTGCGGT GGGTTCGCCG CAGCTGCTGC AGCTGTCGGG CCTGGGACCG
GCGGGGCTGC TGCAGCAGTA CGGCATCGCG GTGGCGCAGG ACCTGCCCGG CGTCGGCGAG
AACCTGCAGG ACCATCTCCA GATTCGCGCC GTCTTCAAGC TGCAGGGCGT GCCGACGCTC
AACACGCTGT CGGCGTCGTG GTGGGGCAGG GCCCGCATCG GCCTGGAATA CGCGTTCAAG
CGCAGCGGAC CGATGAGCAT GGCGCCATCG CAGCTCGGTG CCTTCACCCG CTCGTCGCCG
GACCACACGT GGCCCAACCT CGAGTACCAC GTGCAACCGC TGAGCCTGGA GGCGTTCGGT
GAGCCATTGC ATCGCTTCGA CGCCTTCACG GCGAGCGTCT GCAACCTCAA TCCGACCAGC
CGCGGCGCAA TCCGCATCCG CAGCCCGCGC TTCGAGGACG CGCCGCTCAT CGCGCCGCGC
TACCTGTCCA CCGAAGCCGA CCGCCAGGTG GCGGCCGACA GCCTGCGGGT GACGCGGCGC
ATCGCGGCGC AGCCGGCGTT GGCCAAATAC CGGCCCGAGG AGGTGAAGCC TGGCGTGCAG
TTCCAGACCG ACGCCGAGCT GGCGCGGCTG GCCGGCGACA TCGGCACGAC GATCTTCCAC
CCGGTGGGCA CCTGCCGGAT GGGGCGCGAC GACGACCCGC AGGCTGTCGT CGATGCGCAG
CTGCGGGTAC GCGGCGTGGC CGGCCTGCGG GTGGTCGACG CCAGCGTGAT GCCCACGATC
ACCAGCGGCA ACACCAACAG CCCCACGCTG ATGATTGCGG AGAAGGCGGC CCAGTGGATT
CGGGCCGATC ACGGCCACCG GCGTTGA
 
Protein sequence
MNDDVTPTFD HVIIGGGTAG CLLANRLSAD PAKRVLLLEA GGRDDYHWIH IPVGYLHCIG 
NPRTDWLYQT EPDPGLNGRS LRYPRGKVLG GCSSINGMIY MRGQSRDYDH WAAVTGDDGW
RWDACLPVFK QHEDHHGGAD EMHGAGGEWR VEKQRLRWDV LEAFALAAQQ AGIPASADFN
RGNNEGVGYF EVNQRAGWRW NTAKAFLRPT CYGRPNFEMW TGVQVTRLLI ERGADGALRC
TGCDVQTPHG PETVRASAEV LLSAGAVGSP QLLQLSGLGP AGLLQQYGIA VAQDLPGVGE
NLQDHLQIRA VFKLQGVPTL NTLSASWWGR ARIGLEYAFK RSGPMSMAPS QLGAFTRSSP
DHTWPNLEYH VQPLSLEAFG EPLHRFDAFT ASVCNLNPTS RGAIRIRSPR FEDAPLIAPR
YLSTEADRQV AADSLRVTRR IAAQPALAKY RPEEVKPGVQ FQTDAELARL AGDIGTTIFH
PVGTCRMGRD DDPQAVVDAQ LRVRGVAGLR VVDASVMPTI TSGNTNSPTL MIAEKAAQWI
RADHGHRR