Gene Mpe_A2005 details

Gene Information

Locus tag: Mpe_A2005
Symbol:
ID: 4783792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2147972
End bp: 2149972
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 72%
IMG OID: 640090575
Product: putative ATP-dependent DNA helicase-related protein
Protein accession: YP_001021198
Protein GI: 124267194
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases
TIGRFAM ID:
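
The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers and are not stated in the record.

```python
# Values copied from the gene record above.
start_bp = 2147972
end_bp = 2149972
gene_length_bp = 2001
protein_length_aa = 666

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codons = span_bp // 3
predicted_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length matches:", predicted_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: a 2001 bp span gives 667 codons, one of which is the stop codon.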


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.349435
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCCGG CCAACGGCAT GCGCAGCGAA CTGCAGAAGG CGGTGAGTGC GGCATTCGCG 
TTCGGCGGTC CGCTGCAGCG GGCTGATGCG CAGTACCGCG AGCGCGGCGC CCAGTTGGAG
CTCGCCGCAG CCGTGGCCGA TGCGATCGAT GCGCGCGAGG TGCTGGTGGC CGAGGCCGGC
ACAGGCGTCG GCAAGACGTT CGCCTACCTG GTCCCTGCGC TGCTGGCCGG GCATCGCACG
CTGATCAGCA CCGCGACCAA GAGCCTGCAG GACCAGCTTT TCCTGCGCGA TCTGCCGCGG
CTGCGCGAGG TGCTGGCCCT GCCGGTCAGC GTGGCCTTGC TGAAGGGACG CGGCAGCTAC
CTGTGCCTGC AGCGGCTGGT GCTGGCCCGG GAGACGACGG CGCTGCCCGA CCGCTATGCG
GCGCGCACGC TGGCGAAGAT CGAGCAGTGG GCGCAGCGCA CCGTGAGCGG CGACCTGGCC
GAGCTCGAAG GCCTGGATGA GCGCTCGTCG GTGATCCCGC TGGTGACCTC GACGCGCGAG
AACTGCCTCG GCAGCGAATG CCCGGAATAC CGCGGCTGCC ACGTGATGAA GGCCCGCCGC
GAGGCGATGG CGGCCGACCT GGTGGTGGTC AACCACCACC TCTTCTTCGC CGACCTGTCG
CTGCGCGACA CCGGCATGGC AGAGCTGCTG CCTTCCGTGG ACGTGGCGGT GTTCGACGAA
GCGCATCAGC TTGCCGAGGC GGGCGTGGCC TTCCTCGGCA GCACGCTGGG CAGCGCCCAG
GTGGTCGATT TCGCGCGCGA CATGCTGGCC GTCGGGCTGG CGCAGGCGCG CGGCCTGCAG
CCTTGGCAGG ACCTGGCGGC AAGATGCGAC CGCGCGGCGC GCGATCTGCG CCTCGCGGCC
AACGGGCCCC TTCGCGAGGT GCGCGGCAGC CTCAAGCTGC GCTGGGGCGA TCGCGCCGAC
CGGGCCGACT TCATCGAGAC ACTCGCCGCG CTGGGGCAGG CCTGCCGGAG TGCGGCCGAC
GCGCTCGACG TCGTGGCGGA GATGTCACCC GATCTGGCCA AGCTCGCGGA GCGGGCGTCG
ACGCTGGCCG CCCGCGCCGC CGGCTTCGGG CTCGAGGCCG AGCCGGGGCG CGTGCGCTGG
ATCGACCTCA CACCGCAGGG GCTGCGGCTG GTGGAATCCC CGCTCGACGT GCGCGACGCG
ATGCGCGAGC AGATCGCGGC GGCGCCCAAG GCCTGGATCT TCACGTCGGC CACGCTCGGA
GCCGACGAGC GGCTCACCTG GTTCACCGAG CAGGCCGGGT TGGAGACGGC GCGCACGCTG
CGCGTCGACA GCCCCTTCGA CTACCCGATG CACGCGCGGC TCTACGTCCC CACGCGTTTC
CCGAAGCCGA ACGAGCCGGG CCACACGGCG GCGGTTGCAC GCCTGGCGGC GGCCTGCGCG
CGGGCCGCGG GGGGGCGCAC CTTCGTGCTG ACGACGACCC TGCGTGCGCT GTCCGGCGTG
GGCGAGGCGC TGCAGTCCGA CTTCGAGGGC GATGCGGAGC CGATCACCGT GCTGATGCAG
GGCAGCGGGC CCAAGCGCCA GCTGCTGCAG CGCTTCCTCG ACACACCTCG CGCGGTGCTG
GTCGGTTCGC AGAGCTTCTG GGAAGGCATC GACGTGCCCG GCGAGGCACT GCAGTGCGTG
ATCATCGACA AGCTGCCGTT CCCGCCGCCG AACGATCCGC TGGTCGAGGC GCGCGTGAAG
CGGCTCGAGA GCGAGGGGCG CAACCCCTTC AACGACTACT TCGTGGCCGA GGCTGCGGTC
TCGCTGAAAC AGGGGGCCGG CCGGCTGATC CGCAGCGAGA GCGATCGCGG TCTGCTGGTC
GTGTGCGACC CCCGCATGGG CACGATGGGC TATGGACAGC GCATGAGGGC CGCTTTGCCG
CCGATGCAGG TGCTGCGCGA GGAAGCGCAG GCGCTGGCGT GGTTGGCCGA GATATCGAAG
GCGGCGTCGG CCTGGGCTTG A
 
Protein sequence
MSPANGMRSE LQKAVSAAFA FGGPLQRADA QYRERGAQLE LAAAVADAID AREVLVAEAG 
TGVGKTFAYL VPALLAGHRT LISTATKSLQ DQLFLRDLPR LREVLALPVS VALLKGRGSY
LCLQRLVLAR ETTALPDRYA ARTLAKIEQW AQRTVSGDLA ELEGLDERSS VIPLVTSTRE
NCLGSECPEY RGCHVMKARR EAMAADLVVV NHHLFFADLS LRDTGMAELL PSVDVAVFDE
AHQLAEAGVA FLGSTLGSAQ VVDFARDMLA VGLAQARGLQ PWQDLAARCD RAARDLRLAA
NGPLREVRGS LKLRWGDRAD RADFIETLAA LGQACRSAAD ALDVVAEMSP DLAKLAERAS
TLAARAAGFG LEAEPGRVRW IDLTPQGLRL VESPLDVRDA MREQIAAAPK AWIFTSATLG
ADERLTWFTE QAGLETARTL RVDSPFDYPM HARLYVPTRF PKPNEPGHTA AVARLAAACA
RAAGGRTFVL TTTLRALSGV GEALQSDFEG DAEPITVLMQ GSGPKRQLLQ RFLDTPRAVL
VGSQSFWEGI DVPGEALQCV IIDKLPFPPP NDPLVEARVK RLESEGRNPF NDYFVAEAAV
SLKQGAGRLI RSESDRGLLV VCDPRMGTMG YGQRMRAALP PMQVLREEAQ ALAWLAEISK
AASAWA
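
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to produce this record: the `translate` helper is a hypothetical name, and the codon table is the standard one, whose amino acid assignments are the same as those of translation table 11 (the two differ only in permitted start codons). Only the first and last sequence blocks from the listing are used here. The last line of the gene sequence begins in frame because the 33 preceding lines hold 60 bases each.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace; '*' marks a stop codon."""
    cleaned = "".join(seq.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final line of the gene sequence above.
first_block = "ATGAGCCCGG CCAACGGCAT GCGCAGCGAA"
last_block = "GCGGCGTCGG CCTGGGCTTG A"

print(translate(first_block))
print(translate(last_block))
```

The first call yields the opening ten residues of the listed protein, and the second yields its final six residues followed by the stop symbol.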