Gene Mpe_A1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1812 
Symbol 
ID4786811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1952323 
End bp1954356 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content71% 
IMG OID640090383 
ProductDNA ligase (NAD+) 
Protein accessionYP_001021006 
Protein GI124267002 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACC GCGCGGAAGA TCCGGCCGCC CGCGCCGCGC AACTGCGCGA GCAGCTCGAG 
TACCACGCCC ACCGCTACTA CGTGCTCGAC GCGCCGGAGA TCCCGGACGC CGAGTACGAC
CGCCTGTTCA CCGAACTGCA GGCGCTGGAG GCGGCGCACC CCGGGTTGCG GACGCCGGAC
TCGCCGACCC AGCGCGTCAT CGGCGCGGTG CTCGAAGGCC TGTCGGCGGT GCGGCACGCG
GTGCCGATGC TGTCGATCAA GACCGAGACC GACACCACGC CCACCGGTGC GCTGAAGTTC
GACGCCGCGG TGCGCAACGC ACTGAAGCTG CCGCCCGACG CGCCGCCGCT GCGTTACGCC
GCCGAGCTGA AATTCGACGG CTTGGCGATC AACCTGCGCT ACCAGGCCGG CCGGCTGGTA
CAGGCCGCCA CGCGCGGCGA CGGCGAGACC GGCGAGGACG TGACGCACAC CGTCGGCACG
ATCGAGTCGG TGCCGAAGCA GCTGCGCGGC ATCACGGCGC CGGTGCTAGA GGTGCGCGGC
GAGGTCTTCA TGCGGCGAGA CGACTTCGAG GCGCTCAACG AGCGCCAGCG CGAGGCCGGG
CTCAAGACCT TCGTGAACCC GCGCAATGCG GCGGCCGGCA TCGTGCGCCA GCTCGACGCC
AGCATCGCAC GCCAGCGGCC GCTGAGCTTC TTCGCTTATG GACTGGGCGA CGTGCAGGGC
TGGGACGTGC CGCCCACGCA CGCCGGGTTG CTGGACGCGC TGGCGGCGCT GGGGCTGCCG
GTCGACGCGC ACCGCACCGT GGTCGAGGGC GGCGAGGCGC TGGCCGCCTT CCACGCCGGC
ATCGCGGCTG AGCGCGACGC GCTGCCGTTC GACATCGACG GTGTGGTCTA CAAGGTCGAC
GAGCGTGCGC TGCAGCAGCA GCTCGGCTTC AAATCGCGCG AGCCGCGCTG GGCGGTGGCG
CACAAGTACC CGGCGCAGGA GCAGTCGACC CAGCTGGCCG GCATCGAGAT CCAGGTCGGC
CGCACCGGCA AGCTCACGCC GGTGGCCAAG CTGCAGCCGG TGTTCGTGGG CGGCACGACG
GTGAGCAATG CCACGCTGCA CAACCGCTTC GAGCTGCGCC GCAAGGGCAT CCGCATCGGC
GACACGGTGA TCGTGCGGCG CGCGGGCGAC GTGATCCCCG AGGTGGTGGG TCGCGTGCCC
GTCCCGCGAA CGGCGTACAT CCCCAACTTC CGCATGCCGC GCGCCTGCCC GGTGTGCGGC
AGCCAGGCGC TGCGCGAGCG CGGCAGCGTC GACTACCGCT GCTCGGGCGG CCTGTTCTGC
GCTGCACAGC GCAAGCAGGC GCTGCTGCAT TTCGCCGGGC GGCGCATGAT GGACATTGAG
GGACTGGGCG ACAAGCTGGT CGAGCAGCTT GTCGACGGCG GCATCATCCG CACGCTGCCG
GAGCTCTACA GGCTCGGCGT GGCCAAGCTC GTCGCGCTGG AGCGCATGGG CGACAAGAGC
GCGGCCAACC TGGTGGCAGC GCTGGAGGCG AGCAAGGCCA CGACGCTGGC GCGCTTCCTG
TTCTCGCTGG GCATCCGCCA CATCGGCGAG GCGACCGCCA AGGACCTGGC GCGACATTTC
GGCGCGCTCG ACCGCGTGAT GGACGCGAGC GTCGAGCAAC TGCTGGAGGT CAACGACGTG
GGCCCGGTGG TGGCGCAGAG CCTGCGTACC TTCTTCGATC AGCCGCACAA CCGCGAGGTG
GTCGAGCAGT TGCGCGCCGC CGGCGTGCAC TGGGACGAGC ACTCCGGCGA GGCCGACCTC
ACGCCCAGGC CTCTGGCCGG CAAGACCTTC GTGTTGACCG GCACGCTGCC GAGCCTGGGG
CGCGAGGCCG CCAAGGAGCT GATCGAGGCC GCCGGCGGCA AGGTGGCCGG CTCGGTGTCG
AAGAAGACCG ACTACGTGGT GGCCGGCGAG GAAGCCGGCA GCAAGCTCGA GAAGGCTCAG
GCACTGGGTG TGGCCGTGAT CGACGAGGCA GCGCTGCGCG CGCTGCTGGA CTGA
 
Protein sequence
MSDRAEDPAA RAAQLREQLE YHAHRYYVLD APEIPDAEYD RLFTELQALE AAHPGLRTPD 
SPTQRVIGAV LEGLSAVRHA VPMLSIKTET DTTPTGALKF DAAVRNALKL PPDAPPLRYA
AELKFDGLAI NLRYQAGRLV QAATRGDGET GEDVTHTVGT IESVPKQLRG ITAPVLEVRG
EVFMRRDDFE ALNERQREAG LKTFVNPRNA AAGIVRQLDA SIARQRPLSF FAYGLGDVQG
WDVPPTHAGL LDALAALGLP VDAHRTVVEG GEALAAFHAG IAAERDALPF DIDGVVYKVD
ERALQQQLGF KSREPRWAVA HKYPAQEQST QLAGIEIQVG RTGKLTPVAK LQPVFVGGTT
VSNATLHNRF ELRRKGIRIG DTVIVRRAGD VIPEVVGRVP VPRTAYIPNF RMPRACPVCG
SQALRERGSV DYRCSGGLFC AAQRKQALLH FAGRRMMDIE GLGDKLVEQL VDGGIIRTLP
ELYRLGVAKL VALERMGDKS AANLVAALEA SKATTLARFL FSLGIRHIGE ATAKDLARHF
GALDRVMDAS VEQLLEVNDV GPVVAQSLRT FFDQPHNREV VEQLRAAGVH WDEHSGEADL
TPRPLAGKTF VLTGTLPSLG REAAKELIEA AGGKVAGSVS KKTDYVVAGE EAGSKLEKAQ
ALGVAVIDEA ALRALLD