Gene Mpe_A2055 details


Gene Information

Locus tag: Mpe_A2055
Symbol:
ID: 4784632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2195062
End bp: 2196594
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 70%
IMG OID: 640090625
Product: putative alpha-deoxyribodipyrimidine photolyase-related protein
Protein accession: YP_001021248
Protein GI: 124267244
COG category: [R] General function prediction only
COG ID: [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase
TIGRFAM ID:
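
The coordinate and length fields above are mutually consistent. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Both are assumptions that fit the numbers in this record, not something the record states. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2195062, 2196594)
print(length, protein_length_aa(length))  # 1533 510
```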


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.527245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCACC TGGTCGTGGT GCTGGGCGAT CAGCTGGATC TGCACGCCGC CGCGTTCGAC 
GGCTTCGACG CCGCACGCGA TGCGGTGTGG ATGGCCGAGG TGGCCGAGGA ATCGGCGCAT
GTGTGGTCGA GCCAACCGCG CACGGTGATG TTCCTCGCCG CGATGCGCCA CTTCGCGCTC
GCCCTGCGTG CGGCTGGGCG AGCCTTGCAT TACGCCGCGC TGGACGATCC CGCCACGCAC
GGCAGCCTGC AGGCGCAGCT GCACGCCGAC ATCGAACGCC TGCGCCCCAC CGGCCTCGTG
ATGACGGCCC CAGGCGACTG GCGCGTGCTG CAGGCGATCA AGTGCGTGGC GCAGGCGCAG
GGCCTGCCGC TCGAGATCCG CGAGGACCGC CATTTCTACG GCAGCGTGCG CGAGTTCGCG
GCCCACGCGC GCGGCCGCAA GGCGCTACGC ATGGAGTACT TCTACCGCGA GATGCGCAAG
CGCCACGGCG TGTTGATGCA CGGAGACGAG CCCGAGGGCG GCCAGTGGAA TTTCGATGCC
GACAACCGCG AGGCCTTCGG TGCCGCCGGC CCGGGCGCCG TGCCGCCGCG TGCGGTGTTC
GAGCCCGATG CGCTCACGCG CGAGGTGATC GCGCTGGTCG AGAAGCGCTT CGGCACCCAT
CCCGGCCGAC TCGATGGCTT TGCCTGGCCG GTGACGCGTG CACAGGCACT CGTCGCGCTG
CAGCGCTTCA TCACCGAGCG CCTGCCGCTG TTCGGCCGCT ACCAGGACGC CATGTGGCCC
GGCGAACCCT GGCTGTACCA CGCCCACCTG GCGGCCGCGC TGAACCTGAA GCTGCTGAAC
CCGCGCGAGG TCGTGGACGC GGCCGTAGCG GCCTACCGCG GGGGCGCCGC GCCGCTGGCC
GGCGTGGAAG GCTTCGTGCG CCAGATCCTC GGCTGGCGCG AGTACGTGCG CGGCATCTAC
TGGACCCGCA TGCCGGGCTA TGCCGAGCTC AATGCACTGG ACGCCCGCGA AGACCTGCCC
GCCTGGTACT GGACCGGCGA CACCGAGATG GCCTGCCTGC GCGACGCGAT CACGCAGACG
CTGCAGCACG GCTACGCGCA CCACATCCAG CGCCTGATGG TCACCGGCCT GTACGCGCTG
ATGCTGGGCG TGCAGCCGCA GCAGGTGCAC GCCTGGTACC TGGCGGTGTA CGTGGATGCC
GTCGAATGGG TGGAACTGCC CAACACCCTG GGCATGAGCC AGTACGCGGA CGGCGGGCTG
ATGGGCAGCA AGCCCTACGT GGCGACCGGC AAGTACATCC AGCGCATGAG CCCGCACTGC
CAGGGCTGCC GCTACGACCC CGCCCAGCGC ACGGGCGAGA AGGCCTGCCC GTTCACCACG
CTGTACTGGG ATTTCCTGAT GCGCCACGAG GAGACGCTGG CCGGCAATCC GCGCATGGCG
CTGCAGGTGA AGAACGTGGC GCGTCTGGAC GACGCGCAGA AGCAGGCGAT CCGCGAGCGC
GCAGTGGCTA TCCGCAACGG CGCGGTCGCG TGA
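
The GC content field can be recomputed from the gene sequence once the 10-base display blocks are joined. The following is a minimal sketch. The helper names `clean` and `gc_percent` are hypothetical. It also assumes the reported 70% is a rounded percentage of G and C bases over the full gene length, which the record does not state.

```python
def clean(seq: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First display line of the gene sequence above; paste the full
# sequence here to compare against the reported GC content.
first_line = "ATGCGCCACC TGGTCGTGGT GCTGGGCGAT CAGCTGGATC TGCACGCCGC CGCGTTCGAC"
print(round(gc_percent(first_line), 1))
```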
 
Protein sequence
MRHLVVVLGD QLDLHAAAFD GFDAARDAVW MAEVAEESAH VWSSQPRTVM FLAAMRHFAL 
ALRAAGRALH YAALDDPATH GSLQAQLHAD IERLRPTGLV MTAPGDWRVL QAIKCVAQAQ
GLPLEIREDR HFYGSVREFA AHARGRKALR MEYFYREMRK RHGVLMHGDE PEGGQWNFDA
DNREAFGAAG PGAVPPRAVF EPDALTREVI ALVEKRFGTH PGRLDGFAWP VTRAQALVAL
QRFITERLPL FGRYQDAMWP GEPWLYHAHL AAALNLKLLN PREVVDAAVA AYRGGAAPLA
GVEGFVRQIL GWREYVRGIY WTRMPGYAEL NALDAREDLP AWYWTGDTEM ACLRDAITQT
LQHGYAHHIQ RLMVTGLYAL MLGVQPQQVH AWYLAVYVDA VEWVELPNTL GMSQYADGGL
MGSKPYVATG KYIQRMSPHC QGCRYDPAQR TGEKACPFTT LYWDFLMRHE ETLAGNPRMA
LQVKNVARLD DAQKQAIRER AVAIRNGAVA
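
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses only the Python standard library and builds the codon table by hand. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first protein block.
print(translate("ATGCGCCACC TGGTCGTGGT GCTGGGCGAT"))  # MRHLVVVLGD
```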