Gene Mext_2531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2531 
Symbol 
ID5831546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2839778 
End bp2841181 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content67% 
IMG OID641368332 
Productdeoxyribodipyrimidine photo-lyase 
Protein accessionYP_001639996 
Protein GI163851953 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.695435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCC AACCGGCCCG CATCCGGGTG CTGAACGACG TGAAGCCGCG CGAGGGCGCC 
GGTTACGTCC TGTACCTGAT GCAGCAGGCC ATGCGCGTCC CGTTCAATCC GGCGCTGGAG
CTGGCGATCG AGGAAGCCAA CCGCCTCAAG CTTCCGCTCC TCGTCTGCTT CGGCCTGCTC
GACGGCGCCA ACGGCTTCCC CGAGGCCAAT GCCCGGCACT ACGCCTTCCT GTTGCAGGGG
CTCGCCGACG CAGCCGCTGC TCTGGAAAAG CGCGGCATCG CCTTTCTCCT GCGCCGGGCA
ACGCCGGCTG AGGTCGCCAT CGATCTCTCG GCAGACGCCG CGCTGCTGGT GCTCGACCGC
GGCTACCTCG CGATCCAGAA ACGCTGGTAC GGCGAGATCG AACGGGAGGC GCAATGCCGG
ATCGTTCAGG TCGAGGGCGA TGTGGTGGTG CCGCTCGAGA CTACCTCGAC CAAGCACGAA
TACGCCGCGC GCACCCTGCG CCCGAAGCTC CGGAAGCTCT GGGACGACTA TCTCGACCCC
GTTGAGCCGC GCACGGTCGA TCACCCGGCC GAAGGGCTGA TCCAGCGGCT GAAGCTGAGG
GATGGGCTCG ACGTCTCCGA CCCGGAAAAG CTTCTCGCGA CGCTGACGCT CGACACGACG
GTCGGCCCGG TGAAGCGCTT CCGCGGCGGC TACACCGAGG CCGCCGGGCA TCTGAAGCGC
TTCCTGGAGC ACGCCTTCGC CGGCTACGGG GCCGGGCGCA ACAAGCCGGA GGCCGGGGCG
GCCTCGCATA TGAGCCCCTA CCTGCATTTC GGGCACATCT CGCCGGTCGA GATTGCGCTG
GCGATCCGCG CCGCGAAGGA CGCTGACGAT GACGACCGCT CGGCTTATCT CGAGGAGCTG
ATCGTCCGGC GCGAATTGGC GATGAACCAC GTCTTCCACA CGCAGGGCTA CGACGACTAC
GCCCGCGCCG TGCCCGACTG GGCCCGCAAG ACGCTGGCCG AGCACGCCGA CGACCCGCGC
CCGAAACTCT ATTCCGAGGA GGAGTTGGCG GAGGGGAAGA CCCACGACCA TTACTGGAAC
GTCGCCATGC GCGAGATGCG CGAGACCGGC TACATGCACA ACCAGCTCCG CATGTACTGG
GGCAAGAAGA TCCTCGAATG GTCGCCCTCG CCGGAGGAGG CGTTCGCCCG GACGCTGCGG
CTCAACAACC GCTACTTCCT CGACGGGCGC GACGCGAACT CCTTCACCAA CGTCGCCTGG
ATCTTCGGCC TGCACGACCG CCCGTGGCAG ACCCGCCAGA TTTTTGGAAG CGTGCGCTAT
CAGAGCGAGA ACTCGCTCAG GAAGTTCGAC GCGAAGGGAT ACGAGCGGGC AGTGACACGG
CTGTGCGAGG CGGAAGAAGG CTGA
 
Protein sequence
MAIQPARIRV LNDVKPREGA GYVLYLMQQA MRVPFNPALE LAIEEANRLK LPLLVCFGLL 
DGANGFPEAN ARHYAFLLQG LADAAAALEK RGIAFLLRRA TPAEVAIDLS ADAALLVLDR
GYLAIQKRWY GEIEREAQCR IVQVEGDVVV PLETTSTKHE YAARTLRPKL RKLWDDYLDP
VEPRTVDHPA EGLIQRLKLR DGLDVSDPEK LLATLTLDTT VGPVKRFRGG YTEAAGHLKR
FLEHAFAGYG AGRNKPEAGA ASHMSPYLHF GHISPVEIAL AIRAAKDADD DDRSAYLEEL
IVRRELAMNH VFHTQGYDDY ARAVPDWARK TLAEHADDPR PKLYSEEELA EGKTHDHYWN
VAMREMRETG YMHNQLRMYW GKKILEWSPS PEEAFARTLR LNNRYFLDGR DANSFTNVAW
IFGLHDRPWQ TRQIFGSVRY QSENSLRKFD AKGYERAVTR LCEAEEG