Gene TM1040_2281 details


Gene Information

Locus tag: TM1040_2281
Symbol:
ID: 4078465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2398532
End bp: 2400055
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 61%
IMG OID: 638007603
Product: deoxyribodipyrimidine photolyase-related protein
Protein accession: YP_614275
Protein GI: 99082121
COG category: [R] General function prediction only
COG ID: [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase
TIGRFAM ID:
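
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2398532, 2400055)
print(length, protein_length_aa(length))  # 1524 507
```

Under these assumptions the result matches the listed gene length of 1524 bp and protein length of 507 aa.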


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.558867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACGGC TTGTTTTGGT GCTGGGGGAC CAGCTTTCAG AAGGGTTGTC GGCGCTCAAA 
GCGGCAGACC CTGCACATGA TGTGGTGGTG ATGGCGGAGG TGATGGACGA GGGCACATAT
GTCCCGCATC ACCCCAAGAA GATCGCGTTG GTGCTGGCGG CGATGCGCAA GTTTGCGCAG
CGTCTAGAAG GGCAGGGCTG GCGTGTGGCC TATCGGCGTC TGGACGAGGA TGGCGCGGAG
AGCATCGTGG GCGAACTCTT GCGCCGGGCC GAAGAATTTG GCGCCTCGGA GGTGCTGGCC
ACCACGCCGG GCGAGTGGCG GCTGATCCAC GCGTTGAAAT ATGCCCCCCT CAAGGTTCAT
TTTCATGCCG ACGACCGCTT TCTCGCCACG CGCGAGGATT TCACCGAGTG GGCCGAGGGC
AAAAAACAGC TCCGGATGGA GTATTTCTAT CGCCTGATGC GCAAGAAAAC CGGCCTTCTG
ATGGAGGGAG ACACCCCGGT TGGCGGCAAG TGGAATTACG ATTCAGAGAA CCGCAAGGCG
CCCCCCAAGA TCATCGACCA CAAGGGGCCA CCCCGGTTTG AACCGGACGC CGAGGTCGAG
GAGGTGCTGG ATCTGGTGGA GGCGCGCTTT GGCACTCATT TCGGGGATCT GCGCCCGTTC
TGGTTTGCCA CAACCCGCGA AGAAGCGCAG GAGGCGCTTG CGCATTTCAT CACCCACGCT
CTGCCGCAGT TCGGGGACTA TCAGGATGCG ATGATGACGG ATGAGCGCTG GCTCTATCAC
TCCATCCTGT CGCCCTATCT CAACATCGGT CTGTTGACCC CGCTGGAGAT CTGCGAGGCG
GCCGAGGTCG CACATCAGGA CGGCCATGCG CCGCTCAATG CGGTCGAAGG TTTCATCCGG
CAGATCCTCG GGTGGCGGGA GTATGTGCGG GGGATCTATT TCCTCGAGGG GGAGGATTAC
CCCACGCGCA ACGCACTGGA ACAAACCCGA GCGCTGCCCG CGCTCTATTG GGGGGCGGAG
ACGGACATGC ATTGCCTCTC GCAAGCGGTG GAGCAGACCG GGCAGGAGGC CTACGCCCAC
CACATCCAGC GGCTGATGGT GACCGGGAAT TTTGCGCTTT TGGCCGGGGT TGATCCGGCA
CAGGTGCACG AATGGTATCT CGCAGTGTAT GCAGATGCGT TTGAGTGGGT CGAGGCGCCC
AACACCGTCG GCATGAGCCA GTTTGCCGAT GGCGGCATCA TTGCCTCCAA ACCCTATGTC
TCCAGCGGTG CCTACATCGA CAGAATGTCC AATTATTGCG GAAGTTGCGC CTATAAGGTG
AAGCAAAAAA CGGGCGAGGG CGCCTGTCCG TTCAACCTGC TGTACTGGGA TTTCCTGAAC
CGCCACCGGG CACGGTTTGA GGGCAACCCG CGCATGGGCA ACATGTATCG CACCTGGGAC
CGGATGGACG AAGAGAAACG CGATGTGATT TTGCAGGAAG CGAGCGCGTT CCTCGCAAAA
CTTGACGCGG GCGAAAGAGT TTAG
 
Protein sequence
MTRLVLVLGD QLSEGLSALK AADPAHDVVV MAEVMDEGTY VPHHPKKIAL VLAAMRKFAQ 
RLEGQGWRVA YRRLDEDGAE SIVGELLRRA EEFGASEVLA TTPGEWRLIH ALKYAPLKVH
FHADDRFLAT REDFTEWAEG KKQLRMEYFY RLMRKKTGLL MEGDTPVGGK WNYDSENRKA
PPKIIDHKGP PRFEPDAEVE EVLDLVEARF GTHFGDLRPF WFATTREEAQ EALAHFITHA
LPQFGDYQDA MMTDERWLYH SILSPYLNIG LLTPLEICEA AEVAHQDGHA PLNAVEGFIR
QILGWREYVR GIYFLEGEDY PTRNALEQTR ALPALYWGAE TDMHCLSQAV EQTGQEAYAH
HIQRLMVTGN FALLAGVDPA QVHEWYLAVY ADAFEWVEAP NTVGMSQFAD GGIIASKPYV
SSGAYIDRMS NYCGSCAYKV KQKTGEGACP FNLLYWDFLN RHRARFEGNP RMGNMYRTWD
RMDEEKRDVI LQEASAFLAK LDAGERV
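
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below, with hypothetical helper names, shows one way to strip the grouping, compute a GC fraction, and translate the DNA. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which codons may act as start codons). It does not handle alternative start codons or ambiguous bases, and it is illustrated on the first line of the gene sequence only.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGACACGGC TTGTTTTGGT GCTGGGGGAC CAGCTTTCAG AAGGGTTGTC GGCGCTCAAA"
dna = clean_sequence(first_line)
print(translate(dna))  # MTRLVLVLGDQLSEGLSALK
```

The translated fragment agrees with the first twenty residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content reported in the record.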