Gene TM1040_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1469 
Symbol 
ID4077766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1570319 
End bp1571401 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content64% 
IMG OID638006780 
Productradical SAM family protein 
Protein accessionYP_613464 
Protein GI99081310 
COG category[L] Replication, recombination and repair 
COG ID[COG1533] DNA repair photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.637197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAACA CGGATCCTTT TTCGCCTCGA TGTCGCAAAG CGCGCGGTGC GCTCAGCAAT 
GCGGCAGGCC GGTTTGATCT GGCGCGCGAG GCGCAGGACG ATGGCTGGTG GCAGGAGGCG
CCGGATCCTT CTGTTGCGAC AGAGATTCGC AACGAGCTAG CGCGCAGCCT GATCTCTTAC
AACCGTTCGC CGGATCTGCC CTTTGACCGC TCGATCAACC CCTATCGCGG CTGTGAGCAA
GGTTGCGTGT ATTGTTTTGC GCGCCCGTCG CATGCCTATC TCGGCCTGTC GCCGGGGCTG
GATTTCGAAA CCCGGCTTGT TGCACGCAGC AACGCGGCAG AGGTGCTGCG AAAAGAGCTC
TCGGCGCGGA GCTACAAGGT CGCGACCCTG GCCATCGGCA CCAACACGGA TCCCTATCAG
CCCTGCGAGC GGGATCATCT CTTGATGCGC CAGTGTCTTG AGGTGCTGCA GGCGTTCAAT
CACCCGGTGG CAATTGTCAC CAAGGGCACT CTGATCGAGC GCGATATTGA TATTCTGACC
GATATGGCGC GCCGCGGCCT GGTGCGAGTC GGGATTTCCG TGACCACGTT GGATGCGAAC
CTGTCGCGGC GGATGGAGCC GCGCGCGCCG GTGCCTGCGC GGCGCCTGGC GACGATCCGA
CGGCTGAGTG CTGCGGGGGT TCCAGTACGC ATAATGACGT CGCCCTTGGT GCCGGGGCTC
ACCGATCACG AGCTTGAGGC GCTATTGGCA GCCGGGCAGG ACGCGGGTGC AGATGCGGCC
AGCTGGATCA TGTTGCGCCT GCCGCGCGAG GTATCGCAGC TCTGGCAGGA CTGGCTTCAA
GAGCATGCAC CCGATCGGGC CGCCAAGGTG ATGGCGCGGC TGCGCGAGAT GCATGGCGGG
CGCGATTACG ATCCGCGCTG GGGGCACCGG ATGCGAGGGG AGGGCGAGTA TGCCGAGATG
ATCGCGCGCC GTTTTCGGCT GGCTTGCAAG CGGCTTGGAT TGGCAGAGCG CACCGCGCCC
CTCAGAACAG ACCTCTTTGC AAAACCGCTG CAGCCCGGCG ATCAGCTCAG CCTGTTTTCC
TGA
 
Protein sequence
MQNTDPFSPR CRKARGALSN AAGRFDLARE AQDDGWWQEA PDPSVATEIR NELARSLISY 
NRSPDLPFDR SINPYRGCEQ GCVYCFARPS HAYLGLSPGL DFETRLVARS NAAEVLRKEL
SARSYKVATL AIGTNTDPYQ PCERDHLLMR QCLEVLQAFN HPVAIVTKGT LIERDIDILT
DMARRGLVRV GISVTTLDAN LSRRMEPRAP VPARRLATIR RLSAAGVPVR IMTSPLVPGL
TDHELEALLA AGQDAGADAA SWIMLRLPRE VSQLWQDWLQ EHAPDRAAKV MARLREMHGG
RDYDPRWGHR MRGEGEYAEM IARRFRLACK RLGLAERTAP LRTDLFAKPL QPGDQLSLFS