Gene TM1040_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2166 
Symbol 
ID4076765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2275574 
End bp2277385 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content66% 
IMG OID638007486 
Productsingle-stranded-DNA-specific exonuclease RecJ 
Protein accessionYP_614160 
Protein GI99082006 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.145891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGAAAAG ATGAAGGCCG GGTCCCGCGC GGGTCCGGCC TTTTTTACAC AGAAGACGAG 
GGAATAGAGT TGAGCTTTCT GGGTGTCGAG ACCTCACTGA CCGGACGCCG CTGGGTGGGA
CCCGGTGTGG ATCAGGCCCG CGCAAGCGAG CATCTGGCGC AGGAAACCGG GCTACCTCCT
GCGGTGTGTC AGGTCCTCGC ACGCCGGGGC GTGCCCGCGC ATGAGGCCAC CGGGTTTCTG
ACGCCGCAGC TCAAGGATCT CTTGCCCGAT CCGCGCCGCA TGAAGGACAT GGAGACCGCA
GCAGCGCGCT TTCTGCAAGC CGTGGAGCGG CGCGAGCGGA TTGCGATCTT TGCCGATTAT
GACGTCGATG GCGGCTCTTC GGCGGCGCTT TTGCTGGTGT ACCTGCGACA GATGGGCCAG
CAGGCGACAC TATATATTCC GGATCGCATT GATGAGGGTT ATGGCCCCAA TGATGCGGCA
ATGGCGGCGC TGGCGCGCGA TCATGACCTT ATCATCTGCG TGGACTGTGG CACACTTTCG
CACGGGCCCA TCGCAGCGGC GAAGGGCGCG GATGTGGTGG TTCTGGATCA CCACCTTGGC
GGCGAGACCC TGCCCGACTG CGTGGCAGTG GTGAACCCCA ACCGGCAGGA TGAGGACGGC
GATCTGGGCT ATTTCTGCGC CGCTGGTGTG GTGTTCCTGA TGCTCGTCGA GGTGCGCCGT
CAGGCGCGCG ACAAGGGGCT TGGCGCAGGC CCTGATCTGA TGGCGATGCT CGATCTGGTG
GCGCTGGCCA CCGTGGCAGA TGTGGCCCCC CTGATCGGCG CGAACCGCGC GCTGGTGCGC
CAAGGCCTGA AGGTGATGGG GCGACGCCAA CGTCCGGGTC TGGTTGCCCT CGCGGATGTG
AGCCGGATGG ATGCGGCGCC CTCAACCTAT CACCTCGGCT TTCTCCTGGG GCCGCGGGTC
AATGCGGGCG GGCGGATCGG CAAGGCGGAC CTTGGTGCGC GCCTGCTGTC GACGGATGAC
CCGCATGAGG CTGCTGCGCT CTCCGAACGG CTTGACCAGC TCAATACGGA GCGGCGCGAC
ATTGAGAACG CGGTGCGTGC TGCCGCACTG GAACAGGCCG AGGCGCGCGG CTTTGACGCG
CCTTTGGTCT GGGCGGCGGG GGAAGGCTGG CATCCGGGCG TGGTCGGCAT CGTCGCCTCG
CGCCTCAAGG AAGCAACGGG CCGACCTGCG GTGGTCATCG GCCTAGACGG CGAAGAGGGC
AAAGGATCGG GACGCTCTGT CTCGGGCATC GACCTTGGGG CCTCGATCCA GAAGGTCGCA
ATGGAAGGGC TGCTCTTGAA GGGCGGCGGG CACAAGATGG CTGCGGGCCT CACGGTGGCG
CGCGGCCAGC TGGAACCCGC GATGGCCCGC CTTGCAGAGC TCTTGGACAA ACAGGGCGCA
GGCGATCTGG GCCCCGCGGA TCTGAAGCTC GACGGGATGC TGATGCCCGG CGCCGCCAGC
GTGGAGTTGA TCGAGCAGAT CGAACAGGCC GGGCCGTTTG GCGCTGGGGC CCCTGCCCCC
CGCTATGGGT TTCCTGACGT CGCCGTACGC TTTGCCAAAC GGGTTGGAGA ATCGCATCTC
AAGGTCAGTT TCACCGATGG GATGGGCGGC AATATCGACG CCATCGCCTT TGGCGCCTTT
GACACCAACC TCGGGCCGCG CTTGCTCGAA CACGGAGGCG CGCGCTTTCA TGTCTCCGGA
CGGCTAGAGG TCAACACCTG GGGCGGGCGC CAAAGCGCGC AATTGCGACT GGAGGACGCA
GCCGAGGCCT GA
 
Protein sequence
MGKDEGRVPR GSGLFYTEDE GIELSFLGVE TSLTGRRWVG PGVDQARASE HLAQETGLPP 
AVCQVLARRG VPAHEATGFL TPQLKDLLPD PRRMKDMETA AARFLQAVER RERIAIFADY
DVDGGSSAAL LLVYLRQMGQ QATLYIPDRI DEGYGPNDAA MAALARDHDL IICVDCGTLS
HGPIAAAKGA DVVVLDHHLG GETLPDCVAV VNPNRQDEDG DLGYFCAAGV VFLMLVEVRR
QARDKGLGAG PDLMAMLDLV ALATVADVAP LIGANRALVR QGLKVMGRRQ RPGLVALADV
SRMDAAPSTY HLGFLLGPRV NAGGRIGKAD LGARLLSTDD PHEAAALSER LDQLNTERRD
IENAVRAAAL EQAEARGFDA PLVWAAGEGW HPGVVGIVAS RLKEATGRPA VVIGLDGEEG
KGSGRSVSGI DLGASIQKVA MEGLLLKGGG HKMAAGLTVA RGQLEPAMAR LAELLDKQGA
GDLGPADLKL DGMLMPGAAS VELIEQIEQA GPFGAGAPAP RYGFPDVAVR FAKRVGESHL
KVSFTDGMGG NIDAIAFGAF DTNLGPRLLE HGGARFHVSG RLEVNTWGGR QSAQLRLEDA
AEA