Gene TM1040_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0097 
Symbol 
ID4078763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp102371 
End bp103426 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content65% 
IMG OID638005384 
Productdeoxyribose-phosphate aldolase 
Protein accessionYP_612092 
Protein GI99079938 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0274] Deoxyribose-phosphate aldolase 
TIGRFAM ID[TIGR00126] deoxyribose-phosphate aldolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCC AAACCGCAGA AGACGGCACG CAGCCGCCGA AGACCTCCGC TGCCAAACCC 
TCCGCCAAGG CCAAGCCGAC CGCAGCCGCC ACGCAGGACC ATGAACACAG CGACCTGCCC
CGCAATCCGG GTATCGCGCT TGATCTCGAC TGGGCCCTGA GCCTGGAGGC CAATACCTCC
GCGATTGAAC GCCGCTGCGC GAGCCTGCCG GGTCGGCGCT CGGTCAAGAA GGACTATCAG
GCCGCTTGGC TCTTGAAGGC GATCAGCCTC ATCGATCTCA CCACGCTCTC GGGCGATGAT
ACCGAGGCCC GCGTGCGCCG TCTCTGCGCC AAGGCGCGCC AGCCGGTGCG AGCGGATATT
CTCTCGGCAC TTGGCATCGA CGGGATCACG ACCGGAGCCG TCTGTGTCTA TCACGACATG
ATCCCCGCCG CCGTGCGCGC GCTGCATGGC ACCCATATCC CGGTTGCCGC CGTCTCTACG
GGCTTTCCGG CGGGGCTGTC GCCCTTCCAC CTGCGGCTCG AGGAAATCCG CGAAAGCGTG
CGGGCCGGCG CCAAGGAAAT CGACATTGTG ATCTCGCGCC GCCATGTGCT CTCGGGGGAC
TGGCAGGCGC TTTATGACGA GATGAAAGCC TTCCGCGAGG CCTGCGGAGA TGCCCACGTC
AAGGCGATCC TCGCCACTGG CGAGCTTGGC ACTCTGCGCA ATGTCGCCCG CGCCTCGATG
ATCTGCATGA TGGCGGGTGC CGACTTCATC AAGACGTCCA CTGGCAAAGA GAGCGTCAAC
GCCACCCTGC CCGTCTCGTT GGTGATGATC CGCGCCATTC GCGAGTATTA CGAGCGCACC
GGCTTTCATG TGGGCTACAA GCCCGCTGGT GGCATCTCCA AGGCCAAAGA CGCGCTGGTG
TACCTCAGCC TCATGAAAGA GGAACTTGGC AACCGCTGGC TGCAGCCGGA CCTGTTCCGC
TTTGGCGCGT CCAGCCTTCT GGGCGACATC GAACGCCAGC TCGAACACCA TGTGACCGGC
GCCTATTCCG CTGGCTACCG CCACCCGATG GCGTGA
 
Protein sequence
MDSQTAEDGT QPPKTSAAKP SAKAKPTAAA TQDHEHSDLP RNPGIALDLD WALSLEANTS 
AIERRCASLP GRRSVKKDYQ AAWLLKAISL IDLTTLSGDD TEARVRRLCA KARQPVRADI
LSALGIDGIT TGAVCVYHDM IPAAVRALHG THIPVAAVST GFPAGLSPFH LRLEEIRESV
RAGAKEIDIV ISRRHVLSGD WQALYDEMKA FREACGDAHV KAILATGELG TLRNVARASM
ICMMAGADFI KTSTGKESVN ATLPVSLVMI RAIREYYERT GFHVGYKPAG GISKAKDALV
YLSLMKEELG NRWLQPDLFR FGASSLLGDI ERQLEHHVTG AYSAGYRHPM A