Gene TM1040_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0224 
Symbol 
ID4076257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp239155 
End bp240252 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content59% 
IMG OID638005518 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_612219 
Protein GI99080065 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.626495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTTA AAATGGGAAT CGTGGGTCTG CCCAATGTGG GCAAGTCGAC CCTGTTCAAC 
GCGCTGACCA AAACCGCCTC GGCGCAGGCG GCAAATTTTC CGTTCTGTAC GATCGAACCG
AACGTGGGTG AGGTGGGCGT TCCGGACGCG CGTCTCGACA AATTGGCGGC GATTGCGCAG
TCCAAACAGA TCATCCCAAC CCGCATGACG TTTGTGGATA TTGCTGGCCT CGTCAAAGGC
GCCTCAAAGG GCGAAGGTCT GGGCAACCAG TTCCTTGCCA ATATCCGTGA GGTGGACGCA
ATTGCCCATG TTTTACGGTG CTTTGAGGAC GGTGACGTTA CCCATGTCGA TGGTCGCGTT
GATCCGGTTG CGGACGCCGA TACCATCGAA ACCGAGCTGA TGCTGGCGGA TCTTGAGAGC
ATCGAGAAAC GCCGCGCCAA CCTCGTACGC AAGCTCAAGG GCAACGACAA GGAAGCCCAG
CAGCAGGACC GCCTGCTCGC AGCGGCGCAG GCCATGCTCG AAGATGGCAA ACCAGCCCGT
CTGGTTGAGG TCGACGCAGA GGACCAGAAG GCCTGGACCA TGCTGCAACT GCTGACCACA
AAGCCGGTGC TTTACGTCTG CAATGTGGGT GAAAGCGAGA GCGTCGAAGG CAACGCACAT
TCCGCCAAAG TTGCCGAGAT GGCCGCGGCT CAGGGTAACG CGCATGTGAT CATTTCGGCG
CAGATCGAAG AGGAAATCAG CCAGCTTGAG CCCGAAGAAG CGCAGATGTT CCTCGATGAG
ATGGGTCTCG CAGAAGCCGG TCTCGACCGC CTGATCCGCG CCGGTTACGA GCTCTTGCAT
CTGGAAACCT ATTTCACGGT CGGCCCCAAG GAAGCGCGCG CCTGGACCAT TCGCTCGGGC
ACCGCTGCGC CCCAGGCGGC AGGCGTTATC CACGGCGATT TTGAAAAGGG TTTCATCCGC
GCGGAGACCA TCGCCTATGA CGACTACATC GCTTGCGGCG GTGAATCCGG CGCCAAAGAA
GCGGGCAAGA TGCGCGCCGA GGGCAAGAGC TACATCGTCA AGGATGGCGA TGTGATGCAC
TTCTTGTTCA ACACCTGA
 
Protein sequence
MGFKMGIVGL PNVGKSTLFN ALTKTASAQA ANFPFCTIEP NVGEVGVPDA RLDKLAAIAQ 
SKQIIPTRMT FVDIAGLVKG ASKGEGLGNQ FLANIREVDA IAHVLRCFED GDVTHVDGRV
DPVADADTIE TELMLADLES IEKRRANLVR KLKGNDKEAQ QQDRLLAAAQ AMLEDGKPAR
LVEVDAEDQK AWTMLQLLTT KPVLYVCNVG ESESVEGNAH SAKVAEMAAA QGNAHVIISA
QIEEEISQLE PEEAQMFLDE MGLAEAGLDR LIRAGYELLH LETYFTVGPK EARAWTIRSG
TAAPQAAGVI HGDFEKGFIR AETIAYDDYI ACGGESGAKE AGKMRAEGKS YIVKDGDVMH
FLFNT