Gene TM1040_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1024 
Symbol 
ID4078536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1095376 
End bp1096962 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content66% 
IMG OID638006328 
ProductYjeF-related protein-like 
Protein accessionYP_613019 
Protein GI99080865 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAGG TTGTGACCGC GCAGCAGATG CGCGCGCTGG AGCAGGCGGC AATTGCGAGC 
GGGGCTGTGA CGGGTTGCGA GCTTATGGAA TGCGCGGGCA ACGCGGTTGT GGCGGCGATT
CTTGAGCACT GGCCGGAGTT TGCCGGTCGG GCTGACGGGG GCGCTGCCCC CCGGCCTGCG
GCCGTCCCCC GGGATATTTC TGAAAAGCTG AAGCGCGCCG TGGTGCTTTG CGGGCCGGGC
AACAATGGCG GTGACGGCTT TGTGGTGGCG CGGCTGCTGA AGCTCCGGGG CTGGGCGGTG
GAGGTTTTTT TCTACGGGGA TGCGGCGAAA CTGCCAGCAG ATGCCAAGGT GAATTACCAG
CGCTGGTGCG CGCTTGGTGA CGTGCATCGC CTTGATGCGG CAACGCTTGC GTCAGGGACC
AAGGTGCAGC ACGCGGATCT GGTTGTGGAT GCGCTTTTTG GGACGGGGTT GACGCGTCCA
GTGGATTTGC CGCTGGGAGT GATCTGTGAG GTCGCACGGA ATGTGGTGGC GATTGATGTG
CCCTCGGGGC TTTGCAGTGA CAGCGGGCGT GTGATCGGGG CGGCTGCGGT GCAGGCGGAT
CTGACGGTCA CCTTTCATGC ACAGAAGCTG GGGCATGTGT TGGCGGAAGG GCCGCAGCAC
TGCGGGACGG TGAGGGTGGC CTCGATCGGG TTGGAGGGCC ATGAGCTGCC CGTCGCAGCG
GTGCCCACAG TATCTTTTGA TGCACCGGAT CACGCGGACC TCGCGAAGGG GTCTGGCGCG
CATAAGTTTT CCTATGGCCA TGCGCTGATC TTGTCTGGCG GAGCGGGACA GACGGGGGCG
GCGCGGTTGG CGGCGCGCGG GGCGCTTCGG ATCGGCGCGG GATTGGTGAC GCTGGGTGTG
TCCCCTGCCG CGCAGATGGA GGTGGCAGGC CATGTCACCG CCGTGATGCT GCGCCGCGTC
GAGGGAGCGG ACGGGTTGGA GGCTGTTCTG CAGGATGCGC GGATCAATGC GCTCTGTCTC
GGGCCGGGGC TGGGGCATGA GCGGGCGCGG GCGCTTGTGC CGGTGGCGCT GGCGGCAGGG
CGGGCAACGG TTCTGGATGC AGATGCGCTC TCAGCCTACT CTGACGCGCC GGATGCGCTG
TTCGACCAGC TGCACAAAAA CTGCGTTCTG ACCCCACATG GGGGAGAGTT TGCGCGGCTG
TTTCCGGATC TTGCGCAGCG GCTCGCAGAG CCCGCAACGC GCGGACCCGC CTATTCCAAG
GTGGACGCCA CCCGCGACGC CGCAGCGCGC GCGGGTTGCA CCATCCTGTT CAAAGGGGCC
GATACAGTGA TTGCAGATGA AGCCGGCGAC TGCGCGATTC ATGACGCCGC TGGCGCGCGC
GCGGCGCCCT GGCTGGCGAC CGCGGGGGCC GGGGACGTGC TTGCTGGGTT TATCACGGGG
CTTCTTGCGC GCGGTCTTGC CGCCCATGAT GGGGCCACAA CGGGGGCCTG GTTACATGCC
GACTGCGCGC GGCAGTTTGG TCCGGGGCTG ATCGCAGAGG ATTTGCCTGA GCAACTGCCC
CATGTGTTTC GCAAGTTGGG GCTCTAG
 
Protein sequence
MVEVVTAQQM RALEQAAIAS GAVTGCELME CAGNAVVAAI LEHWPEFAGR ADGGAAPRPA 
AVPRDISEKL KRAVVLCGPG NNGGDGFVVA RLLKLRGWAV EVFFYGDAAK LPADAKVNYQ
RWCALGDVHR LDAATLASGT KVQHADLVVD ALFGTGLTRP VDLPLGVICE VARNVVAIDV
PSGLCSDSGR VIGAAAVQAD LTVTFHAQKL GHVLAEGPQH CGTVRVASIG LEGHELPVAA
VPTVSFDAPD HADLAKGSGA HKFSYGHALI LSGGAGQTGA ARLAARGALR IGAGLVTLGV
SPAAQMEVAG HVTAVMLRRV EGADGLEAVL QDARINALCL GPGLGHERAR ALVPVALAAG
RATVLDADAL SAYSDAPDAL FDQLHKNCVL TPHGGEFARL FPDLAQRLAE PATRGPAYSK
VDATRDAAAR AGCTILFKGA DTVIADEAGD CAIHDAAGAR AAPWLATAGA GDVLAGFITG
LLARGLAAHD GATTGAWLHA DCARQFGPGL IAEDLPEQLP HVFRKLGL