Gene TM1040_3173 details


Gene Information

Locus tag: TM1040_3173
Symbol:
ID: 4075343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 153696
End bp: 155003
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 63%
IMG OID: 638004676
Product: ROK domain-containing protein
Protein accession: YP_611409
Protein GI: 99078151
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
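
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(153696, 155003)
print(length, protein_length(length))  # 1308 435
```

Both values match the Gene Length and Protein Length fields of the record.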


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.66636
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.682387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCCTT TCTCACAGGG AGAGATTGCC GCGGCGTGCC CAAGGTGGCA TAAGCGAGCG 
CCTTTTGGCG GGCGGCGAGG GAGGAGCTTT GAATTGGCAC GATCAGGACG GCTGACGCCG
AGCAGCGAGC AACGCGAAAG CGGGCGCCAG CAGATACTGG ATGTGATCCG CGCGCAGGAA
AGCATCGCCC GCATCGACAT CGCCCAGGCC ACAGGCATGA GCCCGGCGAC GGTGACCGCG
ATCACCGCCG AGTTGCTGGC AGCGGGCCTG ATCGAAGAGA TCGCGCCCGA GCTGGCGCCC
GGTGCGCGCC GGGGGCGTCC GCGTGTGGCT CTGCGTCTGC GCGGCGCGGC GCGCCTGATC
GCCGGTCTCA AGGTTTCCCA TCATGTGATC TCCACCGTGA TTACCGATTT TGTCGGACAG
GAACTTGCCA GCCACGAGAT GCCGCTGGTG CAGGGCACGA TGCCGGTGCC CGAACTGTGC
GCGCAGATCC GCCGCGCGCT TGACCTCACC TGCGAGAAAG GCGGCTTCAG CATCGAGGAT
CTCTCCGGTG TCGGTCTCGG AATGGCCGGG ATGATGGATG CGGACCGGGG CTTTATCTAT
TGGTCCTCAT CGCTCGAAGA GCGCAATGTC GCCTTCACCG CCGCCATCAG TGCCGAGCTG
CCCTGTCCGG TGTTTCTGGA CAATGACGCA AACCTCGTGG CCAAGGCCGA ACATCTTTTT
GGCGAGGGGC GCACCTGCGA CAATTTCATT GTCATCACCA TCGAACACGG CGTTGGCATG
GGGATCGTGA TCGACCAGCA GATCTATCGC GGCACCCGCG GCTGCGGCGC CGAATTGGGT
CACACGAAGG TCCATCTCGA AGGGGCGCTG TGCCAATGCG GGCAACGCGG CTGTCTGGAG
GCCTATGTGG GCGATTACGC GCTCCTGCGC GAGGCGAATA TTTCGAGCGG CAGTGAACGC
CACACCACCA TCGCCTCACT GTTTCAGTCG GCTGAAAATG GCGATGTGGT GGCTAAGTCC
ATCCTTGACC GCGCGCGGCG GATGTTTGCG ATGGGGTTGG CAAATGTCGT CAACATTTTT
GACCCGAGCA AGATCATCCT CGCGGGGGCC CGGTTGTCAT TCGACTATCT CTATTCCGAC
AAGCTCATCG AGGAGATGCG TCAGTGGGTG GTGCAGGTGG ATGCCCCGCT GCCAGAGGTC
ATGGTCCATG ACTGGGGCGA TCTGATGTGG GCCAAGGGGG CGGCGGCCTA TGCGCTCGAA
GAGGTGACGG CGCGCACCGT GCGGGAGCTT GCAAATGCGG CGGCCTGA
 
Protein sequence
MYPFSQGEIA AACPRWHKRA PFGGRRGRSF ELARSGRLTP SSEQRESGRQ QILDVIRAQE 
SIARIDIAQA TGMSPATVTA ITAELLAAGL IEEIAPELAP GARRGRPRVA LRLRGAARLI
AGLKVSHHVI STVITDFVGQ ELASHEMPLV QGTMPVPELC AQIRRALDLT CEKGGFSIED
LSGVGLGMAG MMDADRGFIY WSSSLEERNV AFTAAISAEL PCPVFLDNDA NLVAKAEHLF
GEGRTCDNFI VITIEHGVGM GIVIDQQIYR GTRGCGAELG HTKVHLEGAL CQCGQRGCLE
AYVGDYALLR EANISSGSER HTTIASLFQS AENGDVVAKS ILDRARRMFA MGLANVVNIF
DPSKIILAGA RLSFDYLYSD KLIEEMRQWV VQVDAPLPEV MVHDWGDLMW AKGAAAYALE
EVTARTVREL ANAAA
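
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares (the tables differ in permitted start codons); the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the two inputs are the first 30 and last 18 nucleotides of the gene sequence above.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Assumption: table 11 uses the same sense-codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGTATCCTT TCTCACAGGG AGAGATTGCC"))  # MYPFSQGEIA
print(translate("GCAAATGCGG CGGCCTGA"))               # ANAAA
```

The two outputs correspond to the first ten and last five residues of the protein sequence; passing the full gene sequence to `gc_content` would give the fraction that the record rounds to a percentage.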