Gene Saro_1786 details

Gene Information

Locus tag: Saro_1786
Symbol:
ID: 3918345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1884003
End bp: 1885409
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 71%
IMG OID: 640444527
Product: YjeF-related protein-like
Protein accession: YP_497060
Protein GI: 87199803
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
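
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 1884003
END_BP = 1885409
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the coordinate span gives 1407 bp, and 1407 / 3 = 469 codons, one of which is the stop codon.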


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCGCA GCCGTGACCG CTTGCTCACG CAGGTCCTTA ACGTCGCGCA GATGCACGCG 
GCCGAGCAGG CGTTGATCGC GGCGGGAACC GACGTCCACC AGCTCATGCA GCGGGCCGGG
CGCGGCGCGG GCGAGTGGGT GCGACGGATC GCGGCCGGCC GCCCCGTCAC GGTGCTCTGC
GGGCCGGGCA ATAATGGCGG CGACGGTTGG GTCATCGCCG AATATCTGCG AGAGCACGGC
AATCCGGTAA CGGTCGTTGT CGCACGCGAG CCGGGCACGG GTGCGGCGAA GACCGCTCGC
TCGCTCTATC GCGGTGCCGC CGTGCCCGGT GACGCTGCGG TAGAGGGCGA AGTGCTTGTC
GATTGCCTGT TCGGTAGCGG CCTGACGCGG GGCCTGTCGG ACGATCTCTT CGAGCTGCTT
GCCTGTCTTG CCCGGCGCCA TCCTCATCGT ATCGCCATCG ATGTGCCGAG CGGCGTGGAA
AGCGATAGCG GACGCCCGCT CAATGCCGGC CTGCCGCAAT CGACCCTGAC CATCGCCCTC
GGGGCCTGGA AGCATGCGCA TTTCGCGATG CCCGCCTGCG CGATGATGGG CGTGCTGCGC
CTCGTTGACA TCGGCGTGAA CGAAGTGCCG GGAGCGGCGC GCGTGCTGGC AAGGCCATCC
ATCTCCGTGC CCGCCGCCGA TGCCCACAAG TACCGCCGGG GCATGCTCGG GATCGTGGCC
GGGGCAATGC CGGGGGCGAC CATTCTCGCC TCGACGGCGG CGCTGCGGGC AGGGGCGGGC
TATGTGAAGC TTGCCGCCTC CGCCGCGCCC GCGAACGCTC CAGCCGAACT GGTGGTGACC
TCCGATCTTT CCGCGATGCT TGCCGACGAT CGCCTTGCGG CGCTGCTGGT CGGCCCCGGC
TTCGGGCGCG GCGACGAGGC CGCGCGCATC CTTGCCCGGT CGCTGCACGC CGCGAGGCCC
AGCGTGGTCG ACGCGGACGG GCTCATGCTC CTTCGCCCCG CCATGCTTTC GGGGACGCCG
ATGGTGCTGA CGCCGCACGA CGGGGAAATG GCCGCACTGG AACGCGCGTT CGACCTTCCG
GCGAGCGGGC TCCGCCGCGA GCGTGCGCTC GCGCTGGCTG CTGCCAGCAA GGCCGTGGTC
GTGCTCAAGG GGCCGGACAG CGTGATCGCA GGGCCGGAGG GCGAACTCGT CGTTTCGCCG
CGCGCTTCGT CGTGGCTGTC CGTGGCCGGG ACCGGCGATG TCCTGGCCGG GACCATCGCG
AGCCGCCTGG CCGTTCATGG AGATGCCATG CGCGCCGCCG AGGAAGGTTT GTGGCTGCAC
GGCGAGGCGG CCAGGATCGT CGGCTCCGCC TTTACCGCCG GGGAACTGGC CTGCGCCGTG
CGTGCGGCTG TCGAGGAATG TCTTTGA
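
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The record does not say how its GC content figure was computed, so the G+C over total length definition used here is an assumption, and both function names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as printed in the record.
    sample = clean_sequence("ATGCCGCGCA GCCGTGACCG")
    print(len(sample))          # 20
    print(gc_fraction(sample))  # 0.75
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field.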
 
Protein sequence
MPRSRDRLLT QVLNVAQMHA AEQALIAAGT DVHQLMQRAG RGAGEWVRRI AAGRPVTVLC 
GPGNNGGDGW VIAEYLREHG NPVTVVVARE PGTGAAKTAR SLYRGAAVPG DAAVEGEVLV
DCLFGSGLTR GLSDDLFELL ACLARRHPHR IAIDVPSGVE SDSGRPLNAG LPQSTLTIAL
GAWKHAHFAM PACAMMGVLR LVDIGVNEVP GAARVLARPS ISVPAADAHK YRRGMLGIVA
GAMPGATILA STAALRAGAG YVKLAASAAP ANAPAELVVT SDLSAMLADD RLAALLVGPG
FGRGDEAARI LARSLHAARP SVVDADGLML LRPAMLSGTP MVLTPHDGEM AALERAFDLP
ASGLRRERAL ALAAASKAVV VLKGPDSVIA GPEGELVVSP RASSWLSVAG TGDVLAGTIA
SRLAVHGDAM RAAEEGLWLH GEAARIVGSA FTAGELACAV RAAVEECL