Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1786 |
Symbol | |
ID | 3918345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1884003 |
End bp | 1885409 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640444527 |
Product | YjeF-related protein-like |
Protein accession | YP_497060 |
Protein GI | 87199803 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGCA GCCGTGACCG CTTGCTCACG CAGGTCCTTA ACGTCGCGCA GATGCACGCG GCCGAGCAGG CGTTGATCGC GGCGGGAACC GACGTCCACC AGCTCATGCA GCGGGCCGGG CGCGGCGCGG GCGAGTGGGT GCGACGGATC GCGGCCGGCC GCCCCGTCAC GGTGCTCTGC GGGCCGGGCA ATAATGGCGG CGACGGTTGG GTCATCGCCG AATATCTGCG AGAGCACGGC AATCCGGTAA CGGTCGTTGT CGCACGCGAG CCGGGCACGG GTGCGGCGAA GACCGCTCGC TCGCTCTATC GCGGTGCCGC CGTGCCCGGT GACGCTGCGG TAGAGGGCGA AGTGCTTGTC GATTGCCTGT TCGGTAGCGG CCTGACGCGG GGCCTGTCGG ACGATCTCTT CGAGCTGCTT GCCTGTCTTG CCCGGCGCCA TCCTCATCGT ATCGCCATCG ATGTGCCGAG CGGCGTGGAA AGCGATAGCG GACGCCCGCT CAATGCCGGC CTGCCGCAAT CGACCCTGAC CATCGCCCTC GGGGCCTGGA AGCATGCGCA TTTCGCGATG CCCGCCTGCG CGATGATGGG CGTGCTGCGC CTCGTTGACA TCGGCGTGAA CGAAGTGCCG GGAGCGGCGC GCGTGCTGGC AAGGCCATCC ATCTCCGTGC CCGCCGCCGA TGCCCACAAG TACCGCCGGG GCATGCTCGG GATCGTGGCC GGGGCAATGC CGGGGGCGAC CATTCTCGCC TCGACGGCGG CGCTGCGGGC AGGGGCGGGC TATGTGAAGC TTGCCGCCTC CGCCGCGCCC GCGAACGCTC CAGCCGAACT GGTGGTGACC TCCGATCTTT CCGCGATGCT TGCCGACGAT CGCCTTGCGG CGCTGCTGGT CGGCCCCGGC TTCGGGCGCG GCGACGAGGC CGCGCGCATC CTTGCCCGGT CGCTGCACGC CGCGAGGCCC AGCGTGGTCG ACGCGGACGG GCTCATGCTC CTTCGCCCCG CCATGCTTTC GGGGACGCCG ATGGTGCTGA CGCCGCACGA CGGGGAAATG GCCGCACTGG AACGCGCGTT CGACCTTCCG GCGAGCGGGC TCCGCCGCGA GCGTGCGCTC GCGCTGGCTG CTGCCAGCAA GGCCGTGGTC GTGCTCAAGG GGCCGGACAG CGTGATCGCA GGGCCGGAGG GCGAACTCGT CGTTTCGCCG CGCGCTTCGT CGTGGCTGTC CGTGGCCGGG ACCGGCGATG TCCTGGCCGG GACCATCGCG AGCCGCCTGG CCGTTCATGG AGATGCCATG CGCGCCGCCG AGGAAGGTTT GTGGCTGCAC GGCGAGGCGG CCAGGATCGT CGGCTCCGCC TTTACCGCCG GGGAACTGGC CTGCGCCGTG CGTGCGGCTG TCGAGGAATG TCTTTGA
|
Protein sequence | MPRSRDRLLT QVLNVAQMHA AEQALIAAGT DVHQLMQRAG RGAGEWVRRI AAGRPVTVLC GPGNNGGDGW VIAEYLREHG NPVTVVVARE PGTGAAKTAR SLYRGAAVPG DAAVEGEVLV DCLFGSGLTR GLSDDLFELL ACLARRHPHR IAIDVPSGVE SDSGRPLNAG LPQSTLTIAL GAWKHAHFAM PACAMMGVLR LVDIGVNEVP GAARVLARPS ISVPAADAHK YRRGMLGIVA GAMPGATILA STAALRAGAG YVKLAASAAP ANAPAELVVT SDLSAMLADD RLAALLVGPG FGRGDEAARI LARSLHAARP SVVDADGLML LRPAMLSGTP MVLTPHDGEM AALERAFDLP ASGLRRERAL ALAAASKAVV VLKGPDSVIA GPEGELVVSP RASSWLSVAG TGDVLAGTIA SRLAVHGDAM RAAEEGLWLH GEAARIVGSA FTAGELACAV RAAVEECL
|
| |