Gene Information |
Locus tag | Sare_4248 |
Symbol | |
ID | 5708098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4820287 |
End bp | 4821786 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641273667 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001539020 |
Protein GI | 159039767 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
|
|
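The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, although it matches the lengths reported in this record) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information fields of this record.
START_BP = 4820287
END_BP = 4821786

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1500 499
```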
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.380487 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0166005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACCAG TGTGGCGGGT CGCGGACGTA CGCGCCGCGG AGGCGGGCCT GATGGCGGCG CTTCCATCCG GGACGCTGAT GCAGCGCGCC GCGGCCGGGC TCGCCCGCCG GTGTGCACGT GTCCTGACCG ACCGGGGCGG CGTCTACGGT GCCTCGGTGT TGCTGCTGGT CGGCTCGGGT GACAACGGCG GTGACGCACT CTTCGCCGGT GCCCGCCTGG CCCGGCGCGG GGCGGCGGTG TCAGCCCTGC TGTTGTCCCC GGATCGGGTA CACGCCGAGG CGTTGACGGC GTTGCGAGCC GCCGGCGGCC GGCTGGTCGA GCGCCCCCCG GCACGGGTGG ACCTGGTGGT CGACGGCATC GTCGGGATCG GTGGCAGCGG CGGGCTCCGT GAGCCGGCGG AACAGCTCGC GGCAAGTCTG GCGGGGTGCT GTGGGCGAGA CGGTGACCGG GCGACCGTGG TCGCGGTGGA TGTTCCCAGT GGGGTGTCGG TCGACACGGG GCACGTGCCG CAGTCCGCCT CCGGACGACC GGCGGCGGTC CACGCCGACG TGACAGTGAC CTTCGGCGCG TTGAAACCCG CGCTGGTGGT GGGACCGGCG GCGCCGCTCG CCGGCCAGGT CGATCTGGTC GACATCGGAC TGGAGCCCTG GCTGCGTAGC ACGCCGGCGC TACGCGTCAC CGAGTGGGCG GATGTGACCG GCTGGTGGCC CACTCCTGGT CCGGCAACCG AAAAGTACAC CCGGGGCGTC GTCGGGGTTG CGACCGGCTC GGCCACCTAT CCCGGCGCCG CGGTGCTCTC GGTCGCCGGT GCCCTGGCCG GCCCGACCGG CATGGTGCGA TACGCCGGGA GCGCTCGGGT CGAGGTGCTG CGCCAGCACC CGTCGGTGAT CGCCACCGAC CGGGTCGCCG ACGCCGGCCG GGTGCAGGCG TGGGTATGCG GTTCCGGGCT CGGTACCGAT GACGAGGCGG CCGGGGAACT GCGGGCGGTG CTCGCGGCGC CGGTGCCGGC GGTGCTCGAC GCGGACGCGT TGACCCTGCT CGTGGACGGA TCCCTCGCCC ACCTGCTGCG GCGACGCGAC GCCCCGATCG TGGTCACCCC GCACGACCGG GAGTTCGCCC GGCTCTGCGG CGAGACCCCC GGGACCGACC GGGTCGCCGC CGCGCTGCGC CTGGCCGCCT GGATGAACGC CGTGGTGCTA CTCAAGGGCG ACCGGACGGT GATCGGCACG CCGGACGGCC GGGCGTATGT CAATCAGACC GGAACGCCGG CCCTGGCCAC CGGTGGCACG GGCGATGTGC TGGCCGGACT GCTTGGCTCG TTGCTCGCCG CGGGCCTCAA CCCGGAGCGA GCCGCCGCCG CCGCGGCGTA CCTGCACGGG CTGGCCGGCC GGGAGGCGGC CCAGGGTGGC CCGGTCACCG CTCCCGATGT CGCCACCGCG CTGCGCCCGG TGCTGGCTCG CGTCGGGTGG ATCGACGGGC GGGCTGGGCC GAACTGCTGA
|
Protein sequence | MRPVWRVADV RAAEAGLMAA LPSGTLMQRA AAGLARRCAR VLTDRGGVYG ASVLLLVGSG DNGGDALFAG ARLARRGAAV SALLLSPDRV HAEALTALRA AGGRLVERPP ARVDLVVDGI VGIGGSGGLR EPAEQLAASL AGCCGRDGDR ATVVAVDVPS GVSVDTGHVP QSASGRPAAV HADVTVTFGA LKPALVVGPA APLAGQVDLV DIGLEPWLRS TPALRVTEWA DVTGWWPTPG PATEKYTRGV VGVATGSATY PGAAVLSVAG ALAGPTGMVR YAGSARVEVL RQHPSVIATD RVADAGRVQA WVCGSGLGTD DEAAGELRAV LAAPVPAVLD ADALTLLVDG SLAHLLRRRD APIVVTPHDR EFARLCGETP GTDRVAAALR LAAWMNAVVL LKGDRTVIGT PDGRAYVNQT GTPALATGGT GDVLAGLLGS LLAAGLNPER AAAAAAYLHG LAGREAAQGG PVTAPDVATA LRPVLARVGW IDGRAGPNC
|
| |
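The relationship between the Gene sequence and Protein sequence fields can be spot-checked by translating the first codons. The sketch below is illustrative only: it builds the codon-to-amino-acid map by hand rather than using a bioinformatics library, and all names in it are hypothetical. Translation table 11 assigns amino acids to codons in the same way as the standard code (it differs in which codons may act as start codons), and this gene begins with ATG, so no special handling of the start codon is needed here. The Gene sequence field starts with ATG and ends with TGA, which suggests it is already given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codon map in the conventional T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record's sequences into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence and first block of the Protein sequence.
GENE_START = "ATGAGACCAG TGTGGCGGGT CGCGGACGTA"
PROTEIN_START = "MRPVWRVADV"

print(translate(GENE_START) == clean(PROTEIN_START))  # True
```

Passing the full Gene sequence field to `translate` in the same way would allow the whole Protein sequence field to be compared, not just its first ten residues.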