Gene A9601_14891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14891 
Symbol 
ID4718210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1270719 
End bp1272284 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content34% 
IMG OID640079210 
Productfused sugar kinase/uncharacterized domain-containing protein 
Protein accessionYP_001009879 
Protein GI123969021 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA TTGTATGGCC AACAATTGAC TCTGAACATT TAATTGTTGA TTCAAAGCAA 
ATGCTGGTAT TAGAGAAAGA AATGTTTTCT GATGGGATGC CAAAAGAAGC ATTGATGGAA
AAAGCTGGTA TCCAAATTAG TAGATGGCTC TTAAAAAAGA AACCTCTTCT CAAGCATGGA
ATAACTTTTT TTATAGGTCC TGGGCATAAT GGCGGGGATG GTGCAGTAAT AGCTAGAGAA
CTTTTTTTGA GAGGTTTTAA TGTTCAGGTA TGGTGTCCAT TCCCGATAAA AAAAACATTA
ACAAATAACC ACCTTAATTA TCTTACATCT ATTGGTGTCA CAAAATTAGT AGAGCCTCCT
GATGCAAATG GGAAAGAGCT TTGGATTGAT GCAGTTTTTG GTAACAATCA AACAAGAAAA
GTTGATGATA AATTAATTAA ACTGTTTAAT CAAAAATTTC ATAAAAAACA TGGAAAGGTA
ATAAGTATTG ATATTCCAAC AGGATTATGT CCTGATAAAG GAGAGCCTTT TTTAGATAAC
GCAGTAAAGG CAGATTATAC CTTGACCATT GGTCTTTATA AGATTGGGTT GACCCAAGAT
TCTGCTTTGC CTTTTATTGG AGAATTGCAC CATATAGATG TTGGGATACC TACTAGTAAA
CTGTCCAAAG TTGATAAAAA GATTTTTAAG GTTACTTACA AAGACATTAA ACATATTGAT
TTACCTTCTT TACCAAAAAA TTACAACAAA TATCAAAGAG GTAGGACATT ACTAATAGCA
GGAAGTGAAA AATATCCTGG TGCCGCATAC TTGGCATTAA AAGGGGCTAT GTCAAGTGGA
GCAGGCTTTA TCTCTGCGGT CCTTCCTGGG ATAGTTTCTG AATCTATTTG GCAAGTTGCC
CCAGAAATAG TTTTGAAAGA AACTATGCAA TCTAACCAAA ATGGCAATGC ATCTTTATTT
AGTGCATTAA GGAATATTGA TTTAAGTGCA TTTGATTCAT TAGCTGTCGG TCCAGGAATA
GGAATTGATA ATGATGATTG GCAAAAATCA AAAGACTATC TTATTGATTT TGGAGGATTA
TTGATCTTGG ATGCAGATGC ACTTAATAGA ATTTCGGAAT CTAAGTTGGG GGCAAATTTC
TTTTTAGAGA GAAAGTTTAA GACATGGATT ACACCACATA GCAAAGAATT TGGAAGATTA
TTTCCCAATA TCAATTCTCA TACCAATGTT GGGCTTGCCC GTGAAGCGGC AAAAGAATTC
AATATCAGTA TTTTGTTAAA AGGAGCTAAC AGTGTAGTTG CTGATAATAG AAAAGTATGG
CAACTTTTTG GAACCGATTC TCAGACAGCT CGAGCTGGAT TGGGAGATCT TTTATCTGGT
TTTGTAGCTG GTAGTTCTGC AATTGATTCA ACCTTGTGTA GAAATATATC AACAGATTTT
TTTGCTAAGT ATGTACTTCT GCATTCATTT GCTGCATCAA AGTGCAAAAA AGGGTCAAAT
GCTTCTGTTA TTGGTGACGA ATTGTCAAAA TTAATGAGAA ATATAAAAAC GAGACAAATA
TCTTGA
 
Protein sequence
MNEIVWPTID SEHLIVDSKQ MLVLEKEMFS DGMPKEALME KAGIQISRWL LKKKPLLKHG 
ITFFIGPGHN GGDGAVIARE LFLRGFNVQV WCPFPIKKTL TNNHLNYLTS IGVTKLVEPP
DANGKELWID AVFGNNQTRK VDDKLIKLFN QKFHKKHGKV ISIDIPTGLC PDKGEPFLDN
AVKADYTLTI GLYKIGLTQD SALPFIGELH HIDVGIPTSK LSKVDKKIFK VTYKDIKHID
LPSLPKNYNK YQRGRTLLIA GSEKYPGAAY LALKGAMSSG AGFISAVLPG IVSESIWQVA
PEIVLKETMQ SNQNGNASLF SALRNIDLSA FDSLAVGPGI GIDNDDWQKS KDYLIDFGGL
LILDADALNR ISESKLGANF FLERKFKTWI TPHSKEFGRL FPNINSHTNV GLAREAAKEF
NISILLKGAN SVVADNRKVW QLFGTDSQTA RAGLGDLLSG FVAGSSAIDS TLCRNISTDF
FAKYVLLHSF AASKCKKGSN ASVIGDELSK LMRNIKTRQI S