Gene P9301_14751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_14751 
Symbol 
ID4911040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1243824 
End bp1245389 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content34% 
IMG OID640161067 
Productfused sugar kinase/uncharacterized domain-containing protein 
Protein accessionYP_001091699 
Protein GI126696813 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA TTGTATGGCC AACAATTGAT TCTAAACATT TAATTGTTGA TTCAAAGCAA 
ATGTTGATAG TAGAGAAAGA AATGTTTTCT GATGGAATGC CACAAGAAGC ATTGATGGAA
AAAGCTGGTA TCCAAATTAG TAGATGGCTT CTAAAAAAGA AACCTCTTCT CAAGAATGGA
ATAACTGTTT TTATAGGTCC TGGGCATAAT GGCGGGGATG GTGCAGTGAT AGCTAGAGAG
CTTTTTTTGA AAGGTTTTTA TGTTCAAGTA TGGTGTCCAT TCCCGATAAA AAAAACACTA
ACAAATGACC ACCTTAATTA TCTTACATCT ATTGGTGTCA CAAAATTAGT AGAGCCTCCT
GATGCAAATG GGAAAGATCT TTGGATTGAT GCAGTTTTTG GTAATAATCA AACAAGAAAA
GTCGATGATA GATTAATTAA ACTGTTTAAT CAAAAATTTC ATAACAAATA TAGCAAGGTA
ATAAGTATTG ATATTCCAAC AGGATTATGT CCTGATAAAG GACAGCCTTT TTTAGATAAA
GCTGTAAAGG CAGACTATAC CTTGGCAATT GGTCTTAATA AGATTGGGTT AACTCAAGAT
TCTGCATTGC CTTTTATTGG AGAATTGCAC CATATAGATG TTGGGGTACC TATTAGTAAA
CTGTCAAAAG TTGATAAAAA GATTTTTAAG GTTACTTACA AAGATTTAAA AAATATTGAT
TTACCTTCTT TACCAAAAAA TTCCAACAAA TATAAAAGAG GTAGAACATT ATTAATAGCT
GGAAGTGAAA AATATCCTGG CGCTGCATAC TTAGCATTAA AAGGCGCGAT ATCAAGTGGA
GCAGGTTATG TCTCCGCTGT CCTTCCTGAG GTAGTTGCTG AATCTATTTG GCAAGTTGCT
CCAGAAATAG TTTTAAAGGA TATTATGCAG TCTAATCAGA ATGGAAATGC ATCTTTATTT
GGTGCATTAA AAAATATTGA TTTAAGTGCA TTTGATTCAG TAGCTGTCGG CCCAGGAATA
GGAATTGATA ATGATGATTG GCAAAAATCA AAAGACTATC TTTTGGATTA TGGAGGGTTA
TTGATCTTGG ATGCAGATGC ACTTAATAGA ATTTCGGAAT CTAAGTTAGG GGCAAATTTC
TTTTTAGAGA GAAAATTTAA AACATGGATT ACACCTCATA GCAAAGAATT TCGAAGGTTA
TTCCCTAATA TCAATTCTTC TACCAATTTA GGGCTAGCTC TTGATGCAGC AAAAGAATTT
AACATTGGTA TTTTATTTAA GGGAGCTAAT AGCATAGTTG CTGATAGTAA AAAAGTTTGG
CAACTTTTCG GAACCGATTC ACAGACAGCT CGAGCTGGAT TAGGTGATCT TTTATCTGGA
TTTATAGCAG GGAGTTCTGC GATTGATTTA ACCTTTTGTA GAAACATAAC AGCAGAATAT
TTTGCTAAAT ACGTACTTTT GCATTCATTT GCTGCATCAA AGTGCAAAAA AGGGTCAAAT
GCATCTGCTA TTGGTGACGA ATTGTCAAAA TTAATGAGAA ATAAAAAAAT GAGACAAATA
TCTTGA
 
Protein sequence
MNEIVWPTID SKHLIVDSKQ MLIVEKEMFS DGMPQEALME KAGIQISRWL LKKKPLLKNG 
ITVFIGPGHN GGDGAVIARE LFLKGFYVQV WCPFPIKKTL TNDHLNYLTS IGVTKLVEPP
DANGKDLWID AVFGNNQTRK VDDRLIKLFN QKFHNKYSKV ISIDIPTGLC PDKGQPFLDK
AVKADYTLAI GLNKIGLTQD SALPFIGELH HIDVGVPISK LSKVDKKIFK VTYKDLKNID
LPSLPKNSNK YKRGRTLLIA GSEKYPGAAY LALKGAISSG AGYVSAVLPE VVAESIWQVA
PEIVLKDIMQ SNQNGNASLF GALKNIDLSA FDSVAVGPGI GIDNDDWQKS KDYLLDYGGL
LILDADALNR ISESKLGANF FLERKFKTWI TPHSKEFRRL FPNINSSTNL GLALDAAKEF
NIGILFKGAN SIVADSKKVW QLFGTDSQTA RAGLGDLLSG FIAGSSAIDL TFCRNITAEY
FAKYVLLHSF AASKCKKGSN ASAIGDELSK LMRNKKMRQI S