Gene P9211_13371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_13371 
Symbol 
ID5731847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1205095 
End bp1206669 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content41% 
IMG OID641285708 
Productfused sugar kinase/uncharacterized domain-containing protein 
Protein accessionYP_001551222 
Protein GI159903878 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.873266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGGC CTAAATCAGA TGCAGATCAT TTAATTGCCT CAACTGCTCA GATGAGAAGC 
TTTGAAAACA ATCTCATCAA TAGAGGCATG CCTGTTGAGG CATTAATGGA AAAAGTAGGG
CAAAAACTGA AAGATTGGTT TTTGGAAAGG CCTGAATTAC TTTTAAACGG TGTATTGATT
TTAGTTGGTC CAGGGCACAA TGGAGGGGAC GGGCTGGTTT TAGCTAGAGA ATTGTTCTTG
GAGAACATAG ATGTATCTAT TTGGTGTCCT TTGCCAATTA CGCAATCTTT AACTCGGCGT
CATCAGGCTT ATTCGCATTG GCTGGGTATT AAGGAATTGC ATGAAGAACC TAATGTTTAT
TGTAAGACTC TTTGGATTGA CGCCCTTTTT GGACTTGGAC AGACCCGTCC TCTTCCTGAG
AAAATAGCAA ATCTTTTAAA GGGTAGAGAG GAAAAACAAC CAAGAAGATT AATCTCTATT
GATGTTCCAT CTGGGATTTG TTCAGATACA GGAAAGACTT TCTCAAACGT TGCAGCTACA
GCTTCTTATA CCTTGACTAT TGGCTTGAAT AAGCAGGGAT TGATTCAAGA CATTGCTATC
CCATATGTGG GTGAATTAAT AAGACTTGAT ATTGGTATTC CACAGAAAGT TCTCAAGAAA
GATAAGCAAA TTAACTTCTT CAAACTTGCA GGTGATGATC TTTTTTCGAT CCCTTGGCCT
GAGCCTTCTT CAATTTCTTC TAAATATCAG CGTGGAAGAC TTTTGGTTGT TGCCGGCAGT
GACAAGTATA AAGGTGCTGC TTTACTTGCT TTACAAGGCG CATTAGCAAG TGGTGTGGGC
AGTATTAAGG CGCTTTTGCC CCAAAAAATG GCAGATAATA TTTGGCAAGT TTCTCCAGAA
ATTGTTGTTT CAGGAATATT GGGAAATTCA ATTGATGGAG AATCTGAGAT AGGGGAGGCG
ATCTTAAAAC ACAAATTAGA CCGTCTAGAC TCATTATTAA TTGGTCCTGG TTTAGCTTTG
GCTAACGAAA AGTGGAGTGA TATTGCTTGC CAGCTTGAAA AATTTCCAGG GTTATTAGTT
CTTGATGCTG ATGCTTTAAA TAGATTAGCT TTTTCTACTG AGAGCTGGGA GTGGATTAAT
AAGCGTGAAG GACCAACATG GCTGACTCCT CATTTACAGG AGTTCTATCG TTTATTCCCA
AATCTTAAGG ACTTATCACC TTTGGATGCA GCGCCTAAAG CGGCAAAGAT GACTGGGGCA
AATGTATTGC TTAAAGGGGC ACATAGCCTT ATAGCATCCC CCTCTGGGGT GAGATGGCAA
TTGGTGTCTA CATCTAGCTG TGCTGCGCGT GCAGGTTTAG GGGATGTTTT GGCTGGATTT
GTTGCTGGAG TAGGTGCAAT AGGGTTTTCT ATAGGAGGAG CAATGAATGA TGAATTACTT
GCTTTAGCAG CATTGCTTCA CGCGGAAGCG GCAAAAAGTT GCAAAAAAGG AAGTAAAGCA
AGCTCAATTG CATTGGCTTT AGAAAAACTT GTAAGACACA TTCAGTGTCC TGAATGTGTT
CAAAGAAACA TTTAA
 
Protein sequence
MNWPKSDADH LIASTAQMRS FENNLINRGM PVEALMEKVG QKLKDWFLER PELLLNGVLI 
LVGPGHNGGD GLVLARELFL ENIDVSIWCP LPITQSLTRR HQAYSHWLGI KELHEEPNVY
CKTLWIDALF GLGQTRPLPE KIANLLKGRE EKQPRRLISI DVPSGICSDT GKTFSNVAAT
ASYTLTIGLN KQGLIQDIAI PYVGELIRLD IGIPQKVLKK DKQINFFKLA GDDLFSIPWP
EPSSISSKYQ RGRLLVVAGS DKYKGAALLA LQGALASGVG SIKALLPQKM ADNIWQVSPE
IVVSGILGNS IDGESEIGEA ILKHKLDRLD SLLIGPGLAL ANEKWSDIAC QLEKFPGLLV
LDADALNRLA FSTESWEWIN KREGPTWLTP HLQEFYRLFP NLKDLSPLDA APKAAKMTGA
NVLLKGAHSL IASPSGVRWQ LVSTSSCAAR AGLGDVLAGF VAGVGAIGFS IGGAMNDELL
ALAALLHAEA AKSCKKGSKA SSIALALEKL VRHIQCPECV QRNI