Gene P9303_19611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19611 
Symbol 
ID4778200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1725898 
End bp1727499 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content57% 
IMG OID640087471 
Productfused sugar kinase/uncharacterized domain-containing protein 
Protein accessionYP_001017968 
Protein GI124023661 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0831835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCTGGC CCCCATCGAA CGCTGATCAT CTTTTGGTTA CTGCTGCGCA GATGGCAGCT 
CTCGAGAAGG AGATGTTTTC CAGCGGCTTG CCGGTGGCTG CTTTGATGGA AAAAGTAGGC
CAGGCGATGG CGGCTTGGTT TCGCCAACAG TCTGAGTTGT TGGCAGAGGG TGTGGTGGTG
TTGGTGGGCC CTGGCCATAA CGGTGGTGAT GGATTGGTGG TGGCCAGGGA GTTGCATCTT
GCAGGGGTGA AGGTCCAGCT CTGGGCGCCC TTGCCGATCC GTCAACCATT AACGGCCCAG
CATTGGACTT ACGTTAAATC GCTTGGCATT CAGCAACTAG ATCAAGCTCC TGATGTAGCT
GGTGAGTCTC TTTGGATCGA GGCTCTGTTT GGGCTCGGAC AATCTCGCCC ACTCCCTGAA
ACGTTGGCAA CGTTGTTGCA GGCGCGCCAG CGCTGCCAGC CAGGCAAGTT GGTGAGTTTG
GATGTGCCTG CTGGGCTGTG TTCAGATTCC GGCATCCCTT TCCCAGGTGG GGCTGCCGTG
GCGATGACGA CGCTCACTGT GGGGTTGCTC AAGCAAGGCC TTATTCAGGA TGCGGCGATC
GATCATGTTG GCCGCCTGGT GCGGGTTGAT ATGGGCGTGC CGAAGATCTT GTTGAAGCAG
TTGCCAAAGT CGCAACCTCG GCGGCTCTGT TCTGCGGATG TGGCCACCGT TCCCTGGCAG
CATCCAGCAG CAGGCGCGAT GAAATACGAA CGAGGGCGGG TGTTGGTGAT TGCTGGTAGT
GATGATTACC CTGGGGCGGC TTTTCTGGCC ATTCAGGGTG CTATCGCTAG CGGTGCAGGC
AGCATTCAAG CCGCTGTGCC TGCTGCAGTA GCCGATCAGC TTTGGCAAGT GGCGCCTGAA
GTTGTTTTGG CGGCCGCACT TGAGAGTTCT GCGGCAGGTG GCATGGCCTT AGCTACTTGG
TTGGCGAGTC ATGATCTCAG CCGGTTCGAT GCCGTCTTGA TTGGGCCAGG CTTAAGTCGA
GGTGGAGAAC CTTGGTCAGT GTTGGCAGAA CCGTTGCAGC GCTTTGCAGG CTTGTTGGTT
TTGGATGCTG ATGGTCTGAA TCGATTGGCG CTGGCTACTG ATGGATGGCA ATGGTTACAG
CAGCGCCAAG GGCATACCTG GCTTACTCCC CATGCCGGTG AGTTCAGGAG ATTGTTTCCG
CAGCTCAAAG CTCGGCAACC TCTCGATTCG GCTCTGGAAG CATCCCGGCT TTGTGGAGCA
GCTGTGCTGC TCAAGGGAGC ACACAGTGTG GTTGCGGATC CGTCTGGTGC CGCCTGGCAG
CTAGGAGAGA CAGCAAGTTG GGTTGCTCGT ACTGGGCTCG GGGATCTGTT GGCTGGTTAT
GCAGCTGGCT TGGGATCTAT GGATGCTGCT AAGGCTCAGG CTTGCCATTG CCAGGGTGAG
TCTTTGGCCG TAGTGGCGTT GCTTCATGCC GAGGCTGCAC GTCGATGCCG TCAAGGCAGT
TCAGCAAGGT CTATCGCTCA ATCCCTTGCA GAACTCACGA TTAGCTTGCA ATCAAATGAA
TGTGATCAAG GGCACGTCAA AGGGTATGAA TGCAAACGAT AA
 
Protein sequence
MSWPPSNADH LLVTAAQMAA LEKEMFSSGL PVAALMEKVG QAMAAWFRQQ SELLAEGVVV 
LVGPGHNGGD GLVVARELHL AGVKVQLWAP LPIRQPLTAQ HWTYVKSLGI QQLDQAPDVA
GESLWIEALF GLGQSRPLPE TLATLLQARQ RCQPGKLVSL DVPAGLCSDS GIPFPGGAAV
AMTTLTVGLL KQGLIQDAAI DHVGRLVRVD MGVPKILLKQ LPKSQPRRLC SADVATVPWQ
HPAAGAMKYE RGRVLVIAGS DDYPGAAFLA IQGAIASGAG SIQAAVPAAV ADQLWQVAPE
VVLAAALESS AAGGMALATW LASHDLSRFD AVLIGPGLSR GGEPWSVLAE PLQRFAGLLV
LDADGLNRLA LATDGWQWLQ QRQGHTWLTP HAGEFRRLFP QLKARQPLDS ALEASRLCGA
AVLLKGAHSV VADPSGAAWQ LGETASWVAR TGLGDLLAGY AAGLGSMDAA KAQACHCQGE
SLAVVALLHA EAARRCRQGS SARSIAQSLA ELTISLQSNE CDQGHVKGYE CKR