Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_19611 |
Symbol | |
ID | 4778200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1725898 |
End bp | 1727499 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087471 |
Product | fused sugar kinase/uncharacterized domain-containing protein |
Protein accession | YP_001017968 |
Protein GI | 124023661 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0831835 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCTGGC CCCCATCGAA CGCTGATCAT CTTTTGGTTA CTGCTGCGCA GATGGCAGCT CTCGAGAAGG AGATGTTTTC CAGCGGCTTG CCGGTGGCTG CTTTGATGGA AAAAGTAGGC CAGGCGATGG CGGCTTGGTT TCGCCAACAG TCTGAGTTGT TGGCAGAGGG TGTGGTGGTG TTGGTGGGCC CTGGCCATAA CGGTGGTGAT GGATTGGTGG TGGCCAGGGA GTTGCATCTT GCAGGGGTGA AGGTCCAGCT CTGGGCGCCC TTGCCGATCC GTCAACCATT AACGGCCCAG CATTGGACTT ACGTTAAATC GCTTGGCATT CAGCAACTAG ATCAAGCTCC TGATGTAGCT GGTGAGTCTC TTTGGATCGA GGCTCTGTTT GGGCTCGGAC AATCTCGCCC ACTCCCTGAA ACGTTGGCAA CGTTGTTGCA GGCGCGCCAG CGCTGCCAGC CAGGCAAGTT GGTGAGTTTG GATGTGCCTG CTGGGCTGTG TTCAGATTCC GGCATCCCTT TCCCAGGTGG GGCTGCCGTG GCGATGACGA CGCTCACTGT GGGGTTGCTC AAGCAAGGCC TTATTCAGGA TGCGGCGATC GATCATGTTG GCCGCCTGGT GCGGGTTGAT ATGGGCGTGC CGAAGATCTT GTTGAAGCAG TTGCCAAAGT CGCAACCTCG GCGGCTCTGT TCTGCGGATG TGGCCACCGT TCCCTGGCAG CATCCAGCAG CAGGCGCGAT GAAATACGAA CGAGGGCGGG TGTTGGTGAT TGCTGGTAGT GATGATTACC CTGGGGCGGC TTTTCTGGCC ATTCAGGGTG CTATCGCTAG CGGTGCAGGC AGCATTCAAG CCGCTGTGCC TGCTGCAGTA GCCGATCAGC TTTGGCAAGT GGCGCCTGAA GTTGTTTTGG CGGCCGCACT TGAGAGTTCT GCGGCAGGTG GCATGGCCTT AGCTACTTGG TTGGCGAGTC ATGATCTCAG CCGGTTCGAT GCCGTCTTGA TTGGGCCAGG CTTAAGTCGA GGTGGAGAAC CTTGGTCAGT GTTGGCAGAA CCGTTGCAGC GCTTTGCAGG CTTGTTGGTT TTGGATGCTG ATGGTCTGAA TCGATTGGCG CTGGCTACTG ATGGATGGCA ATGGTTACAG CAGCGCCAAG GGCATACCTG GCTTACTCCC CATGCCGGTG AGTTCAGGAG ATTGTTTCCG CAGCTCAAAG CTCGGCAACC TCTCGATTCG GCTCTGGAAG CATCCCGGCT TTGTGGAGCA GCTGTGCTGC TCAAGGGAGC ACACAGTGTG GTTGCGGATC CGTCTGGTGC CGCCTGGCAG CTAGGAGAGA CAGCAAGTTG GGTTGCTCGT ACTGGGCTCG GGGATCTGTT GGCTGGTTAT GCAGCTGGCT TGGGATCTAT GGATGCTGCT AAGGCTCAGG CTTGCCATTG CCAGGGTGAG TCTTTGGCCG TAGTGGCGTT GCTTCATGCC GAGGCTGCAC GTCGATGCCG TCAAGGCAGT TCAGCAAGGT CTATCGCTCA ATCCCTTGCA GAACTCACGA TTAGCTTGCA ATCAAATGAA TGTGATCAAG GGCACGTCAA AGGGTATGAA TGCAAACGAT AA
|
Protein sequence | MSWPPSNADH LLVTAAQMAA LEKEMFSSGL PVAALMEKVG QAMAAWFRQQ SELLAEGVVV LVGPGHNGGD GLVVARELHL AGVKVQLWAP LPIRQPLTAQ HWTYVKSLGI QQLDQAPDVA GESLWIEALF GLGQSRPLPE TLATLLQARQ RCQPGKLVSL DVPAGLCSDS GIPFPGGAAV AMTTLTVGLL KQGLIQDAAI DHVGRLVRVD MGVPKILLKQ LPKSQPRRLC SADVATVPWQ HPAAGAMKYE RGRVLVIAGS DDYPGAAFLA IQGAIASGAG SIQAAVPAAV ADQLWQVAPE VVLAAALESS AAGGMALATW LASHDLSRFD AVLIGPGLSR GGEPWSVLAE PLQRFAGLLV LDADGLNRLA LATDGWQWLQ QRQGHTWLTP HAGEFRRLFP QLKARQPLDS ALEASRLCGA AVLLKGAHSV VADPSGAAWQ LGETASWVAR TGLGDLLAGY AAGLGSMDAA KAQACHCQGE SLAVVALLHA EAARRCRQGS SARSIAQSLA ELTISLQSNE CDQGHVKGYE CKR
|
| |