Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24411 |
Symbol | |
ID | 4778130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2147240 |
End bp | 2148202 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087961 |
Product | N-acetylglucosamine kinase |
Protein accession | YP_001018437 |
Protein GI | 124024130 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2971] Predicted N-acetylglucosamine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGGCC TTTCCCTGGC AGGCTTTGAT GCAGGCCAAA CCCAAACCCG ATGCCGCCTA AGCCGCTGGC ATCAAAATCA ATGGTTGCCG ATTGCAGAGG GCCTCGGCAG CGGCGTCATT CACCTGCAGG CCTCTGACGG TGAAGAACGC TTTGAGAAAG CACTGCGAAG CAGCTTCAGC AAGGCTGTCG GCAATGCTGG ATTGAGCTCT GAAAAAGCTC TGATCTCTGC AGCGGCCGTC GGAGCAAGCG GAATAGAACA TGACACCCCA CTGCAAGAGC AAGCGCAGCA CTTACTCGCG CGTTGTCTAA ACATCCCATC AAACCAATGC CTAGCAACTG GGGATGAACG CACAGCCCTG CATGGGGCCT TCCCCCAAGA CGCAGGCATT GTGTTGATCA GTGGCACCGG CATGATCTGC ATCGGGCGAA ACGATCAAGG AAAAGAACAA CGCTGTGGCG GCTGGGGTTG GCTACTTGAT GAGGGTGGCT CAGCTCAAAA CCTCGGACAA AAGGGGCTAC AGCTCAGCCT GCGCATGGCC GATGGCCGCA TCCCCGATCG ACCTCTGCGC GAGAAACTCT GGCGCTCTCT CAATTGCTCA TCGGCAGCAG CTATCAAAGC CCTTGTTGTA CAGCCCAGTT TCGGTGCTGC CGGGTTTGCT CAACTCGCGC CACTCGTCGT TGCAGAAGCC CAGGTAGGCG ATCAGGATGC CATTGCAATC CTTGAACAAT CAGCCCACTG CATCGCCGAA GCGATCGCAG GAGTTGCCCA AAGCCTTGAG CTATCGGCTC CCCAGATCTG TGGCAACGGC GGAGCCTTTG AACATCTACA ACCCTTTCGC GAGCTGATCG AGCAAGCCAT TGCCAAGCGA CTGCCTACTG CAAGCTGGAT CAAAGGCCAG GGGGATGCCC TGGATGGTGC TTTGCAGCTT GCACTCCGCC AACTCAAACG CAACCCCGAT TGA
|
Protein sequence | MNGLSLAGFD AGQTQTRCRL SRWHQNQWLP IAEGLGSGVI HLQASDGEER FEKALRSSFS KAVGNAGLSS EKALISAAAV GASGIEHDTP LQEQAQHLLA RCLNIPSNQC LATGDERTAL HGAFPQDAGI VLISGTGMIC IGRNDQGKEQ RCGGWGWLLD EGGSAQNLGQ KGLQLSLRMA DGRIPDRPLR EKLWRSLNCS SAAAIKALVV QPSFGAAGFA QLAPLVVAEA QVGDQDAIAI LEQSAHCIAE AIAGVAQSLE LSAPQICGNG GAFEHLQPFR ELIEQAIAKR LPTASWIKGQ GDALDGALQL ALRQLKRNPD
|
| |