Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_02421 |
Symbol | nagA |
ID | 4779598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 223118 |
End bp | 224278 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083507 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_001014071 |
Protein GI | 124024955 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAA TTACAAACAT TAAAGTACCT CAACCAGAGA AAAGCGAAGA TGAAAAGAAT TTATTTTCAC TAGTTTTAGA TGAACATGGA ATTATTCTTT CTATTGATAA AACTTGCAAG AAGGAATCAA GTTATGACGA AGATTGGCAT GGAGATTGGC TAAGTCCTAG AGCCATTGAT CTTCAAATGA ATGGAGGTTT AGGGATCTCA TTTACAGATT TAAATTTTAA TAAAATACCT AAATTATTAA CATTGCTTGA TCAACTTTGG CTAGATGGAG TTGAAGGAAT TTGTCCAACT TTTGTTAGTT GTTCTCTAGA TCAATTTCAA TTAGGAATGG AGGTCTTAAA GGAAACGAAA AAATATACTT CAACAAATCG ATGTCGACTA CTTGGCGCCC ATATGGAAGG ACCTTTTTTA TGTGAGACAT ATTCTGGAGC ACATGACACA GGAAGTATTT GTAGACCTAG TATTTCTGCC TTAAATGAGA GAATAAAGGG ATTTGAGGAC GACATAGCAC TAATGACATT GGCTCCAGAA TTAAAAGGTT CTTTGGATGT AATTTCTCGT CTAAGAGAAT TAAATATCGT TGTCTCACTG GGACATTCTT CAGCTGATTT TGATTCAGCC ATAAAAGCTT TTAATAATGG TGTGAGCATG ATTACGCATA CTTTTAATGC AATGAAAGGT TTACATCATC GAGCTATTGG ACCTATAGGA GCGGCTGCAA GAAGAGACGA TATTTTCCTA GGATTGATAG GTGATGGTGT TCATGTTCAT TCAGATATGA TTAGGCTTTT AAAGATACTT GCACCAAAGC AAATTGTTTT AGTTAGCGAT GCAATATCAG CTTATGGTTT AGGAGATGGA TCGTTTAATT GGGATAAACG CTTGATAACA GTAGAAAATG GTTTATGCAG ACTTCCAAAT GAAACAATTG CAGGATCCAC TTTGCCTTTA CTAGACGCAT GCAAAAAAAT AGCAAATTGG ATTAATGATC CCTCTGCTGC TATTTGGATG GCAACACTTG CTCCACGTAT GGTATTAAGT CAGAACAAAA TGACTGCAAA ATCTTTTTTT ATTGGAAAGA ATATTAAGTC TTTACTTAGA TGGAAAATGA ATTCTTGTAA TCATCAGTTG TCTTGGCAGT TGGCTGCGTA A
|
Protein sequence | MRKITNIKVP QPEKSEDEKN LFSLVLDEHG IILSIDKTCK KESSYDEDWH GDWLSPRAID LQMNGGLGIS FTDLNFNKIP KLLTLLDQLW LDGVEGICPT FVSCSLDQFQ LGMEVLKETK KYTSTNRCRL LGAHMEGPFL CETYSGAHDT GSICRPSISA LNERIKGFED DIALMTLAPE LKGSLDVISR LRELNIVVSL GHSSADFDSA IKAFNNGVSM ITHTFNAMKG LHHRAIGPIG AAARRDDIFL GLIGDGVHVH SDMIRLLKIL APKQIVLVSD AISAYGLGDG SFNWDKRLIT VENGLCRLPN ETIAGSTLPL LDACKKIANW INDPSAAIWM ATLAPRMVLS QNKMTAKSFF IGKNIKSLLR WKMNSCNHQL SWQLAA
|
| |