Gene NATL1_02421 details

Gene Information

Locus tag: NATL1_02421
Symbol: nagA
ID: 4779598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 223118
End bp: 224278
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 35%
IMG OID: 640083507
Product: N-acetylglucosamine-6-phosphate deacetylase
Protein accession: YP_001014071
Protein GI: 124024955
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
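
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(223118, 224278)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Under these assumptions the computed values agree with the listed Gene Length (1161 bp) and Protein Length (386 aa).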


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAA TTACAAACAT TAAAGTACCT CAACCAGAGA AAAGCGAAGA TGAAAAGAAT 
TTATTTTCAC TAGTTTTAGA TGAACATGGA ATTATTCTTT CTATTGATAA AACTTGCAAG
AAGGAATCAA GTTATGACGA AGATTGGCAT GGAGATTGGC TAAGTCCTAG AGCCATTGAT
CTTCAAATGA ATGGAGGTTT AGGGATCTCA TTTACAGATT TAAATTTTAA TAAAATACCT
AAATTATTAA CATTGCTTGA TCAACTTTGG CTAGATGGAG TTGAAGGAAT TTGTCCAACT
TTTGTTAGTT GTTCTCTAGA TCAATTTCAA TTAGGAATGG AGGTCTTAAA GGAAACGAAA
AAATATACTT CAACAAATCG ATGTCGACTA CTTGGCGCCC ATATGGAAGG ACCTTTTTTA
TGTGAGACAT ATTCTGGAGC ACATGACACA GGAAGTATTT GTAGACCTAG TATTTCTGCC
TTAAATGAGA GAATAAAGGG ATTTGAGGAC GACATAGCAC TAATGACATT GGCTCCAGAA
TTAAAAGGTT CTTTGGATGT AATTTCTCGT CTAAGAGAAT TAAATATCGT TGTCTCACTG
GGACATTCTT CAGCTGATTT TGATTCAGCC ATAAAAGCTT TTAATAATGG TGTGAGCATG
ATTACGCATA CTTTTAATGC AATGAAAGGT TTACATCATC GAGCTATTGG ACCTATAGGA
GCGGCTGCAA GAAGAGACGA TATTTTCCTA GGATTGATAG GTGATGGTGT TCATGTTCAT
TCAGATATGA TTAGGCTTTT AAAGATACTT GCACCAAAGC AAATTGTTTT AGTTAGCGAT
GCAATATCAG CTTATGGTTT AGGAGATGGA TCGTTTAATT GGGATAAACG CTTGATAACA
GTAGAAAATG GTTTATGCAG ACTTCCAAAT GAAACAATTG CAGGATCCAC TTTGCCTTTA
CTAGACGCAT GCAAAAAAAT AGCAAATTGG ATTAATGATC CCTCTGCTGC TATTTGGATG
GCAACACTTG CTCCACGTAT GGTATTAAGT CAGAACAAAA TGACTGCAAA ATCTTTTTTT
ATTGGAAAGA ATATTAAGTC TTTACTTAGA TGGAAAATGA ATTCTTGTAA TCATCAGTTG
TCTTGGCAGT TGGCTGCGTA A
 
Protein sequence
MRKITNIKVP QPEKSEDEKN LFSLVLDEHG IILSIDKTCK KESSYDEDWH GDWLSPRAID 
LQMNGGLGIS FTDLNFNKIP KLLTLLDQLW LDGVEGICPT FVSCSLDQFQ LGMEVLKETK
KYTSTNRCRL LGAHMEGPFL CETYSGAHDT GSICRPSISA LNERIKGFED DIALMTLAPE
LKGSLDVISR LRELNIVVSL GHSSADFDSA IKAFNNGVSM ITHTFNAMKG LHHRAIGPIG
AAARRDDIFL GLIGDGVHVH SDMIRLLKIL APKQIVLVSD AISAYGLGDG SFNWDKRLIT
VENGLCRLPN ETIAGSTLPL LDACKKIANW INDPSAAIWM ATLAPRMVLS QNKMTAKSFF
IGKNIKSLLR WKMNSCNHQL SWQLAA
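
The gene and protein listings above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below shows one way to clean such a listing, compute its GC fraction, and translate it with the amino acid assignments of NCBI translation table 11 (the table named in the Gene Information section). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence read in frame 1 from a standard ATG start.

```python
# Amino acids for NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A/C/G/T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
first_block = "ATGAGAAAAA TTACAAACAT TAAAGTACCT"
print(translate(first_block))  # MRKITNIKVP
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above, and `gc_fraction` gives a value that can be compared with the reported GC content.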