Gene P9303_27141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_27141 
SymbolnagA 
ID4776889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2392314 
End bp2393516 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content58% 
IMG OID640088237 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_001018709 
Protein GI124024402 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAA TGCCATGGTC CCGCACTAGC ACCACCTGGC CACCTCCCAT GCATCGGATC 
ACCCACATCC GCCTGCCCCA ACCTCTAAAT GCCATAGACA CCAAGCTGTG GTGGATGGCC
GTAGATGAGC ACGAGCGGGT GCTCAGTGTT CAACCCATGG CAGATGGCTC TGCCATGGAC
GGGGAGAGCT GGCAGGGCGA CTGGATCAGT CCCATGGGCA TCGATCTACA AATCAATGGA
GGGCTGGGAT TGGCCTTCCC CGAGCTCACC GCCAAAGACA TTCCCCAGCT CCTGAAGCTG
CTCGACAGAC TCTGGCAAGA CGGTGTACAG GCAATCTGCC CCACGCTTGT GAGCTGCGGC
GTAGCAGCCC TGCGTCAATC TTTAACGGTG CTGCATGCAG CCCGAGAACA ACACTGCCCG
CAACGCTGTG AACTACTAGG GGCCCACCTT GAAGGCCCTT TTCTGGCAAT GGCACGCCAC
GGCGCCCATC CCCTGGAGCA TCTCTGTGCT CCGAGCCTAA GGGCACTGGA TGAACGCATT
CGCGGCTTTG AACAAGACAT CAGTCTGATG ACCCTGGCTC CAGAACTGCC CGGATCCTCT
GAAGTGATTG AACGACTAAG GACCCTAGAC ATCGTGGTAT GCCTAGGGCA CTCGAACGCA
GATGGGGAAG CCTCTGCCGA TGCCTTCTCC CAGGGAGTGG GAATGCTGAC CCACTCCTTC
AATGCCATGC CCGGTCTTCA TCATCGTGCA GCTGGCCCGG TGGGGGAAGC CTGCATGCAT
GGAGAGATCG CTATGGGACT GATCGCCGAT GGCGTTCATG TTGACCCCAC CATGGCGGTG
CTATTGCAAA GACTGGCACC ACAACAGCTG GTACTTGTGA GCGATAGTCT CGCTCCCTAC
GGCCTCAAAG ATGGCAAATA TCGCTGGGAT GAAAGAGTTC TGCTGGTCGA AAAAGGAACC
TGTCGTTTGG AAGATGGCAC TCTGGCAGGA GTCACACTGC CCCTCCTGGA AGGGAGTCGA
CGTTTAGCCA CTTGGAGTGG TGAACCTGCC GCGGCCATCT GGGCTGCCAC CATGGCCCCT
CGTCAGGTGA TGGGCAATGG CCGCACACTG GATGAGCTAC TTGTGAATCA GCCCTTAACA
GACTTACTCC GCTGGCAGTG GAAACCGGAT ACTGAAGAGC TGATCTGGAA GCATGCTGCT
TAA
 
Protein sequence
MTTMPWSRTS TTWPPPMHRI THIRLPQPLN AIDTKLWWMA VDEHERVLSV QPMADGSAMD 
GESWQGDWIS PMGIDLQING GLGLAFPELT AKDIPQLLKL LDRLWQDGVQ AICPTLVSCG
VAALRQSLTV LHAAREQHCP QRCELLGAHL EGPFLAMARH GAHPLEHLCA PSLRALDERI
RGFEQDISLM TLAPELPGSS EVIERLRTLD IVVCLGHSNA DGEASADAFS QGVGMLTHSF
NAMPGLHHRA AGPVGEACMH GEIAMGLIAD GVHVDPTMAV LLQRLAPQQL VLVSDSLAPY
GLKDGKYRWD ERVLLVEKGT CRLEDGTLAG VTLPLLEGSR RLATWSGEPA AAIWAATMAP
RQVMGNGRTL DELLVNQPLT DLLRWQWKPD TEELIWKHAA