Gene P9211_08721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_08721 
SymbolargC 
ID5730755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp765297 
End bp766376 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content39% 
IMG OID641285237 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001550757 
Protein GI159903413 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.788097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00268448 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTTAT CAAACAATAA TTCAAACCGC GTCGCGATCA TAGGGGCTTC TGGATATGGG 
GGGCTTCAAC TAATGAGATT ATTAAGTGAT CACCCGCATT TCAAAGTAAC TTTTTTGGGA
GGTAATAAAA CAGCTGGTAA TAAATGGCAC CAAATTGCAC CTTTTATTAA GTCCTCTGAG
GACCTTACTG TTAGGAAAGC CGATCCAGAA GATATAGCTG AGCATGCAGA CTTTGCATTG
CTTAGTCTTC CTAATGGCCT TTCTTCTCAG TTAACCCCTG AATTGATAAA AAGAAATATT
CGTATAGTTG ATCTTTCAGC AGATTATCGT TACCGCTCCT TGTATCAATG GAAGCAAGTA
TATGTAAAAG AATCCACCAA ATATATCCGT AATGACGATT CCTTATGCAG AGAAGCAACT
TATGGAATTC CGGAATGGAA TGAAGGTGAT ATTAGGAAGT CAAAACTTGT AGCTTGCCCA
GGTTGTTTCC CAACAGCTTC ATTACTTCCT TTAATGCCTT TTCTCAAACA AGGACTTGTT
GAGAATGAAG GTTTAATAAT TGATGCCAAA AGTGGTACCT CTGGAGGCGG TAGGGAGCCT
AAAGAGCATT TGCTCTTATC TGAGTGCTCA GAATCAATAG CACCATATTC TGTTTTGGGA
CATAGACATA CTTCTGAAAT TGAGCAGCAA GCAACTCTTG TTTCAGGGAC CCCCATTCAA
CTTCAATTCA CACCACATTT AGTTCCAATG GTCCGTGGAT TGCTATCTAC AGTCTATGCA
CGATTAAGGG ATCCGGGTTT GACTGCAGAA GATTGCAAAA CCGTATTGGA AGCTTTTTAC
AGATCACATA GCACTGTAGA AATACTTCCT GTTGGAATTT ATCCCTCAAC TAAGTGGGCT
AGGTATACAA ACAAAGCATT ATTATCATTA CAGGTTGATA AAAGGAATGG GCGTCTTATC
TTAGTTTCAG TAATTGATAA CCTAATTAAA GGTCAGGCAG GTCAAGCAAT ACAGAATCTA
AATATTATGG CTGGATTCAG TCATGAATTA GGCTTACCTT TAACTACTTT CTATCCATAG
 
Protein sequence
MSLSNNNSNR VAIIGASGYG GLQLMRLLSD HPHFKVTFLG GNKTAGNKWH QIAPFIKSSE 
DLTVRKADPE DIAEHADFAL LSLPNGLSSQ LTPELIKRNI RIVDLSADYR YRSLYQWKQV
YVKESTKYIR NDDSLCREAT YGIPEWNEGD IRKSKLVACP GCFPTASLLP LMPFLKQGLV
ENEGLIIDAK SGTSGGGREP KEHLLLSECS ESIAPYSVLG HRHTSEIEQQ ATLVSGTPIQ
LQFTPHLVPM VRGLLSTVYA RLRDPGLTAE DCKTVLEAFY RSHSTVEILP VGIYPSTKWA
RYTNKALLSL QVDKRNGRLI LVSVIDNLIK GQAGQAIQNL NIMAGFSHEL GLPLTTFYP