Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_08721 |
Symbol | argC |
ID | 5730755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 765297 |
End bp | 766376 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285237 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_001550757 |
Protein GI | 159903413 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.788097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00268448 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTTTAT CAAACAATAA TTCAAACCGC GTCGCGATCA TAGGGGCTTC TGGATATGGG GGGCTTCAAC TAATGAGATT ATTAAGTGAT CACCCGCATT TCAAAGTAAC TTTTTTGGGA GGTAATAAAA CAGCTGGTAA TAAATGGCAC CAAATTGCAC CTTTTATTAA GTCCTCTGAG GACCTTACTG TTAGGAAAGC CGATCCAGAA GATATAGCTG AGCATGCAGA CTTTGCATTG CTTAGTCTTC CTAATGGCCT TTCTTCTCAG TTAACCCCTG AATTGATAAA AAGAAATATT CGTATAGTTG ATCTTTCAGC AGATTATCGT TACCGCTCCT TGTATCAATG GAAGCAAGTA TATGTAAAAG AATCCACCAA ATATATCCGT AATGACGATT CCTTATGCAG AGAAGCAACT TATGGAATTC CGGAATGGAA TGAAGGTGAT ATTAGGAAGT CAAAACTTGT AGCTTGCCCA GGTTGTTTCC CAACAGCTTC ATTACTTCCT TTAATGCCTT TTCTCAAACA AGGACTTGTT GAGAATGAAG GTTTAATAAT TGATGCCAAA AGTGGTACCT CTGGAGGCGG TAGGGAGCCT AAAGAGCATT TGCTCTTATC TGAGTGCTCA GAATCAATAG CACCATATTC TGTTTTGGGA CATAGACATA CTTCTGAAAT TGAGCAGCAA GCAACTCTTG TTTCAGGGAC CCCCATTCAA CTTCAATTCA CACCACATTT AGTTCCAATG GTCCGTGGAT TGCTATCTAC AGTCTATGCA CGATTAAGGG ATCCGGGTTT GACTGCAGAA GATTGCAAAA CCGTATTGGA AGCTTTTTAC AGATCACATA GCACTGTAGA AATACTTCCT GTTGGAATTT ATCCCTCAAC TAAGTGGGCT AGGTATACAA ACAAAGCATT ATTATCATTA CAGGTTGATA AAAGGAATGG GCGTCTTATC TTAGTTTCAG TAATTGATAA CCTAATTAAA GGTCAGGCAG GTCAAGCAAT ACAGAATCTA AATATTATGG CTGGATTCAG TCATGAATTA GGCTTACCTT TAACTACTTT CTATCCATAG
|
Protein sequence | MSLSNNNSNR VAIIGASGYG GLQLMRLLSD HPHFKVTFLG GNKTAGNKWH QIAPFIKSSE DLTVRKADPE DIAEHADFAL LSLPNGLSSQ LTPELIKRNI RIVDLSADYR YRSLYQWKQV YVKESTKYIR NDDSLCREAT YGIPEWNEGD IRKSKLVACP GCFPTASLLP LMPFLKQGLV ENEGLIIDAK SGTSGGGREP KEHLLLSECS ESIAPYSVLG HRHTSEIEQQ ATLVSGTPIQ LQFTPHLVPM VRGLLSTVYA RLRDPGLTAE DCKTVLEAFY RSHSTVEILP VGIYPSTKWA RYTNKALLSL QVDKRNGRLI LVSVIDNLIK GQAGQAIQNL NIMAGFSHEL GLPLTTFYP
|
| |