Gene Information |
Locus tag | P9303_15121 |
Symbol | argC |
ID | 4777902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1315204 |
End bp | 1316319 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640087020 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_001017521 |
Protein GI | 124023214 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
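The coordinate and length fields in the table above agree with one another, and the arithmetic is simple to check. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Both are assumptions about this record's conventions, although they match the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1315204
end_bp = 1316319

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1116 371
```

The result matches the listed Gene Length (1116 bp) and Protein Length (371 aa).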
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.179423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCAA TGGAAAGTAC CCATCGCTCT CCAGCCTCTG ACATTTCTCA GGTTCGAGGC AGCCGAGTAG CGGTTATCGG CGCTACGGGC TACGGCGGTC TTCAAACCAT CAGATTGCTT GAGGACCATC CTCAGCTCCA TGTCACCTAC CTGGGCGGCG AACGAAGTGC CGGCAGGCGA TGGAGTGAGC TTTGCCCTTT TTTACCCATC CCTGATGATC CTGAAGTTCA ATCTCCCGAT CCTGACAAGA TCGCTGAATA CGCCGATTAC GCAGTCCTCA GCCTTCCCAA TGGACTTGCC AGCCAGTTAG CACCGCAACT GCTGAAGCGC AATGTTCGGG TTGTAGACCT CTCTGCTGAC TTTCGCTATC GCTCACTGGA ACAATGGAAA CAAGTTTATG TCCATGAAGC TCAAAACCTC AATCGAGATG ATGTTCAGCT TTGTAGAGAG GCTGTTTATG GGTTACCTGA ATGGAAGGGA CCTGAAATTG CAGTTGCAAA TCTTGTAGCT GCACCAGGCT GCTTCCCAAC CGCAAGTCTG TTGCCCTTAC TGCCTTTTCT AAAACAGGGA TTGATCGAAA ACGATGGTTT GATTATCGAT GCAAAGACTG GCACCTCTGG TGGAGGGCGG GTGGCCAAAG AACAATTTCT TCTGTCAGAG GCCTCGGAGT CGATCATGCC TTATGGCGTG GTTGGGCATC GACATACATC AGAAATCGAA CAACTTGCTA GTGAGGTTGC CGGACAACCG ATAGAGCTTC AGTTCACACC CCATCTCGTA CCGATGGTGC GTGGACTGCT AGCGACGGTT TATGGACGCT TGAGAGATCC TGGATTGACA GCTGAGGACT GCACAACTGT GCTTAAAGCG GTCTATCGAC ATCATCCCTG CATCGATGTT CTTCCTGTGG GAACGTATCC AGCGACTAAG TGGGTGAAGT ACTCCAACAA AGCTGTTCTT TCTGTTCAGG TCGACAACCG CAACAGCCGG CTTATCTTGA TGAGCGCTGT CGACAACCTA ATTAAAGGAC AAGCTGGTCA AGGGGTTCAA TGTCTAAACC TGATGGCTGG ACTGCCACCA ACCACAGGGA TGTCTTTACT AACTTTCTAC CCTTAA
|
Protein sequence | MKAMESTHRS PASDISQVRG SRVAVIGATG YGGLQTIRLL EDHPQLHVTY LGGERSAGRR WSELCPFLPI PDDPEVQSPD PDKIAEYADY AVLSLPNGLA SQLAPQLLKR NVRVVDLSAD FRYRSLEQWK QVYVHEAQNL NRDDVQLCRE AVYGLPEWKG PEIAVANLVA APGCFPTASL LPLLPFLKQG LIENDGLIID AKTGTSGGGR VAKEQFLLSE ASESIMPYGV VGHRHTSEIE QLASEVAGQP IELQFTPHLV PMVRGLLATV YGRLRDPGLT AEDCTTVLKA VYRHHPCIDV LPVGTYPATK WVKYSNKAVL SVQVDNRNSR LILMSAVDNL IKGQAGQGVQ CLNLMAGLPP TTGMSLLTFY P
|
| |
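The gene sequence above is shown in coding orientation (it begins with ATG and ends with the stop codon TAA), so it can be translated directly and compared with the listed protein sequence. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may initiate translation; since this gene starts with ATG, alternative start codons are not handled. The function name `translate` is hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G; '*' marks a stop codon).
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in blocks of 10 bases; drop the spaces.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAAGGCAA TGGAAAGTAC CCATCGCTCT"))  # MKAMESTHRS
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should give a string to compare against the complete 371-residue protein.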