Gene Information |
Locus tag | P9301_09671 |
Symbol | argC |
ID | 4911690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 834362 |
End bp | 835417 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640160551 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_001091191 |
Protein GI | 126696305 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
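The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 834362
end_bp = 835417

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1056, matching "Gene Length"
print(protein_length)  # 351, matching "Protein Length"
```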
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTTG CAATAGTAGG TGCTACAGGT TACGGCGGTA TTCAAGCGGT AAATCTTTTA AAGAAAAATA AAAATTACAA AATTTCATTT TTAGGAGGTA ATAAAACATC TGGATTAAAA TGGAATGATA ATTTTCCTTT TATTTATCTT GATAATGATC CTTATATAGA AGAAATTTCA GTTGATAATA TTTCAAAAAA TGCTGATGTT GCTTTGCTTT GCTTACCAAA TGGCTTATCT TCAACATTGA CAAGGAAATT ATTAGATAAA GGACTTAAAG TTATTGATTT ATCTGCTGAT TATAGATATA AGTCTTTAGA TGAATGGAAA AAAGTATATT CCAAAGAAGC TGCTATTTAT AAAAGGAATG ATGATGATTT ATGTAAAGAA GCAGTTTACG GTCTTCCTGA AATAAATAAA GAAGCCATTT CAAAAGGAAG ATTAATTGCC TGTCCAGGAT GTTATCCAAC ATCTGCTCTT ATTCCATTGG CTCCTTATCT CTCTCAAGGA ATTATTGAAA ATGAAGGTAT AGTTATTGAC TCTAAAAGCG GAACTTCTGG AGGTGGTCGA GAACCAAACC AAAAGCTACT CTTATCAGAA TGTGGAGAAG GTCTGTCAGC ATATGGATTG ATAAACCATA GACATACCTC AGAGATCGAG CAAGTGGCAT CATTAATTTC TGGAACTAAA ATTGAACTGC TTTTTACACC TCATTTAGTC CCAATCTCAA GGGGTATGCA TTCGACTATA TATGGGAGAT TAAGAGATCC AGGATTAACT TCTGATGACT GCAGGATTCT TCTGGATAAT TATTATAGAA ATTTTAAAAA TATTAAAGTC TTACCTGTAG ATACATTCCC ATCAACAAAA TGGGTTAAAA ATACAAACCA AATATTACTT TCTGTTAAAG TTGATAATCG AAATGGAAGA ATAATTATAT TGTCTGTAAT TGATAATTTG TTAAAAGGTC AGACTGGGCA GGCAATTCAA AATTTAAATA TTATGAGTGG ATTTTCAATG GATGAAGGTC TTGATTTAAC TAATAATTTT CCATAA
|
Protein sequence | MNVAIVGATG YGGIQAVNLL KKNKNYKISF LGGNKTSGLK WNDNFPFIYL DNDPYIEEIS VDNISKNADV ALLCLPNGLS STLTRKLLDK GLKVIDLSAD YRYKSLDEWK KVYSKEAAIY KRNDDDLCKE AVYGLPEINK EAISKGRLIA CPGCYPTSAL IPLAPYLSQG IIENEGIVID SKSGTSGGGR EPNQKLLLSE CGEGLSAYGL INHRHTSEIE QVASLISGTK IELLFTPHLV PISRGMHSTI YGRLRDPGLT SDDCRILLDN YYRNFKNIKV LPVDTFPSTK WVKNTNQILL SVKVDNRNGR IIILSVIDNL LKGQTGQAIQ NLNIMSGFSM DEGLDLTNNF P
|
| |
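The gene and protein sequences above are related through translation table 11. As a minimal sketch, the snippet below translates the first 30 bases of the gene sequence and recovers the first ten residues of the protein sequence. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code. The two tables differ only in which codons may act as start codons, which does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI "TCAG" order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces used to group bases in tens and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAATGTTG CAATAGTAGG TGCTACAGGT"
print(translate(prefix))  # MNVAIVGATG
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the Protein sequence and GC content fields to be checked against the record.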