Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_08601 |
Symbol | hisA |
ID | 4717565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 745680 |
End bp | 746447 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640078572 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001009251 |
Protein GI | 123968393 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.176912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTAA TACCAGCAAT TGATTTAATG AATGGTAAGT GTGTAAGGCT TTTTAAAGGC GACTTTAATA AAAGAAAAGA CTTCGCCAAA GAGCCTCATG AGCAAGCTGA ATTTTGGGAA AGAGAAGGAG CAAAATATAT ACATATAGTT GATTTGGATG CTGCAAAAAC TGGATCCCCA ACAAACGATA AATCTATAAA AAAGATTGCA AAAACAGTAA ACATACCTAT TCAAATAGGT GGGGGTATAA GGTCTCAAGA AAGGATAGAA CAATTATTTT CTTATGGTGT TGAGAAAGTT ATCATGGGAA CCTCTGCAAT AGAAAATAAA GAACTAGTTA AAGACTTATC AAATAAATTT CCTGGAAGGA TAATTGTTGG GATAGATGCA AAAGATGGAA AAGTTAGTAC AAGGGGGTGG CTTGAGCAAT CTAATATTTT TGCCACAGAT CTAGTAAAGG AGTTTTCTTC ATTTAAAATT GCTAGTTTTA TTGTTACGGA TATAAATACA GATGGGACGT TGGAAGGGAC AAATGAAGAA TTCATAAAAA GCATACTTGA AATTACAGAT ATTCCAGTAA TAGCCTCAGG AGGTGTTGGT TCAATTTCTG ATTTATTATC CTTAGTAAAA TTTGAAAACT CTGGCCTATT TGGAGTAATA GTAGGTAAAG CTCTATATGA AAATAAATTC ACGATAAACG AAGCGAATAA TGTATTGTCA GCAGAGAGAT TAAATGACAT TGATTTAAAC ACAAATTACT ACGCTTAA
|
Protein sequence | MDLIPAIDLM NGKCVRLFKG DFNKRKDFAK EPHEQAEFWE REGAKYIHIV DLDAAKTGSP TNDKSIKKIA KTVNIPIQIG GGIRSQERIE QLFSYGVEKV IMGTSAIENK ELVKDLSNKF PGRIIVGIDA KDGKVSTRGW LEQSNIFATD LVKEFSSFKI ASFIVTDINT DGTLEGTNEE FIKSILEITD IPVIASGGVG SISDLLSLVK FENSGLFGVI VGKALYENKF TINEANNVLS AERLNDIDLN TNYYA
|
| |