Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_08571 |
Symbol | hisA |
ID | 4911190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 744118 |
End bp | 744885 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640160439 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001091081 |
Protein GI | 126696195 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.457248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTAA TACCAGCAAT TGATTTAATG AATGGTAAGT GCGTAAGGCT TTTTAAAGGC GACTTTAATA AAAGAAAAGA CTTTACCAAA GAGCCTCATG AGCAAGCTAA ATTTTGGGAA AGCGAAGGGG CAAAATGTAT ACATATAGTT GATTTGGATG CTGCAAAAAC TGGATCCCCA ACAAACGATA AATCAATAAA AAAGATTGCA AAAACAGTTA ACATACCTAT TCAAATAGGT GGGGGGATAA GGTCTCAAGA AAGGATAGAA CAATTATTTT CTTATGGTAT TGAGAAAGTT ATCATGGGAA CCTCTGCAAT AGAAAATAAA GAACTAGTTA AAGACTTATC AAATAAATAT CCTGGAAGGA TAATTGTTGG AATAGATGCA AAAAATGGAA AAGTTAGTAC AAGGGGTTGG CTTGAGCAAT CTAATATTTT TGCCACAGAT CTAGTAAAAG AGTTTTCTTC ATTTAAAATT GCTAGTTTTA TTGTTACAGA TATAAATACA GATGGGACGT TAGAAGGGAC AAATGAAGAA TTCATAAAAA GCATACTTGA AATTACAGAT ATTCCAGTAA TAGCCTCAGG AGGTGTTGGT TCAATTTCTG ATTTATTATC CTTAGTAAAA TTTGAAAACT CTGGACTTTT TGGAGTAATT GTAGGTAAAG CTCTATATGA AAATAAATTC ACGATAAAAG AAGCTAATAA TGTATTGTCA TCAGAGAGAT TAAATGACTT TGATTTAAAC AGAAATTACT ACGCCTAA
|
Protein sequence | MNLIPAIDLM NGKCVRLFKG DFNKRKDFTK EPHEQAKFWE SEGAKCIHIV DLDAAKTGSP TNDKSIKKIA KTVNIPIQIG GGIRSQERIE QLFSYGIEKV IMGTSAIENK ELVKDLSNKY PGRIIVGIDA KNGKVSTRGW LEQSNIFATD LVKEFSSFKI ASFIVTDINT DGTLEGTNEE FIKSILEITD IPVIASGGVG SISDLLSLVK FENSGLFGVI VGKALYENKF TIKEANNVLS SERLNDFDLN RNYYA
|
| |