Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_07851 |
Symbol | hisA |
ID | 4719450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 708523 |
End bp | 709290 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640080464 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001011101 |
Protein GI | 123966020 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.278475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAA TTCCAGCAAT AGATTTGATG AATGGTAAAT GTGTAAGACT TTTTAAAGGT GACTTTAACA AGAGAAAAGA CTTCTCTAGA AAACCTTATG AGCAAGCTAA ATATTGGGAA GAACAGGGTG CAAAATGTAT TCATATCGTT GATTTGGATG CCGCTAAAAG CGGATATCCA TCAAATGATC AATCAATTAA AAAAATTGCC AAAGAAGTCA ATATTCCTAT CCAAATTGGA GGGGGTATAA GATCTTTAGA AAGGATTGAA CAATTATTTT CATACGGTGT AGATAAGGTA ATAATGGGTA CCTCAGCTAT TGAAAATAAA GAACTAGTTA AAAATCTATC AACCAAATTT CCAAGAAGAA TAATTATTGG AATTGATGCA AAAGATGGGA AAGTAAGCAC TAGAGGATGG ATAGAACAAT CAGATGTATT AGCCACAGAT TTAGTAAAGG AATTTTCCAA GTTTGAAATT GCCAGTTTTA TTGTCACCGA CATCAATACT GATGGAACCT TAGAAGGGAC AAATGAAGTT TTTATAAAAA AGATTCTTGA AATTACTGAT ATTCCTGTTA TCGCTTCAGG AGGAGTAGGT GCAATTTCTG ATTTATTGTC ATTAACCAAA TTTGAACATT TAGGACTATG TGGAGTAATA GTAGGAAAAG CACTTTATGA AAATAAATTT AAGATCAGTG AAGCTAATAA CATATTATCT CCTGAAAGAT TACAAGATAT TCCAATTAAT AAGGACTATT TCGCTTGA
|
Protein sequence | MKLIPAIDLM NGKCVRLFKG DFNKRKDFSR KPYEQAKYWE EQGAKCIHIV DLDAAKSGYP SNDQSIKKIA KEVNIPIQIG GGIRSLERIE QLFSYGVDKV IMGTSAIENK ELVKNLSTKF PRRIIIGIDA KDGKVSTRGW IEQSDVLATD LVKEFSKFEI ASFIVTDINT DGTLEGTNEV FIKKILEITD IPVIASGGVG AISDLLSLTK FEHLGLCGVI VGKALYENKF KISEANNILS PERLQDIPIN KDYFA
|
| |