Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_09651 |
Symbol | hisA |
ID | 5731296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 857879 |
End bp | 858646 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641285332 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001550850 |
Protein GI | 159903506 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.505677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00168109 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCTTA TCCCAGCTAT AGATTTACTA GAAGGCAATT GTGTAAGGCT AGTTCAAGGT AATTACAATA AGGTAACTAA ATTCAACAGC GATCCTGTAA GTCAAGCCCT TAGATGGGAA GACATGGGTG CAAGCAGATT GCATATAGTC GATCTGGATG CTGCAAGGCA AGGTTTTTCA TCTAATGATG ATGTAATCAA ACAAATAGCT AAAAGCCTAT CTATCCCAAT ACAAATAGGA GGAGGGATTA GAACAAGTAA AAGGGCTAAA GAATTATTAG ATTATGGAAT AGATAGAGTG ATTATTGGTA CGGCAGCATT AGAGGATCCA AGGCTTGTAG AGGACCTAGC CTCTGCTTTT CCGAAAAAAA TTGTATTAGG AATAGATGCA AAGGAAGGCA AAGTAGCAAC TAGAGGCTGG ATAGAACAAA GCGATGTGAG AACTGAAGAC CTTATAAAAC AATTTTCCAA CGCGAAAATA GCTGCAATTA TTTCAACAGA CATTTCTACT GATGGAACTT TAGAAGGTCC AAATTTAAAA AGTTTAACCT CTGTTGCGAA AGTCTCAAAT GCACCCGTAA TAGCTTCTGG AGGGATAGGC TCTCTAGCTG ACCTAATTTC ATTGACAACG CTAGAAAAGG CTGGTGTTAC TGGTGTTATA GTAGGTCGAG CGCTTTATGA CAACAAATTT TCTTTAGAAG AAGCTATAAA GGTCTTGTTA AATATTGACC TTCAAGACCA ACCTTTTAAT GCAAAAAACA TAGCCTAA
|
Protein sequence | MELIPAIDLL EGNCVRLVQG NYNKVTKFNS DPVSQALRWE DMGASRLHIV DLDAARQGFS SNDDVIKQIA KSLSIPIQIG GGIRTSKRAK ELLDYGIDRV IIGTAALEDP RLVEDLASAF PKKIVLGIDA KEGKVATRGW IEQSDVRTED LIKQFSNAKI AAIISTDIST DGTLEGPNLK SLTSVAKVSN APVIASGGIG SLADLISLTT LEKAGVTGVI VGRALYDNKF SLEEAIKVLL NIDLQDQPFN AKNIA
|
| |