Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08361 |
Symbol | hisA |
ID | 4780472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 768731 |
End bp | 769501 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640084111 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001014659 |
Protein GI | 124025543 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.103566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0927273 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATCA TACCTGCAAT AGATCTACTG AATGGTAAAT GTGTTCGACT AAATCAAGGA AATTATAATG AAGTTACTAA GTTCAATAGT GATCCTGTAA AACAAGCACA AATTTGGGAA AGCAAGGGAG CAAAACGACT ACATCTTGTA GATCTTGATG GTGCTAAGAC AGGTGAGCCC ATAAATGATC TAACTATAAA AGAGATAAAA AAATCTATTA CAATACCTAT TCAACTTGGC GGTGGAATTA GGAGTATTGA TCGTGCGAAA GAATTATTCG ACATTGGAAT AGACAGAATT ATTTTAGGAA CAATTGCAAT AGAGAAGCCC GAATTAGTTA AAGACCTATC TAAAGAATAT CCAAAAAGAG TTGCAGTAGG AATTGATGCC AAAGAGGGAA TGGTAGCCAC TCGAGGTTGG TTAAAACAAA GCAAAATATC TTCTCTAGAC TTAGCAAAAC AACTTAACGA TCTTGACTTA GCGGCAATCA TATCAACTGA CATTGCTACC GATGGCACTC TAAAAGGACC TAATGTTCAA GCCTTGAGAG AAATAGCTGA GATAAGTATT AATCCAGTAA TTGCCTCAGG GGGGATAGGT TCAATAGCTG ATTTAATTTC ACTAGCAGAT TTCGCGGATG AAGGTATTGA AGGAATAATC GTAGGCAGAG CCCTATATGA CGGCTCAATA GATTTAAAGG AAGCGATTTT AACTCTAAAA AATCTTCTTC TGCAAGATGC TTTCAATGAG AAAGATAAAT TTCTTGTCTA A
|
Protein sequence | MEIIPAIDLL NGKCVRLNQG NYNEVTKFNS DPVKQAQIWE SKGAKRLHLV DLDGAKTGEP INDLTIKEIK KSITIPIQLG GGIRSIDRAK ELFDIGIDRI ILGTIAIEKP ELVKDLSKEY PKRVAVGIDA KEGMVATRGW LKQSKISSLD LAKQLNDLDL AAIISTDIAT DGTLKGPNVQ ALREIAEISI NPVIASGGIG SIADLISLAD FADEGIEGII VGRALYDGSI DLKEAILTLK NLLLQDAFNE KDKFLV
|
| |