Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_17461 |
Symbol | hisA |
ID | 4778115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1527066 |
End bp | 1527833 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087253 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001017753 |
Protein GI | 124023446 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATCA TCCCAGCCAT CGACCTACTG GACAGCGCTT GCGTGCGACT TCACCAGGGC GACTACGCGA AGGTGACTCG CTTCAGCGAG GACCCCGTTG CTCAAGCCCT GAGCTGGCAG AAGCAGGGGG CAACTCGCCT ACACCTGGTG GATCTCGATG GCGCCAAATC TGGTGAGCCG GTCAACGATT CCTGTGTGCG AGCTATTACC TCTGCCCTCA ATATTCCTGT CCAGCTCGGT GGTGGTGTGC GCACTCTCGA ACGAGCTGAA GAACTTCTCG CCTACGGCCT TGAACAAGTC ATCCTTGGCA CCGTGGCAAT AGAGCAACCC CAGCTAGTGA AACAGCTTGC CCAAAGGAAT CCCGGTCGGA TTATCGTCGG CATCGATGCC AAAAACGGCA AGGTGGCAAC GCGAGGATGG ATTAGCCAAA GCGAGGTAAA TGCAACAGAT TTAGCCGCCG ATTTCAATGC TGCCGGCATC GCCGCAATCA TCAGCACAGA CATTGCAACC GATGGAACCC TTGAAGGGCC GAACCTAGAG TCATTACGTG CAATGGCCAA CGCCAGCAGC GTTCCATTAA TCGCCTCCGG AGGCGTGGGT TGCATGGCCG ATCTACTTAG TCTGCTGGCT CTTGAACCCT ATGGAGTGAG TGGGGTGATT GTGGGAAGAG CTCTCTACGA CGGCAAGGTG GATCTAAAAG AAGCAATCCG AGCCATTGGC GATGGCCGAC TTCAGGACCC ACCAAGTAGC AAACCACTGA TGGCATGA
|
Protein sequence | MEIIPAIDLL DSACVRLHQG DYAKVTRFSE DPVAQALSWQ KQGATRLHLV DLDGAKSGEP VNDSCVRAIT SALNIPVQLG GGVRTLERAE ELLAYGLEQV ILGTVAIEQP QLVKQLAQRN PGRIIVGIDA KNGKVATRGW ISQSEVNATD LAADFNAAGI AAIISTDIAT DGTLEGPNLE SLRAMANASS VPLIASGGVG CMADLLSLLA LEPYGVSGVI VGRALYDGKV DLKEAIRAIG DGRLQDPPSS KPLMA
|
| |