Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08751 |
Symbol | |
ID | 4779878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 812773 |
End bp | 813519 |
Gene Length | 747 bp |
Protein Length | 248 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640084150 |
Product | imidazole glycerol-phosphate synthase |
Protein accession | YP_001014698 |
Protein GI | 124025582 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0107] Imidazoleglycerol-phosphate synthase |
TIGRFAM ID | [TIGR00735] imidazoleglycerol phosphate synthase, cyclase subunit |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.236803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000146373 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAATTA TAGGAAGATT GGATGTAAAG AATAATCATG TTATAAAAGG GATTCACTTG GAAGGTCTTA GGAAAGTCGG AGATCCACAA GAGTTAGCAG TAGAATACTA TAAACAAGGA ATTGAAGAAA TAGTTTTCAT GGATGCAGTC GCAAGTCTAT ATAATAGAAA TAATTTATTT CACATAATAC AAAAGGCGTG TGAGAAAGTT TTTGTCCCAA TTGCCTTAGG TGGCGGATTA AGATCCTTGG ATGATGTTTC TAAAGCTTTG AACGCAGGTG CAGATAAGGT AGTTATAAAT ACGGCATTAG TAAATAGAAT TGATTTAGCT AAAGAAATAG CACAAAAGTA TGGTTCGCAA TGTTTAGTGG GTTCTATCGA AGCTAAAAGA GAAAATGATT CTTGGAGAGT TTACGTTGAT AATGGTAGAG AGCCAACTAA ACACAATGTA GTTGACTGGG CAATCAAACT TAAAGAAGCT GGTGTAGGAG AAATACTTCT AACATCTATT GATCAAGAAG GAACTTCAAA GGGTTTTGAT ATTGAACTTA TAAAACAAAT AAATGAATCA ATAAGATGTC CAATAATTGC AAGTGGAGGT TATGGAAATA AAAAACATCT TGCTGATTTA TTAAAAATTG TTGAACCTTC TGCTATTGCT ATCGCAGGCT CTTTACATTA CAAGAAAGAT ACCGTAAGTA GTATAAAACA AGAAATTAAT TCTTTGTTTA TACAAGAAAA AAAATGA
|
Protein sequence | MRIIGRLDVK NNHVIKGIHL EGLRKVGDPQ ELAVEYYKQG IEEIVFMDAV ASLYNRNNLF HIIQKACEKV FVPIALGGGL RSLDDVSKAL NAGADKVVIN TALVNRIDLA KEIAQKYGSQ CLVGSIEAKR ENDSWRVYVD NGREPTKHNV VDWAIKLKEA GVGEILLTSI DQEGTSKGFD IELIKQINES IRCPIIASGG YGNKKHLADL LKIVEPSAIA IAGSLHYKKD TVSSIKQEIN SLFIQEKK
|
| |