Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1763 |
Symbol | |
ID | 4617869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 1601461 |
End bp | 1602558 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639784848 |
Product | chorismate synthase |
Protein accession | YP_931256 |
Protein GI | 119873249 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCT TTGGAAGAGA GTTCCGGATA ACCACCTTCG GCGAATCCCA CGGCAAGGCC ATAGGCGTGG TGATAGACGG CGTGCCGGCT GGGCTGGAGC TGACGGAGGA AGACATCAAG AGGGAGCTAG AGAGGAGGAT GTTCTGTCAC ATACCAGTCC TCAACCCGAG GTGCGAGCCG GAGGAGGTGG AGATACTATC CGGCGTGAAG GAGGGCTACA CCCAGGGCAC CCCCATAGCC GTCGTGATAT GGAACAGACG CGTCATCTCC AGCTACTACG AGGAGCTCTG GATGAAGCCC AGGCCGGGCC ACGCCGACTT CGCCTACTAC CTCAAATACG GCAGATACTA CGACCACAGG GGAGGAGGCA GAGCCTCCGG TAGAACAACT GCGGCGGTGG TAGCGGCGGG GGCAGTGGCC AAGAAGATAC TCGCCCTAGC CGGCGCCGAG GTAGCCGGCC ACATAGTCGA GCTAGGCGGC GTCGAGATAA ACGCCAGCTA CACCTACGAA GACGTCAAAA AAAGCTGGGA AAGGCCCCTC CCCGTGGTAG ACCAACAAGC CCTAGACAAG ATGTTGGAAA AAATCCAAGA GGCGGCGGCA AGAGGAGACA GCATAGGCGG CGGGGTGGAG GTCTGGGCCG TCGGGGTGCC GCCCGGCCTG GGAGAACCCC ACTTCGGCAA GATAAAAGCC GACATAGCCG CCGCCGCCTT CTCCATACCA GGCGCCATAG CGCTCGACTG GGGCATGGGC AGAGCACTGG CAAAGATGTG GGGAAGCGAG GCCAACGACC CGATAACAGT CGCCAACGGC AGACCAACCC TCGCCACAAA CAAAATCGGC GGCGTCCTCG GCGGAATAAC TGTGGGAACC CCCATATACT TCAGAGTCTG GTTCAAGCCC ACCCCCTCCG TCAGAAAGCC ACAGCAGACC GTAGATCTAG CCAAGATGGA GCCGACGACG ATAGAGTTCA AGGGGAGATA CGACGTGTCC ATAGTCCCCA AAGCCCTCGT AGCGCTAGAG GCCATCACGG CGGTAACACT CGCCGACCAC CTACTCAGGG CAGGGCTCAT AAGAAGAGAC AAGCCACTAG AGAAATAA
|
Protein sequence | MNTFGREFRI TTFGESHGKA IGVVIDGVPA GLELTEEDIK RELERRMFCH IPVLNPRCEP EEVEILSGVK EGYTQGTPIA VVIWNRRVIS SYYEELWMKP RPGHADFAYY LKYGRYYDHR GGGRASGRTT AAVVAAGAVA KKILALAGAE VAGHIVELGG VEINASYTYE DVKKSWERPL PVVDQQALDK MLEKIQEAAA RGDSIGGGVE VWAVGVPPGL GEPHFGKIKA DIAAAAFSIP GAIALDWGMG RALAKMWGSE ANDPITVANG RPTLATNKIG GVLGGITVGT PIYFRVWFKP TPSVRKPQQT VDLAKMEPTT IEFKGRYDVS IVPKALVALE AITAVTLADH LLRAGLIRRD KPLEK
|
| |