Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1914 |
Symbol | |
ID | 4617277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 1728850 |
End bp | 1730118 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639785005 |
Product | anthranilate synthase |
Protein accession | YP_931404 |
Protein GI | 119873397 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0888242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCC CGCTGTCTAA GCTCCCACCG CCGAGAGAGC TAGCCCACGG GCTGTATCAG TCGGGGGAGG AGTTCGTGGC TCTTCTAGAG TCGGGCCAGG GCTTCGCAGA GAGGGCGAGG TTCACCCTCG TGGCGTGGGG GGTGGAGAGG GCGTACGTAT CCTCTGGGCC CGATCTACAA CAAGTGCTCT ACTTAGCACA AAGGGAGTTG AAAGCGGATG GAGGGCCCTT CGGCGGCGAT GTGTTAATCG GGGCTCTGAC CTACGAGGCA TCGTACTACA TGGAGCCTCT GTTGCTTAGG TATAACAAGG TAGACCGGTC TATCCCGGCG GCGTTTCTGG TTAAGCCCAG GGGCTACATC CTCTACGACA AGATGCTGGG GAGGGGCTAC CTGAGGGGGG AAATGCCGAA GGTTTCTGTG GAACGGAGAG AGACCAAGGT GAGGGGGCCG GTGGCCATGA CCGACCCGAA CCGCTTCAAG AGCTGGGTGG CAGAGGGGAG GGAGAGGATC GCAGCTGGGG AGATCCTCCA AGTGGTGCTC TCCAGGTGGG TAGACTACAG AGCGGAGGGG GACCTCTTCC CTCTGTACAA GGCGCTGGCA GAGGAGAACC CCTCGCCGTA TATGTACTTC GTTAAATACG GCGATATCCA CTTGATTGGG ACGTCGCCTG AGCTGTTGGT AAAGGTGCAG AGCGGCCGCG TGGAGACCCA CCCAATCGCC GGGACTAGGC CAAGGGGCGC CACCGAGGAG GAGGATCTAG CGCTGGAGGA AGATATGCTC AGCGACGAGA AGGAGCTAGC TGAACACATC ATGCTCGTGG ATCTGGCTAG GAACGACATC GGGAGGGTGT GCCAGCTGGG GTCTGTCAAG GTGGAAGAGC TGTTCGCCGT GGAGAAATAC AGCAGAGTGC AACACATAGT GTCTAGGGTC ATGGGCGTCA TGGATAGAAG GTTCACCCCT GTCGACGCCC TCTTGGCCAC CCACCCGGCG GGCACCGTGT CCGGCGCCCC CAAGGTAAGG GCTATGGAGA TAATCGCCGA GCTTGAGGAC GAGCCTCGGA GGTTCTACGC AGGGGCCGTG GGCTTCATCT CGCCGTCTCT CCTCGAGTTT GCCATAGTCA TAAGGACTAT AGTGGCCATG GGCGACTCCC TCCGTATACA AGCCGGGGCG GGGGTTGTGT ATGACTCCAC GCCCGAGCGT GAGTTTAGAG AGACCGAGTC TAAGCTTGCA GCGCTCAGAG CCGTCGTGGA GGGTGGGCCA TGGACTTGA
|
Protein sequence | MKIPLSKLPP PRELAHGLYQ SGEEFVALLE SGQGFAERAR FTLVAWGVER AYVSSGPDLQ QVLYLAQREL KADGGPFGGD VLIGALTYEA SYYMEPLLLR YNKVDRSIPA AFLVKPRGYI LYDKMLGRGY LRGEMPKVSV ERRETKVRGP VAMTDPNRFK SWVAEGRERI AAGEILQVVL SRWVDYRAEG DLFPLYKALA EENPSPYMYF VKYGDIHLIG TSPELLVKVQ SGRVETHPIA GTRPRGATEE EDLALEEDML SDEKELAEHI MLVDLARNDI GRVCQLGSVK VEELFAVEKY SRVQHIVSRV MGVMDRRFTP VDALLATHPA GTVSGAPKVR AMEIIAELED EPRRFYAGAV GFISPSLLEF AIVIRTIVAM GDSLRIQAGA GVVYDSTPER EFRETESKLA ALRAVVEGGP WT
|
| |