Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1420 |
Symbol | |
ID | 5054871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1281092 |
End bp | 1282360 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468961 |
Product | anthranilate synthase component I |
Protein accession | YP_001153630 |
Protein GI | 145591628 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01820] anthranilate synthase component I, archaeal clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.150343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAGA TCCCCCTCTC CAAGCTACCG GAGCCCAGGG CCTTGGCGAA GTCCTTATAT GCCCGAGGCG AGGACTTCGT GGCGTTGCTG GAGTCGGGTG TGGGATACGC CGAGCGTAGC CGCTTCACCC TAGTGGCTTG GGGGGTGGAG GAGGAGTACG TGTCGTGGGG GGCCGACGTC TACCAAATTG TGGACAGCGC CTACAAGAGA TTGAGGAGGG GCCCCACCCC ATTCGGCGGC GAGGTGGCCA TAGGCGCTGT TGCCTACGAC GCGGTTGCCT ACCTTGAGCC TGTGTTGCTT AGGTATGGCA AGTTGGATAA GTCGTCTCCT GTCGCCTTCT TCGTAAAGCC CAGGGGCTGG GCGCTGTACG ACAAGTTGCT GGGCCGGGCT TACGTCTACG GGGAGTTGCC AAACGGCGGG GCCGCGTCGC TGGAGTCGCC GATGGTGAGG GGGCCAATCG CCGAGACCGA CGCCTCTTCG TTTAAGAGGT GGGTGGCTGA GGCGAAGAGG AGGATAGAGG AGGGGGAGAT CTTCCAGGTG GTGCTGTCGC GCCACGTGGA CTTCGCCGTG TCTGGAGACG TGTTTGCCCT ATACGCCTCG CTGGCGTCTG TCAACCCGTC GCCGTATATG TTCTTCGTCA AGTGGAGGGA CTTCCAACTG CTGGGAACCT CGCCTGAGTT GCTGGTAAAG ATCCAGGGGG ACAGGGCGGA GACGCACCCA ATTGCCGGAA CTAGGCCGAG GGGGGCCGCC GAGGATGAGG ACTTGGCGCT GGAGGAGGAG ATGCTCGCAG ACGAGAAGGA GCGGGCGGAG CACTACATGC TTGTTGACCT GGCTCGCAAC GACTTGGGCA GAGTCTGCCG GCCGGGGACT GTGAAGGTGG ATGAGCTGTT CGCTGTGGAG AAGTACAGCA GGGTGCAACA CATCGTGTCG AGGGTCTCGT GCGTCTTGGA GAAGAAGTAC ACGCCAGCAG ACGCCCTCTT CGCCACGCAC CCCGCCGGCA CTGTGTCGGG GGCGCCGAAG GTGAGGGCCA TGGAGATAAT CGCCGAGCTG GAGGACGAGC CGAGGGGTTA CTACGCCGGC TCGCTGGGGT TCCTCTCCCC GGCACTGTCC GAGTTCGCCA TAGTCATTAG GACAGCCATC GTGAAGGGGG GAGTGCTGAG GATACAGGCG GGGGCGGGGG TGGTATATGA CTCCACGCCG GAGAGGGAGT TTAGGGAAAC CGAGGCTAAG CTTAAAGCCC TTAGAGAGGC GCTGGGGCTA TGGACCTGA
|
Protein sequence | MRKIPLSKLP EPRALAKSLY ARGEDFVALL ESGVGYAERS RFTLVAWGVE EEYVSWGADV YQIVDSAYKR LRRGPTPFGG EVAIGAVAYD AVAYLEPVLL RYGKLDKSSP VAFFVKPRGW ALYDKLLGRA YVYGELPNGG AASLESPMVR GPIAETDASS FKRWVAEAKR RIEEGEIFQV VLSRHVDFAV SGDVFALYAS LASVNPSPYM FFVKWRDFQL LGTSPELLVK IQGDRAETHP IAGTRPRGAA EDEDLALEEE MLADEKERAE HYMLVDLARN DLGRVCRPGT VKVDELFAVE KYSRVQHIVS RVSCVLEKKY TPADALFATH PAGTVSGAPK VRAMEIIAEL EDEPRGYYAG SLGFLSPALS EFAIVIRTAI VKGGVLRIQA GAGVVYDSTP EREFRETEAK LKALREALGL WT
|
| |