Gene Information |
Locus tag | Pars_2097 |
Symbol | |
ID | 5054218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1872651 |
End bp | 1874141 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640469647 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_001154295 |
Protein GI | 145592293 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
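The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose span includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1872651, 1874141)
print(length)                  # 1491
print(protein_length(length))  # 496
```

Both values agree with the Gene Length and Protein Length rows of the record.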
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.588766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACG AGCTCGGCGT AGACTATATA GAGGGCGGAT GGCCCTACTC CAACCCCAAA GACCTGGACT TCTTCAAGGC GATGAGGGAA TACCCACTCG CAAAGGCCAA GCTAGCCGCT TTTGGAAGCA CGAGGAGAAA GGGGGTGAAG CCTGAGAAAG ACGAAAACCT AAACGCGATA GTAAAGGCGG ATGTCCCCGT TGCGGTTATC TTCGGCAAGA GCTGGACTCT CCACGTGGAG AAGGTGCTGG AAGCCACCTG GGAGGAGAAC TTGGCTATGA TAGCGGAGAG CGTGGAGTAC CTAAAATCCC ACGGCATGGA GGTGATCTAC GATGCCGAGC ACTTTTTCCA GGGGTATCAG GAGGACCCGG AGCGGGCGCT GGCCTCTATA GAGGCTGCCT GGAGGGCGGG GGCTAGGGTT GTGGTGCTGG CCGACACCAA CGGCGGGACT CCTACGCACG AGGTGTATAG AATAACAGCA GAGGTGAAGA GGAGGTTCCC CGCGATGCCG CTGGGAGCCC ACATGCACAA CGACATCGGT TGCGCCGTGG CTAACACCCT AATGGCAGTG GCCGCCGGGG CTAGGCACGT CCAGGGAACA ATAAACGGAG TGGGTGAGCG GACGGGCAAT GCGGACCTGA CCGCGGTTTT GCCGACGCTG GAGCTGAAGA TGGGCTTCAA GGTCCTGGGC GGCTCCCCGC CCCGGGTTAA GTTCGCCAAG CTGAGGGAGG TGTCACGCTT CGTCTACGAG GCCTTGGGGA TGAGCCCAAA CCCATATCAG CCCTACATCG GCGACTACGC CTTTGCCCAC AAGGGAGGGG TACACGCCGC GGCTGTGATG AAGGTGCCCA GGGCATACGA GCACATAGAC CCCGAGCTGG TGGGCAACAG GAGGGTCTTC GTCGTGTCGG AGATGGCCGG CGCCGCCAGC GTGGTGCTGA AGGCGGCGGA GGAGCTGGGG ATATCGCTAG ACAAGCGCCA GGAGGCTGTG AGGGCGGCGC TGGAGGAGAT AAAGGCGCTG GAGAGGCAGG GCTACTCCTT TGACTCGGCC CCGGCCTCCG CCATGCTGAT ACTGCTTAGG CACATGGGGC TCTACCAGGA GAGGTTTAGG CTAGTGGAGT GGCGCGTGGT CACCGGCCCC ACCAACACGT CCTACGCCGT GGTGAAGGTA TGGGTAAGCG GCGAGGTAAA GCTGGAGGCC GGCGAGGGCG TCGGCCCCGT ACACGCCGTC GACGTTGCGC TGAGGCGCGC GCTGGTGTCA GCCTTCCCGG AGCTGGCGGA GGTTAGGCTG AGGGACTACA AGGTGGTGCT CCCCACTGCG GTAAGGAGCA CGGAGAGCGT GGTGAGGGTC ACCGTTGAGT TTACCGACGG CGGGAGGATA TGGCGCACAG TCGGCGTATC CAGCAACGTC GTCGAGGCGT CGATCAAGGC GCTGGTCGAC GGCTACGACT TCGCCCTACA GCAGAGGCAG TTGCAAAACC GCAAGGCCTA G
|
Protein sequence | MLDELGVDYI EGGWPYSNPK DLDFFKAMRE YPLAKAKLAA FGSTRRKGVK PEKDENLNAI VKADVPVAVI FGKSWTLHVE KVLEATWEEN LAMIAESVEY LKSHGMEVIY DAEHFFQGYQ EDPERALASI EAAWRAGARV VVLADTNGGT PTHEVYRITA EVKRRFPAMP LGAHMHNDIG CAVANTLMAV AAGARHVQGT INGVGERTGN ADLTAVLPTL ELKMGFKVLG GSPPRVKFAK LREVSRFVYE ALGMSPNPYQ PYIGDYAFAH KGGVHAAAVM KVPRAYEHID PELVGNRRVF VVSEMAGAAS VVLKAAEELG ISLDKRQEAV RAALEEIKAL ERQGYSFDSA PASAMLILLR HMGLYQERFR LVEWRVVTGP TNTSYAVVKV WVSGEVKLEA GEGVGPVHAV DVALRRALVS AFPELAEVRL RDYKVVLPTA VRSTESVVRV TVEFTDGGRI WRTVGVSSNV VEASIKALVD GYDFALQQRQ LQNRKA
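The sequence fields are printed in blocks of ten characters, so whitespace has to be stripped before any computation. The sketch below shows one way to compute GC content and translate a gene sequence. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which start codons are permitted); the helper names are hypothetical, and the example uses only the first 20 bases of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


prefix = "ATGCTGGACG AGCTCGGCGT"  # first two blocks of the gene sequence
print(translate(prefix))  # MLDELG
```

The translated prefix matches the first six residues of the protein sequence. Running `gc_content` and `translate` on the full gene sequence is a way to cross-check the GC content and Protein sequence rows.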