Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2234 |
Symbol | |
ID | 5056395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2001690 |
End bp | 2002919 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469787 |
Product | 2-methylcitrate synthase/citrate synthase II |
Protein accession | YP_001154432 |
Protein GI | 145592430 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.709963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC AAACTGTACA AATTAGGACA TCGGGAAAAG TTTTGCAATC GCCATGCGGC CCCATTGTAC ACGGCCTTGA GGATGTACTA ATAAAAAACA CCACAATCAG CGACATAGAC GGGGAGAAGG GCATCTTGTG GTACAGGGGG TATAGAATAG AGGACTTGGC TAAGTTCTCA AACTACGAAG AGGTCTCGTT CTTAGTCCTA TACGGCAGGT TGCCCACTAG GTCCGAGTTG AAAGAGTATG AGAGAAGACT AAAATCTTCG AGGGACTTGC ACCCAGCCAC AGTGGAGGTA ATAAGGGCGT TGGCGAAGGC GCACCCAATG TTCGCTCTCG AAGCCGCTGT CGCGGCTGAG GGGGCCTACG ACGAAGATAA CCAGAAGCTG ATCGAGGCGT TGAGAGTGGG GAGGTACAAG GCAGAGGAGA AGGAGTTGGC CTACAGAATT GCGGAGAAAC TCATAGCCAA GTTGCCGACT ATTGTGGCAT ATCACTACCG GTTTTCCAAG GGTCTGGAGC TGGTGAGGCC TCGGGACGAC TTATCACACG CCGCTAACTT CCTCTACATG ATGTTCGGCA AAGAGCCAGA CCCCCTGGCG GCCAGGGGCA TCGATCTATA CTTAATCCTA CACGCAGACC ACGAGGTACC TGCCAGCACC TTCACCGCCC ACGTGGTGGC CTCTACGCTA AGCGATCTGT ACTCCTCTGT GGTTGCCGCA ATCGCGGCGC TTAAGGGCCC GCTCCACGGC GGGGCCAACG AGATGGCTGT GAGGAACTAC CTAGAAATCG GAGATCCCTC CAAGGCAAAA GAACTGGTGG AAGCGGCTAC TAAGCCAGGC GGCCCTAAGC TAATGGGTGT GGGACATAGA GTCTACAAGG CGTACGATCC CAGGGCCAGG ATCTTTAAGG AGTTTTCCAG AGACTACGTG GCCAAGTTCG GAGATCCGAA GAACCTATTC GCCGTAGCCA GCGCCATAGA GCACGAGGTG CTGAATAACC CGTACTTCCA GCAGAGGAAG CTGTACCCGA ACGTCGACTT CTGGTCCGGC ATCGCGTTCT ACTACATGGG CGTGCCCTAC GAGTACTTCA CCCCCATATT CGCAGTATCG AGAGTAGTGG GCTGGGTGGC GCACATCCTC GAATATTGGG AGAACAATAG GATATTCAGA CCGCGTGCGT GCTACGCAGG TCCACACGAC CTACAGTACA TACCAATTGA CCAAAGATAA
|
Protein sequence | MSEQTVQIRT SGKVLQSPCG PIVHGLEDVL IKNTTISDID GEKGILWYRG YRIEDLAKFS NYEEVSFLVL YGRLPTRSEL KEYERRLKSS RDLHPATVEV IRALAKAHPM FALEAAVAAE GAYDEDNQKL IEALRVGRYK AEEKELAYRI AEKLIAKLPT IVAYHYRFSK GLELVRPRDD LSHAANFLYM MFGKEPDPLA ARGIDLYLIL HADHEVPAST FTAHVVASTL SDLYSSVVAA IAALKGPLHG GANEMAVRNY LEIGDPSKAK ELVEAATKPG GPKLMGVGHR VYKAYDPRAR IFKEFSRDYV AKFGDPKNLF AVASAIEHEV LNNPYFQQRK LYPNVDFWSG IAFYYMGVPY EYFTPIFAVS RVVGWVAHIL EYWENNRIFR PRACYAGPHD LQYIPIDQR
|
| |