Gene Pars_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2234 
Symbol 
ID5056395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2001690 
End bp2002919 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content54% 
IMG OID640469787 
Product2-methylcitrate synthase/citrate synthase II 
Protein accessionYP_001154432 
Protein GI145592430 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.709963 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC AAACTGTACA AATTAGGACA TCGGGAAAAG TTTTGCAATC GCCATGCGGC 
CCCATTGTAC ACGGCCTTGA GGATGTACTA ATAAAAAACA CCACAATCAG CGACATAGAC
GGGGAGAAGG GCATCTTGTG GTACAGGGGG TATAGAATAG AGGACTTGGC TAAGTTCTCA
AACTACGAAG AGGTCTCGTT CTTAGTCCTA TACGGCAGGT TGCCCACTAG GTCCGAGTTG
AAAGAGTATG AGAGAAGACT AAAATCTTCG AGGGACTTGC ACCCAGCCAC AGTGGAGGTA
ATAAGGGCGT TGGCGAAGGC GCACCCAATG TTCGCTCTCG AAGCCGCTGT CGCGGCTGAG
GGGGCCTACG ACGAAGATAA CCAGAAGCTG ATCGAGGCGT TGAGAGTGGG GAGGTACAAG
GCAGAGGAGA AGGAGTTGGC CTACAGAATT GCGGAGAAAC TCATAGCCAA GTTGCCGACT
ATTGTGGCAT ATCACTACCG GTTTTCCAAG GGTCTGGAGC TGGTGAGGCC TCGGGACGAC
TTATCACACG CCGCTAACTT CCTCTACATG ATGTTCGGCA AAGAGCCAGA CCCCCTGGCG
GCCAGGGGCA TCGATCTATA CTTAATCCTA CACGCAGACC ACGAGGTACC TGCCAGCACC
TTCACCGCCC ACGTGGTGGC CTCTACGCTA AGCGATCTGT ACTCCTCTGT GGTTGCCGCA
ATCGCGGCGC TTAAGGGCCC GCTCCACGGC GGGGCCAACG AGATGGCTGT GAGGAACTAC
CTAGAAATCG GAGATCCCTC CAAGGCAAAA GAACTGGTGG AAGCGGCTAC TAAGCCAGGC
GGCCCTAAGC TAATGGGTGT GGGACATAGA GTCTACAAGG CGTACGATCC CAGGGCCAGG
ATCTTTAAGG AGTTTTCCAG AGACTACGTG GCCAAGTTCG GAGATCCGAA GAACCTATTC
GCCGTAGCCA GCGCCATAGA GCACGAGGTG CTGAATAACC CGTACTTCCA GCAGAGGAAG
CTGTACCCGA ACGTCGACTT CTGGTCCGGC ATCGCGTTCT ACTACATGGG CGTGCCCTAC
GAGTACTTCA CCCCCATATT CGCAGTATCG AGAGTAGTGG GCTGGGTGGC GCACATCCTC
GAATATTGGG AGAACAATAG GATATTCAGA CCGCGTGCGT GCTACGCAGG TCCACACGAC
CTACAGTACA TACCAATTGA CCAAAGATAA
 
Protein sequence
MSEQTVQIRT SGKVLQSPCG PIVHGLEDVL IKNTTISDID GEKGILWYRG YRIEDLAKFS 
NYEEVSFLVL YGRLPTRSEL KEYERRLKSS RDLHPATVEV IRALAKAHPM FALEAAVAAE
GAYDEDNQKL IEALRVGRYK AEEKELAYRI AEKLIAKLPT IVAYHYRFSK GLELVRPRDD
LSHAANFLYM MFGKEPDPLA ARGIDLYLIL HADHEVPAST FTAHVVASTL SDLYSSVVAA
IAALKGPLHG GANEMAVRNY LEIGDPSKAK ELVEAATKPG GPKLMGVGHR VYKAYDPRAR
IFKEFSRDYV AKFGDPKNLF AVASAIEHEV LNNPYFQQRK LYPNVDFWSG IAFYYMGVPY
EYFTPIFAVS RVVGWVAHIL EYWENNRIFR PRACYAGPHD LQYIPIDQR