Gene Pars_2175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2175 
Symbol 
ID5054791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1944637 
End bp1945725 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID640469727 
Productphosphoribosylaminoimidazole carboxylase 
Protein accessionYP_001154373 
Protein GI145592371 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.470665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTCT ACGTCCTCGG CGGGGGTCAG CTGGCGCTTA TGATGTGCTG GGAGTCGCAG 
AGGGTCCCGG TACACTTCTC GGTGTACGAC CCCGACCCCT CCGCCCCGGC CTACAGATGC
GCGAAGAGGC CCTCAGACCC GCTTGAGGAG GTGGACAAGG CCGATGCCGT CACCTTCGAA
TTTGAGAACG TGGACATCTC GCTGGCGGAG CGGGCTGAGA AGCTGGGCAA GCTTAGGCCC
CCGCTGGCCT ACCTGAAAGT TAAGAAGAGC AGGATCGAGG AGAGAGCCCT CATGGACTCT
ATCGGCGTGC CCACTGTGCC GTGGCGCCGG GCGGCTAATT GGGAGGAGGC TATTAAAATA
GCGGAGTCCA TGGGTAGGGC ATATGTAAAG GTGCCCACCG GCGGCTACGA CGGGAAGGGG
CAGTACATAT ACCCCCACGA GGCCGCGTTG ATAAGGGGGC TGGCTGGGGA GCTCCTGGTA
GAGGAGTACG TAGACATCAG GAGGGAGTTC TCCATTGTGG CCGCGAGGGC GGAGAACGGC
GATGTCTACT TCTACCCACC TGCCGAGAAC TTCTACGTAC ACGGCATCTT GGTCTGGAAC
TACGCTCCGA CAAAAGTGCC GGAAGAGGCG TATGAATACG TCAACCGCAT ATTGGAGTGG
GGGAAGTACG TTGGGGTCAT CGCAGTGGAG TTCTTCGAGG CTAGGGACGG CCGCGTTTTG
GTCAACGAGA TAGCCCCCAG GGTCCACAAC ACGGGGCACT GGACCCTGGA GACAGACGCC
AGCCAGTTCG AGAACCACGT CCGCGCCGTC TTAGGCCTCC CGCTGAGGAG ACCCCGCGCT
ATGGCCCCTA CGGCGATGGT AAACATCTTG GGCGTCGGCC TGGGAAAGCT ACCGCTGGCG
GAGCTGGAGC GGCGGGGGCG GGTGTACTGG TACTACAAGG CTGAGGCTAG GCCGAGGCGG
AAAATGGGCC ACCTCAACAT AACGGCTGGG TCAGTGGAGG AGGCGATCAC AAAGGCGAGG
GAGGCGCTGA GGCTGATATA TGGAGCGGAT TTCCCAAGGC TTGTGATGAG GTCGAGGCCT
AGCCCTTGA
 
Protein sequence
MRLYVLGGGQ LALMMCWESQ RVPVHFSVYD PDPSAPAYRC AKRPSDPLEE VDKADAVTFE 
FENVDISLAE RAEKLGKLRP PLAYLKVKKS RIEERALMDS IGVPTVPWRR AANWEEAIKI
AESMGRAYVK VPTGGYDGKG QYIYPHEAAL IRGLAGELLV EEYVDIRREF SIVAARAENG
DVYFYPPAEN FYVHGILVWN YAPTKVPEEA YEYVNRILEW GKYVGVIAVE FFEARDGRVL
VNEIAPRVHN TGHWTLETDA SQFENHVRAV LGLPLRRPRA MAPTAMVNIL GVGLGKLPLA
ELERRGRVYW YYKAEARPRR KMGHLNITAG SVEEAITKAR EALRLIYGAD FPRLVMRSRP
SP