Gene Sare_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0872 
Symbol 
ID5704537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp976789 
End bp977970 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content71% 
IMG OID641270391 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_001535781 
Protein GI159036528 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.803514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0297907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCCC ACACCGGTCT GCCCCTTGTC GGCATGGTGG GCGGCGGTCA ACTGGCCCGG 
ATGACCCATC AGGCCGCGAT CGCCCTCGGC CAGTCGCTGC GGGTGCTCGC GCTCGCTCCC
GACGACAGTG CTGCCCTGGT GGCCGCCGAC GTGCAGTACG GCGACCACAC CGACCTGGCG
GCACTGCGCA CCTTTGCCAA GGGCTGTGAC GTGGTCACCT TCGACCATGA GCACGTTCCC
ACCGAGCACA TCGACGCCCT CGCCGACGAA GGCGTCAAGC TGTTCCCGCC GGCCGAGGCG
CTGGTGCACG CACAGGACAA GCAGGTCATG CGGGAACGTC TCGCCGGGTT GGGCATGCCG
AACCCGGCCT GGCGGCCGGT CGACACTCCG GCTGACGTCG AGTCCTTCGG TGACGCGGTG
GGCTGGCCGG TGGTGCTCAA GGCGGCCCGG GGTGGCTACG ACGGCCGGGG CGTGTGGCTG
GTGGACGACG CCGCCGGGGC GGTTGAGCGA ACGGCCACGC TGCTGGCCGC AGGGACGCGC
CTCATCGTCG AGGAGCGGGT GGCGCTGCGC CGGGAACTGG CCGTGCAGGT GGCCCGTTCA
CCGTTCGGGC AGGTCGCCGT GTATCCGGTG GTCGAGACCG TGCAACGGGA CGGCGTCTGC
GTCGAGGTCC TGGCCCCCGC ACCAGACCTG CCGGAGGAGT TGGCGGTCGG TGCGCAACAG
CTCGCTATCG ATCTGGCCAC CGCGCTCGGC GTGGTGGGGC TGCTCGCCGT CGAGTTGTTC
GAGGTGGCCG ACCCGGCCGA GGTGACGGGC AGTCGGCTCG TGGTCAACGA GTTGGCGATG
CGTCCGCACA ACTCCGGGCA CTGGACGATC GAGGGCGCCC GGACGTCGCA GTTCGAGCAG
CACCTACGGG CGGTGCTTGA CTATCCGATG GGGGACACCT CCCTGGCCGC GCCGATCGTG
GTGATGGCGA ACGTGCTGGG CGGCGAGCCG GGAGGTATGT CCTTCGACGA GCGCCTGCAC
CACCTGTTCG CTGCCGAGCC GGGCGCGCAG GTGCACCTGT ACGGCAAGCA GGTGCGCCCA
GGTCGCAAGA TCGGCCATGT CACGGTGCTC GGCGACGACC TGGACGAGGT ACGTACCCGG
GCGGCGCGCG CGGCCCGTTG GCTGCGGGAG GGGCGCGGAT GA
 
Protein sequence
MDSHTGLPLV GMVGGGQLAR MTHQAAIALG QSLRVLALAP DDSAALVAAD VQYGDHTDLA 
ALRTFAKGCD VVTFDHEHVP TEHIDALADE GVKLFPPAEA LVHAQDKQVM RERLAGLGMP
NPAWRPVDTP ADVESFGDAV GWPVVLKAAR GGYDGRGVWL VDDAAGAVER TATLLAAGTR
LIVEERVALR RELAVQVARS PFGQVAVYPV VETVQRDGVC VEVLAPAPDL PEELAVGAQQ
LAIDLATALG VVGLLAVELF EVADPAEVTG SRLVVNELAM RPHNSGHWTI EGARTSQFEQ
HLRAVLDYPM GDTSLAAPIV VMANVLGGEP GGMSFDERLH HLFAAEPGAQ VHLYGKQVRP
GRKIGHVTVL GDDLDEVRTR AARAARWLRE GRG