Gene Rcas_2525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2525 
Symbol 
ID5540007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3257264 
End bp3258391 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content62% 
IMG OID640894656 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001432623 
Protein GI156742494 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0404264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATA GATTAATCGG TGTTCTGGGC GGAGGGCAAT TGGGGCGAAT GCTGGCGCTC 
GCCGGTTATC CGCTCGGTTT CCGCTTCCGC TTCCTCGATC CTGCCGATGA TGCGCCGGTT
CGCTACCTGG CAGAGCAGGT TGTCGCATCA TACGACGACC ATTTAGCCGT GGCACAGTTC
GGCAGCGGGT TGATAGTCGT CACCTATGAG TTTGAGAATG TACCGGTGGC GACGGCGCGC
GCGCTCGAGC AGCATATACC GGTGTTTCCA CCGCCGCAGG CGCTTGAGGT TGCACAGGAT
CGTCTCGCGG AAAAGCGCTT CTTCACACAA CTGAACATTC CAACCGCGCC GTTCGCGCCG
GTGGACGACC GCGCGTCGCT TGATGCAGCA ATCGAGCGTA TTGGGCTGCC TGCTCTTCTG
AAGACGCGAC GCCTGGGGTA TGATGGCAAG GGGCAGGCAT TGATCGGGCA GCGTGCAGAC
ATCGAAGACG CCTGGCGCGC ACTTGGCGGG CAGCCGCTTA TCCTCGAAGG GTTTGTTTCG
TTTGTGCGTG AACTTTCCGT CCTGGCAGTG CGCGGGCAGG ATGGCGCCGT CGCGTGCTAC
CCGCCGGTCG AGAATCTGCA CCGCAATGGC ATTCTGTATC GCTCGATCGC CCCTGCGCCC
GGGCTTGCCA CCGAGGTGCA ACTGCTGGCG GAGACCTATG CGCGCCGGGT GCTCGAAGCG
CTCGACTATG TCGGCGTTCT GGCAATCGAG ATGTTTGAAG TGGAACCTGA CCGTACCGGC
GGCGCTCGCC TGCTGGCGAA CGAAATGGCG CCGCGCGTCC ACAATTCTGG TCATTGGACG
ATTGATGGCG CTGTGACGAG TCAGTTCGAG AACCACCTGC GCGCCATCGC CGGTCTGCCG
CTCGGCGACG CCTCAGCGCG CGGTTGCGCC GCGATGGTCA ACCTTATCGG CGGATTGCCC
GATGTGACGA TGCTGCTGGC GCTGCCTGAT ACTCATCTGC ACCTCTATGA CAAAGCGCCG
CGACCGGGGC GCAAACTGGG ACATGTAACC GTCTGCGCCG TGGATGCAGA GCGCCTGGCA
GAGCGCCTGG CAGTCGTCGA ACGGTATGTT CTGGCGAGCA CGAAGTGA
 
Protein sequence
MNDRLIGVLG GGQLGRMLAL AGYPLGFRFR FLDPADDAPV RYLAEQVVAS YDDHLAVAQF 
GSGLIVVTYE FENVPVATAR ALEQHIPVFP PPQALEVAQD RLAEKRFFTQ LNIPTAPFAP
VDDRASLDAA IERIGLPALL KTRRLGYDGK GQALIGQRAD IEDAWRALGG QPLILEGFVS
FVRELSVLAV RGQDGAVACY PPVENLHRNG ILYRSIAPAP GLATEVQLLA ETYARRVLEA
LDYVGVLAIE MFEVEPDRTG GARLLANEMA PRVHNSGHWT IDGAVTSQFE NHLRAIAGLP
LGDASARGCA AMVNLIGGLP DVTMLLALPD THLHLYDKAP RPGRKLGHVT VCAVDAERLA
ERLAVVERYV LASTK