Gene Rxyl_0254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0254 
Symbol 
ID4116085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp259984 
End bp261150 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content74% 
IMG OID638035044 
Productphosphoribosylaminoimidazole carboxylase 
Protein accessionYP_643043 
Protein GI108803106 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCGGA CCATCCTCCC CGGGAGCACG GTCGGCGTGC TGGGCGGCGG CCAGCTCGGT 
CGCATGCTGG CGCTTGCGGG CGGCCACATG GGCTACCGGT TCGTGGTGCT CGACCCGACG
CCCAACGCCC CCGCCGGGCA GGTCTCGAGC GGCCAGGTCG TGGCCGCCTA CGACGACCGC
GAGGCCGCCG GCAGGCTCGC CGCCTCCTCC GACGTGATCA CCTACGAGTT CGAGAACGTG
GACGCCGGGG TGGCCGGGAT GCTGGAGCGG GAGGCGTACG TCCCCCAGGG GAGCCGGCTG
CTGCACACCA CCCAGCACCG GCTGCGCGAG AAGCGGGCCG TGGAGGAGGC GGGGGTGCGG
GTCGCCCCCT ACGAGCCGGT GCGAGACGGC GAGGACCTGC GGGCCGCGCT GCGGCGCCTC
GGCACCCCCT GCGTGCTCAA GACCGCCACG GGCGGCTACG ACGGCAGGGG CCAGCGCGTC
ATCCGCTCCG AAGACGAGGC CCCGGCGGCC CTCTCGGAGC TCTCCGGGGA GGGGACCGAG
CTGGTGCTGG AGCGCTTTGT CCGCTTCGAG AAGGAGCTCT CGGTCATCGC CGCCCGCACC
CCCGGGGGGG AGGTCCGGAC CTTCCCCCCC GCCGAGAACG TCCACGTGGA CAACATCCTC
CACCTCTCCA TCGTCCCCGC CCGCATCCCG CGGGAGGTGC AGGAGGAGGC CCGGCGGATG
GCGGTGCGCG TGGCCGAGGG GCTCGGCGTG GTGGGGCTCG TCGCCGTGGA GATGTTCTGG
GCCGGCGGCG ACGGGCTCTA CGTCAACGAG CTCGCCCCCC GCCCCCACAA CTCCGGCCAC
TACACCATAG AGGCCTGCGC CACCTCCCAG TTCGAGCAGC ACCTCAGGGC CATATGCAAC
CTGCCGCTCG GGCCGACCGA CCTCCTCACC CCCGCCGTGA TGGTGAACGT GCTGGGCGAG
CATCTGGAGC CGCTCGTCCG CGCGCTCTCG GAGGGGAGGA TCGCCGCCCG CGGCGGGGCG
GTGCCGAAGG TCCACCTCTA CGGCAAGGCC GAGTCGCGCC CCAAGCGGAA GATGGGCCAC
GTGACCCTCC TCGCCCCGGA GACGGGCGCC GCCCTCCGGT GGGTCGAGGA GAGCGGCCTC
TGGAAGGCGC AGGGAGGGGC CGCCTAG
 
Protein sequence
MSRTILPGST VGVLGGGQLG RMLALAGGHM GYRFVVLDPT PNAPAGQVSS GQVVAAYDDR 
EAAGRLAASS DVITYEFENV DAGVAGMLER EAYVPQGSRL LHTTQHRLRE KRAVEEAGVR
VAPYEPVRDG EDLRAALRRL GTPCVLKTAT GGYDGRGQRV IRSEDEAPAA LSELSGEGTE
LVLERFVRFE KELSVIAART PGGEVRTFPP AENVHVDNIL HLSIVPARIP REVQEEARRM
AVRVAEGLGV VGLVAVEMFW AGGDGLYVNE LAPRPHNSGH YTIEACATSQ FEQHLRAICN
LPLGPTDLLT PAVMVNVLGE HLEPLVRALS EGRIAARGGA VPKVHLYGKA ESRPKRKMGH
VTLLAPETGA ALRWVEESGL WKAQGGAA