Gene Rxyl_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_1000 
Symbol 
ID4116218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp1036612 
End bp1038135 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content73% 
IMG OID638035785 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase / IMP cyclohydrolase 
Protein accessionYP_643778 
Protein GI108803841 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCGGA GGGCTGTGAT CTCTGTTTCC GACAAGAGCG GGGTGGCGGG GTTCGCCCGT 
CGCCTGCGAG ATCTCGGCTT CGAGATAATC TCCACCGGCG GGACGGCGCG GGCGCTGCGG
GAGGCCGGAA TCCCGGTCGT CCCCGTCTCC GAGGTCACCG GGGAGCCGGA GATCCTGGGC
GGCCGGGTGA AGACGCTGCA CCCCAGGATC CACGGCGGCA TCCTGGCCGA CCGCGAGAGG
GAGGAGCACC TCCGCCAGCT GGAGGAGCGC GGCATAGACC CCCTGGACCT GGTGTGCATC
GACCTCTACC CCTTCGAGCG CGCCGTCGCC GGAGGAGCGG GCGAGGAGGA GGCCATCGAG
CAGATAGACA TCGGCGGCCC CGCCATGCTG CGGGCCGCCG CCAAGAACTT CCGCTCGGTG
GTCGTGGTCC CGGGGCCGGA GTTCTACGGG GAGGTGCTCT CGGGGCTGGA GTCCGGCGGG
GTGCCGGAGG AGACCCGAAG GCGTCTCGCG GCCGCCGCCT TCAGGCGCAC GGCGCTCTAC
GATGCCGCCA TCGCGCGCTG GATGGCCGGG GAGGGGGAGC TCCCCGAGGC GCTCGTGCGG
GGCTACAGGC GAAAGATCCC GCTGCGCTAC GGCGAGAACC CCCACCAGAG GGCCGCCTAC
TACGCCGAGG AGGGGGCGCC CGCACACCTG CTCTCCGGGG CCAGGCGCTT GCAGGGAAAG
CAGCTCTCCT TCAACAACCT CTACGACGTG GACGCCGCGC GCTCGCTGCT CGCCACCCTC
GGCGAGGAGC CGGCGGCCGT CATCGTGAAG CACGCCAACC CGTGCGGCGC CGCGGTGGGC
AAAAGCGCGG GCGAGGCCTA CCTGAAGGCG CTCGACTCCG ACAGGATCTC GGCCTTCGGG
GGCATCGTGG CCCTCAACCG CGAGGTGGAC GGGGAGCTGG CGCGCGAGAT CTCGGGCGTC
TTCACCGAGG TCCTCGTCGC CCCGGGCTTC GCCGCGGAGG CGCGGGAGGT CTTCGCGGAG
AGGGAGGCCA TGATCCTTCT GGAGGCCGGG CCGCTGGAGC CCCCGAGCCT CTCCGCCAAG
TACGTGACGG GCGGGATGCT CCTGCAGGAG ACGGATGCGG TGGCCGGGGA GGACGCTTCC
TCCTACAAGA CGGTCGCCGG CGAGCCCCCC TCCGGGGAGG CGCTGCGAAA CCTCCTCCTC
GCCTGGCGCG TCGCCGCCCG GGTCAAGAGC AACGCCATCG TCCTCGTGCG GGACGGCGCC
ACGGTGGGGA TAGGGGCCGG GCAGATGAGC CGGGTGGACG CGGCGCGCAT AGCCGTGGAG
AAGGCCGGCG GGCGGAGCCG CGGCGCGGTC GCCGCGAGCG ACGCCTTCTT CCCGTTCGCC
GACGGGGTGG AGGCGCTGGC CGACGCGGGG GTCTCGGCGG TCATCCAGCC CGGGGGCTCC
AGGCGGGACG CCGAGGTCAT AGCGGCGGCC GAGCGGCGGG GAATGACCAT GGTCTTCACC
GGCAGGAGGC ACTTCCTCCA CTGA
 
Protein sequence
MRRRAVISVS DKSGVAGFAR RLRDLGFEII STGGTARALR EAGIPVVPVS EVTGEPEILG 
GRVKTLHPRI HGGILADRER EEHLRQLEER GIDPLDLVCI DLYPFERAVA GGAGEEEAIE
QIDIGGPAML RAAAKNFRSV VVVPGPEFYG EVLSGLESGG VPEETRRRLA AAAFRRTALY
DAAIARWMAG EGELPEALVR GYRRKIPLRY GENPHQRAAY YAEEGAPAHL LSGARRLQGK
QLSFNNLYDV DAARSLLATL GEEPAAVIVK HANPCGAAVG KSAGEAYLKA LDSDRISAFG
GIVALNREVD GELAREISGV FTEVLVAPGF AAEAREVFAE REAMILLEAG PLEPPSLSAK
YVTGGMLLQE TDAVAGEDAS SYKTVAGEPP SGEALRNLLL AWRVAARVKS NAIVLVRDGA
TVGIGAGQMS RVDAARIAVE KAGGRSRGAV AASDAFFPFA DGVEALADAG VSAVIQPGGS
RRDAEVIAAA ERRGMTMVFT GRRHFLH