Gene Rxyl_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0206 
Symbol 
ID4117833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp206674 
End bp208074 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content71% 
IMG OID638034997 
Producthypothetical protein 
Protein accessionYP_642996 
Protein GI108803059 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.166385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGC GAACCCTGAT CCGCGGCGGC CACGTCATCT CCATGGACCC CCAGATCGGG 
GACATCCCCG GCGGGGACGT GCTGATCGAG GGCGAGGAGA TCGCGGCGGT CGCTCCGTCT
ATCGACGCTT CCGATTGCGA GGTGGTGGAC GCCTCGGGCG CGATCGTGAT CCCCGGCTTC
ATAGACTCCC ACCGGCACAC CTGGGAGACC GTGATCCGGG GGATCGCCCC CGACGTCACC
CTCGACGGCT ACTTCCAGCT CGTCCTGGAC ACCCTCGCTC CGGCCTACCG CCCGGAGGAC
GTCTACGCCG GCAACCTCCT CGGCACGCTC GAGGCCATAG ACGCCGGGGT GACCACGCTG
CTGGACTGGT CGCACATCAA TAACACCCCC GAGCACGCCG ACGAGGCCAT CCGGGCGCTG
GCCGAGACCG GCATCCGGGC CGTCTACTGC TACGGCAACC CCAACACCTC GCTCGCCGAC
TGGTGGTTCA ACAGCACCCT GAAGGCCCCG GAGGACATCC GGCGGGTGCG CGAGAGGTAC
TTCTCCTCCG AGGAGGGGCT CATGACCCTC GCCATGGGCA CCCGCGGCCC CGGCTTCTGC
ACCCCGGAGG TCGTCCGGCA CGACTGGGAG CTGGCGCGGG ACATCGGCGT GCCCATCAGC
GTGCACGTGG GGATGGGGCC CGTGGCCGGA CGCTTCAGGA TGGTCGAGCA GCTGCACGAC
CTCGGCCTGC TCGGCCCGGA CATCACCTAC ATCCACTGCA ACCACCTCAC CGACCGCGAG
TTCCGGCTCA TCGCGGAGAC CGGGGGGACC GTCTCCATCG CCCCCATGGT GGAGATGACC
ATGGGCCACG GGATGCCGCC GACCGGGGAG GTGCTGGCGC ACGGCATCCG GCCCAGCCTG
AGCTGCGACG TCGTGACCAG CGTCCCCGGC GACCCCTTCA CCCAGATGCG CTTCCTCTTC
GCCGCCGAGC GGGTGCGCGT CCACGAGCGG GTCTTCGAGG AGGAGCTGGA GGAGATGCCG
CCCCTGCTCT CCTCGCGCGA CGTGCTGGAG TTCGCCACCA TCGAGGGCGC GCGCACGGTC
GGCCTCGCCG ACAGGACGGG CTCGCTCATC CCCGGCAAGA AGGCCGACGT CGTGATGCTG
AGCATCGAGC GGGTCAACGC CGCGCCCGTC ACCGACCCGG TGGGGACGGT GGTGTGCAGC
ATGGACTCCT CCAACGTGGA CTCCGTGTGG GTGAACGGGC GCGCCCTCAA GCGCAACGGG
GTGCTCGTGG ACGCCGACCT GGAGCGGGCC CGCCGCCTCG CCGAGGACTC GCGGGACTAC
CTGATCTCCC GCACCGGCCG TCAGGCTCAC TGGGCCACCC CGCGCACGAC CGGAGAGGCC
GCGCCGGGCG CCGGCCTCTA G
 
Protein sequence
MAERTLIRGG HVISMDPQIG DIPGGDVLIE GEEIAAVAPS IDASDCEVVD ASGAIVIPGF 
IDSHRHTWET VIRGIAPDVT LDGYFQLVLD TLAPAYRPED VYAGNLLGTL EAIDAGVTTL
LDWSHINNTP EHADEAIRAL AETGIRAVYC YGNPNTSLAD WWFNSTLKAP EDIRRVRERY
FSSEEGLMTL AMGTRGPGFC TPEVVRHDWE LARDIGVPIS VHVGMGPVAG RFRMVEQLHD
LGLLGPDITY IHCNHLTDRE FRLIAETGGT VSIAPMVEMT MGHGMPPTGE VLAHGIRPSL
SCDVVTSVPG DPFTQMRFLF AAERVRVHER VFEEELEEMP PLLSSRDVLE FATIEGARTV
GLADRTGSLI PGKKADVVML SIERVNAAPV TDPVGTVVCS MDSSNVDSVW VNGRALKRNG
VLVDADLERA RRLAEDSRDY LISRTGRQAH WATPRTTGEA APGAGL