Gene Rxyl_3037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3037 
Symbol 
ID4115972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3044409 
End bp3045908 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content52% 
IMG OID638037806 
Productamidohydrolase 
Protein accessionYP_645758 
Protein GI108805821 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCCG AACGTATATT GCTGCGCTGC GATATATTGA TTGTCGACGC TTTATCAGAC 
CCGATCTACA ACGCCGCCAT CCTGATCGAA GACGGGCATG TGGAATCCGT CGGTGACTAT
CACGCTATGC GACGAAGTTA CCCACTGGCA TCGGAACATG GGAAACGAAT CCCTCTTGCT
ATGCCTGGCC TCGTCGATGG CCACTCCCAC GGGCGTGGGA TATCAACCGT CGAGCAAGGC
ATAGCAGACG CGCCGCTTGA TATCTGGCTA ACGCGCATAA CCGCAGCTAC CGCGTTTGAC
CCTTACGATG AGGCATTGGT CTCAGCAGCG GAACTCATCA CCACAGGAGT TACTACAGTC
CAGGTTATTT TCCATTCTTT CTCTAAGGCA GAAGATTATG TGCAAGGAGT TATCGCAACA
GCTAAAGGAT TTAAGCAAGT GGGGGTCGGC TTAGAGCTCG TTCTGGGCAT AAGCGACCAG
CACGAGTTTA TACCACCCGT TAGCACATCA CTCCACAGCC GCGTAGATCG TTTGCTCTCA
TCTCCTGAGC GAGGAATGGA TCCAACAACC TTTTTCGAAA TGTTCGACGC TCTCTCCGGT
TTGAAGAGCG ACACTTCGAT ACTCCCTACA AAAGAGGTTC AAGAAATTTT AAGCGAAACG
CGACTAGTGC TTGGTCCAAT CGCGCCGCAA TGGTCATCTG AAAACCTCAT ACAAGGCATC
GCTGACCGAG CAGCCCAGGG TGTACGTGTA CACACGCACC TATTGGAGTG TAAAAAACAG
CGTTCGCCTT TGTACGGCCC ACTGCCAGTA CAGAAGCTGG ATCAGCACGA ACTGTTAAGT
AACAGAACCT CTGTCGCACA CGGTGTGTGG CTTGAGCCAG ATGAGATAGC CCTACTAGCA
GCACGAAAAG TGTCCGTAGT CCATTGTGCA GGCTCTAACA CCCGACTTGA AGTTGGTTTA
GCACCAGTAC GCGAGATGCT CGACGCTGGC GTGCTTGTAG CCATCGGTCT TGACAGCAAC
ACTGTACACA ATCCTCCAGA TATCTTTGCG GAGATGCGCC ACGCGCTTGA GGTAGCGAGC
GCTCGAGGAT CGCAGGTTTC GGAGAGAGAA GTCCTCGCCA TGGCGACCTC TGGCGGCGCA
GCCGCTATAG GACGACAGGA TGAAGTTGGC ACTCTCAGAC CAGGCTCAAG AGCAGACTTG
GTTATTCTTA CACCAACTGA ACCCTTGACT GTCTACGAAG ATCCAATCTC GTGGATTGTC
GGCGAAGCTT CAAGAAACGA CCTGCACGAA GTGTGGGTAG AAGGAAAAGT ACTATACAGC
AACGGTTGTC TGCGCAACTC GTCTATAGTA GCAACAGCTC GGCGGCATCT ATATGAAGCC
CTGCTTCAGG ATGCGGTACG CCGTAGGGAG CGACTTAAGG AGCTGAGAAA GCTGGAACCA
TGGCTGAGAG GAATCTGGGA GAAGACAAGC ACCGCAACAA CCCAGGAGAA CCGTTCGTAG
 
Protein sequence
MNPERILLRC DILIVDALSD PIYNAAILIE DGHVESVGDY HAMRRSYPLA SEHGKRIPLA 
MPGLVDGHSH GRGISTVEQG IADAPLDIWL TRITAATAFD PYDEALVSAA ELITTGVTTV
QVIFHSFSKA EDYVQGVIAT AKGFKQVGVG LELVLGISDQ HEFIPPVSTS LHSRVDRLLS
SPERGMDPTT FFEMFDALSG LKSDTSILPT KEVQEILSET RLVLGPIAPQ WSSENLIQGI
ADRAAQGVRV HTHLLECKKQ RSPLYGPLPV QKLDQHELLS NRTSVAHGVW LEPDEIALLA
ARKVSVVHCA GSNTRLEVGL APVREMLDAG VLVAIGLDSN TVHNPPDIFA EMRHALEVAS
ARGSQVSERE VLAMATSGGA AAIGRQDEVG TLRPGSRADL VILTPTEPLT VYEDPISWIV
GEASRNDLHE VWVEGKVLYS NGCLRNSSIV ATARRHLYEA LLQDAVRRRE RLKELRKLEP
WLRGIWEKTS TATTQENRS