Gene Rxyl_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0235 
Symbol 
ID4117726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp241058 
End bp242137 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content53% 
IMG OID638035026 
ProductNitrilase 
Protein accessionYP_643025 
Protein GI108803088 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.700557 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCCC AATTTCCAAA AAGCTTCCGC GCAGCAGCGG TGCAAGCATC GCCTGTACAT 
CTTAAGCCGG ATGCCACCGT CGACAAGCTG GAAAGCTTAG TAGCTGAGGC AGCACGTGGT
GGAGCACAAC TCGTGGTCTT TTCTGAGTCG TTTATACCTG CTTTTCCTGT GTGGAATCTC
GTGCTCCCAC CAGTCGATCA ACATGACCTC TTCCGGCGAC TCTTTTTGAA CTCAGTATTA
GTTCCTGGAC CTATAACGCG GCGTCTCGCT GAAATCGCAA AGCGTCATGA CGTGTATTTA
TCGGTCGGGG TAACTGAGCG CACTAATATC AGTATGGGAT GTCTCTACAA TACAAACTTG
CTCTTTGCGC CGACTGGCGA ACTGCTCAAC CATCGGCGGA AGCTCGTTCC TACGTGGGCA
GAGAAACTCA CGCATGCGTG GGGCGACGCA AGCGACCTGC GTCCTGTGCA GACCGAGCTA
GGTAATATTG GTGTGCTTAT CTGCGGTGAA AACACTAACC CGCTTGCTCG GTACACCCTT
CTTGCTCAGG GAGAACAAAT ACACATCGCC ACCTACCCTC CTGCCTGGCC ATTTCGTCGG
ACCGGCGGGC GCCAGACCTA CAATCTTCGA AAGGCTATCG AGATTCGGTC AGCAGCCCAT
GCTTTCGAAG GTAAGGTGTT CAATATAGTT TCCTCCGGAT TACTCGATGA AGGCATAATC
AAAGATATAA TTGCGATTGC TCCTGACCTT GAGCCGACGC TGCGCGAAGC TCCCGCCCCA
GCCTCGATGA TTCTAGGCCC GACCGGAGAA CCGTTGGTAG AACCCCTTGT AGGTGATGAA
GGGATCATCT ACGCAGACAT TGACGTCACC GAGAGCATCG AGGTAAAGCA GGCTCACGAC
ATCGTGGGCT ATTATCAGCG GTTCGACGTG TTCCAGCTCA CCGTCGATCA GCGACCGCAG
CTTCCTATTA ACTTGATACG CGGTCCCGAA ACCAGTTATG ACAGCAGGAT GGGCGCTGAA
ACAGTCGAAA CGGCGGTAGA AGAAGATCGA GCGTCGTCCG AGTCGACCAC CATCCGCTAA
 
Protein sequence
MDSQFPKSFR AAAVQASPVH LKPDATVDKL ESLVAEAARG GAQLVVFSES FIPAFPVWNL 
VLPPVDQHDL FRRLFLNSVL VPGPITRRLA EIAKRHDVYL SVGVTERTNI SMGCLYNTNL
LFAPTGELLN HRRKLVPTWA EKLTHAWGDA SDLRPVQTEL GNIGVLICGE NTNPLARYTL
LAQGEQIHIA TYPPAWPFRR TGGRQTYNLR KAIEIRSAAH AFEGKVFNIV SSGLLDEGII
KDIIAIAPDL EPTLREAPAP ASMILGPTGE PLVEPLVGDE GIIYADIDVT ESIEVKQAHD
IVGYYQRFDV FQLTVDQRPQ LPINLIRGPE TSYDSRMGAE TVETAVEEDR ASSESTTIR