Gene Rxyl_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_1559 
Symbol 
ID4116889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp1578809 
End bp1580119 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content59% 
IMG OID638036355 
Productpeptidase M24 
Protein accessionYP_644333 
Protein GI108804396 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.98503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCTT CGGCGGGCTC TTCACACGGA GATCTTGCGG CAAGGGACTT CGTTCGCTCC 
GGCCCGGCTG AAAGCAAGGA GCGCCCCGCG ATCAACAAGG ACAGACACAC TCTTGGGCAG
ACAAACACCC TGTTGGAGCG CGGGGTCGAT CTTGAGAGGC TGCGCCGGGA GCGACTTCTC
AAGGTGCAGT CCGAGATGCG ATCTCGGGAC ATTGGCGCCC TCGTTTTGAC GGACCCCATA
AACATCCGCT ACACAACCGG CCTCAGCGTG ATGCCGTTGT GGGCGGCCAC GAACCTCGCC
CACTACGTCT TGGTGCCAGT GGAAGGCAGT CCGGTCGTCT TCGAGTATGC CCGGGCGAAA
TTCCGCGCCG AGGAGTTCTT CTCGGATGTG AGGAGCGCAC ATTACTGGCA GGCGCGCTTT
GCCGATCAAC TGGCTGCGGA GCGTTCGGGA GAATGGGCCG CAGAGATCAA AGACGTACTT
CGTACTTGGG GCGTGATCGA CTCCAAGCTC GGGATAGATT GTCTGGATTA CCACGGCTTC
TCAGCGCTCC AAGGTCAGGG CATATGCCTT ACGGATGCCG ACGATCCGAT ACAGAACGCA
CGCATTATCA AAACCGCTGA CGAAATCGAG CTCCTGAAAC AATCCGCGGC GGTCTGCGAG
GCCGCGCTGT ACGATCTGGA GCGGGCTATC CGCCCCGGGG TGAGCGAGCA CGAGTTGCTG
GGCGTCTTCT ACCACAAGAT GCTGGCCCTG GGTGGAGAGC ACTGCTTTTC GCGGCTACTT
AGCACGGGCC ACAAGACCAA TCCCTGGTTC CACGAAGCGG GCAGCAAGCT GGTGCGCCCC
GGAGACCTCG TGGCCTTCGA CACCGACATG ACAGGGCCGG AGGGCTACGT CTGCGACATC
TCACGCACGT TCCTGTGCGG AGAAGAAGCC ACAGCCGCAC AGAAGGAGGC GTACAGGGTA
GCGTACGAGT TCACCCAGGA ACTCGCTTCC ATGCTGCGTC CCGGCCTAGG CTACGACGAG
TTGTTGAACA ATCTTCCCGA GTACCCGGAT CTCTACAAGG CGCAACGCTA CTCGTTCGTG
CTGCACGGTG TGGGAACGGA TGATGAACTG CCGTTTCTTC CGTATCCCGA TGACCCCGGG
GCGATCGAAC TCGATGGGGA ACTCAAGGAG AACATGGTTG TCAGCGTGGA ATTCTACGCA
GGGAAGGTTG GAGAACAAGA CGGCGTGAAG CTGGAAGACG AGGTGTGGAT TACCGAGGAA
GGGCCGGTGA TGCTCTCGTT GTACCCGTAC GAGGAGAAAT TGATCTCTTA A
 
Protein sequence
MNSSAGSSHG DLAARDFVRS GPAESKERPA INKDRHTLGQ TNTLLERGVD LERLRRERLL 
KVQSEMRSRD IGALVLTDPI NIRYTTGLSV MPLWAATNLA HYVLVPVEGS PVVFEYARAK
FRAEEFFSDV RSAHYWQARF ADQLAAERSG EWAAEIKDVL RTWGVIDSKL GIDCLDYHGF
SALQGQGICL TDADDPIQNA RIIKTADEIE LLKQSAAVCE AALYDLERAI RPGVSEHELL
GVFYHKMLAL GGEHCFSRLL STGHKTNPWF HEAGSKLVRP GDLVAFDTDM TGPEGYVCDI
SRTFLCGEEA TAAQKEAYRV AYEFTQELAS MLRPGLGYDE LLNNLPEYPD LYKAQRYSFV
LHGVGTDDEL PFLPYPDDPG AIELDGELKE NMVVSVEFYA GKVGEQDGVK LEDEVWITEE
GPVMLSLYPY EEKLIS