Gene Xcel_1037 details


Gene Information

Locus tag: Xcel_1037
Symbol:
ID: 8648547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xylanimonas cellulosilytica DSM 15894
Kingdom: Bacteria
Replicon accession: NC_013530
Strand:
Start bp: 1112488
End bp: 1113921
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Peptidase M23
Protein accession: YP_003325626
Protein GI: 269955837
COG category:
COG ID:
TIGRFAM ID:
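
The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1112488, 1113921)
print(length)                     # 1434
print(protein_length_aa(length))  # 477
```

Both values match the Gene Length and Protein Length fields in this record.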


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.501943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGACAGC ACGGGTTCGC ACGGGCGGCC GTCGGGGCCG TCCTCGTCGC GGCGCTCGGC 
TTCGGCGTCG CGGGGTCGGC CGCCGCCGAC GACATCGACG ACCGGCTCGC CGCCGCGCAG
CGGGACGCGC AGCAACGGCG CAACGAGCGG GCAGGGCTCG AGGAGGACCT GCACGAGACC
GACCAGAAGC TGAAGCAGGC GGTGCTGGAC CTCGACGAGG TCGAGGCGCG CCTGCCGGTC
GCCCAGGCCG AGCTGGAACG CGCCCAGGCC GACCTGGAGA AGGCGCAGCG CGAGGCCGAG
ATCCTCGCGC AGCGGCTCCA GGACGCCCAG GACGAGGAGG CGGCGGTCAC CGCCCAGCTC
GCGGCCGGGG CCGGCCAGGT CGAGGCGGCC CGCGCGGACA TCGCGCAGAT GGCGCGCGAG
GCGGCCCGCG GCCAGGGCAG CGTGTCGGCG TTCGGGATCG TCACGGGCGC GCAGTCGACC
GAGGACTTCC TGGCGCAGTT CGCCGTCTCC TCCTCCGCGG CCCGGTCGCA GGCCCGCACG
CTGACCGCCC TGCAGGACGC CGAGGCGCTG GCGCGCAACC AGGAGGCGCG CCTCCAGGCC
GTCCGGGAGC AGATCGACCA GCTCAAGACG GCCGCGGACG CCAAGGTGGT CGAGGCGCAG
GAGGCGGAGC AGCGCGCGAA GGACCGCAAG GCCGAGGTCG AGTCGCTCAT CGCGAAGCAG
AAGAAGCTCA AGGCCCAGAT CGAGGACCAG AAGGAAGCCG CGCTCGCCGA GCTGCGCCAG
AACGAGGCAG AGCAGAAGGC GCTCGAGGCC GACCTCAAGA AGATCATGGC GGAGCGCGAC
GAACGTGACC GGCGCATCGA GGAGCAGCGC CGCAAGGAGG AGGAAGCGCG CAAGAAGCGC
GAGGCCGAGG AGCGCCGCAA GCAGGAGGAG GCCGCGAAGG CGGCTTCCGG TGGCGGCTCG
AACAGTGGCG GCGGTTCGAG CGGCGGCGGT TCCGGCGGCG GTTCCGGTGG CGGCAGCACG
ACGCCGGTGT CCACCACGTT CCTGGGCTGG CCGACCGCCG TGCCGCACGT CACCAGCAGC
TACGGCATGC GGTTCCACCC CGTGCTGGGC ATCTGGCGAC TGCACGCCGG CACCGACTTC
CGCGCCTACT GCGGCACGCC GATCCTCACC TCGCAGTCCG GCATCGTGGT GCGCACCGCG
TACGGGTCCG GGCCGGGCAA CAACATCATG ATCGACCACG GCACCGACAA CGGGCAGAAC
ATCATGACCC GGTACCTGCA CCTGTCGAGC TTCTCGGTGA GCCAGGGGCA GTGGGTGAGC
AAGGGGCAGG TGATCGGCCG CTCCGGCAGC ACGGGGACGT CATCGGCCTG CCACCTGCAC
TTCGAGGTGT ACGTCAACGG CAGCACCGTC AACCCCATGA CGCGCCTGCC CTGA
 
Protein sequence
MRQHGFARAA VGAVLVAALG FGVAGSAAAD DIDDRLAAAQ RDAQQRRNER AGLEEDLHET 
DQKLKQAVLD LDEVEARLPV AQAELERAQA DLEKAQREAE ILAQRLQDAQ DEEAAVTAQL
AAGAGQVEAA RADIAQMARE AARGQGSVSA FGIVTGAQST EDFLAQFAVS SSAARSQART
LTALQDAEAL ARNQEARLQA VREQIDQLKT AADAKVVEAQ EAEQRAKDRK AEVESLIAKQ
KKLKAQIEDQ KEAALAELRQ NEAEQKALEA DLKKIMAERD ERDRRIEEQR RKEEEARKKR
EAEERRKQEE AAKAASGGGS NSGGGSSGGG SGGGSGGGST TPVSTTFLGW PTAVPHVTSS
YGMRFHPVLG IWRLHAGTDF RAYCGTPILT SQSGIVVRTA YGSGPGNNIM IDHGTDNGQN
IMTRYLHLSS FSVSQGQWVS KGQVIGRSGS TGTSSACHLH FEVYVNGSTV NPMTRLP
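
The gene and protein sequences above are printed in blocks of ten characters, and the record lists translation table 11 and a GC content value. The following Python sketch shows one way to strip the block spacing, compute a GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which table 11 shares, and the sketch assumes that any table 11 start codon in the first position (here GTG) is read as methionine.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as Met
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the gene sequence shown above.
print(translate_cds("GTGAGACAGC ACGGGTTC"))  # MRQHGF
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow a comparison with the GC content field and the protein sequence listed in this record.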