Gene Xcel_2797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXcel_2797 
Symbol 
ID8650342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylanimonas cellulosilytica DSM 15894 
KingdomBacteria 
Replicon accessionNC_013530 
Strand
Start bp3103374 
End bp3104654 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003327369 
Protein GI269957580 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGTT CGACACGGCT CACGACGGCG TCAGCGCTCG CCCTGAGCGT CGCCCTGCTC 
GCCGCCTGCA GCAGCGGCAG CCCGGACTCG ACGAGCGGCG GAGGTGACGG CGAGGACCGC
GGCCCGATCA CCTACGCCCA AGGCAAGGAC AACTCCGGAA CCATCGAGCG GCAGCTGGCC
GAGTGGAACG CGGCGCACCC AGGTGAGGAG GTCACCCTGG TCGAGCTGCC GGAGAGCTCG
GACGCTCAGC GTCAGCAGCA CATCCAGAAC GCGCAGACGG AGTCCGACGC GTTCACGGTG
ATCGCTCTGG ACAACGTCTG GGTCGCCGAG TTCGCTGCGA ACCAGTGGGT GGACCCGTTG
CCGAACGACA TGTTCCCGGC CGAACCCTTC CTGCCAGCCG TGTACGCAAC CGGGCTGTAC
CGCGACCAGC TGTACTCGGT GCCGCGGGTC TCCGACGGCG GCCTGCTGTA CTACCGGACC
GACCTTCTCG AGGCAGCCGG GATCGCCGCA CCGCCGACCA GCTGGGACGA GATGGTCGCG
CAGTGCGCCC TGATCCAGGC CACTCCAGAG GGGGCCGGCG TCGGCTGCTA CGCAGGCCAG
TTCGAGAAGT ACGAAGGCCT GACGGTGAAC TTCGCCGAAG CGGTCAACGG TGCCGGGGGC
GTGATCACCG ATGAGAACGG CGTGCCCGAC GTCGACACCC CCGAGGCCCT GGCCGGCCTC
ACCCAGCTGG TCGACGGTCT CCAGTCGGGG ATCATCCCGG CCGAGGCGAT CACCTACAAG
GAAGAGGAGG GTCGCCAGGC GTTCCAGGCT GGAAAGCTTG CCTTCCACCG CCAGTGGCCC
TACCAGTACT CCCTGGCCAA CGCCACCGAC GGGTCCAGCC AGGTCGCCGG AAAGTTCGAG
GTCACCGGGC TGCCGGGCGT CTCGTCGCTC GGCGGCTGGG GCGTGTCCAT CTCGGCCTAT
GCCACGCACA AGGACACCGC GCTCGACTTC ATCCAGTGGT TCACCAGCGA AGAGCGCCAG
CGGCAGAACC TCGTCGAGGC CTCGAACGCA CCGGTCTACG CCAGCCTGTA CGACGAGCCC
GACCTGGTTG CAGAGTTCGC CTACCTGCCG GCGCTCAAGG CCTCGATCCT GTCCGCCCAG
GCACGACCGC GGGTCGTCAA CTACGGCGAT GTCACCGCCG CGATCCAGGA CTCCGCGTAC
GCCGCGCTGA CCGGCAACCC CTCGCCGCAG GAGGCGCTGA GCCAGCTCCA GGCCCGCCTG
GAAGAACTCA CCGCGGGCTA G
 
Protein sequence
MSRSTRLTTA SALALSVALL AACSSGSPDS TSGGGDGEDR GPITYAQGKD NSGTIERQLA 
EWNAAHPGEE VTLVELPESS DAQRQQHIQN AQTESDAFTV IALDNVWVAE FAANQWVDPL
PNDMFPAEPF LPAVYATGLY RDQLYSVPRV SDGGLLYYRT DLLEAAGIAA PPTSWDEMVA
QCALIQATPE GAGVGCYAGQ FEKYEGLTVN FAEAVNGAGG VITDENGVPD VDTPEALAGL
TQLVDGLQSG IIPAEAITYK EEEGRQAFQA GKLAFHRQWP YQYSLANATD GSSQVAGKFE
VTGLPGVSSL GGWGVSISAY ATHKDTALDF IQWFTSEERQ RQNLVEASNA PVYASLYDEP
DLVAEFAYLP ALKASILSAQ ARPRVVNYGD VTAAIQDSAY AALTGNPSPQ EALSQLQARL
EELTAG