Gene Slin_4384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4384 
Symbol 
ID8728144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5319780 
End bp5321120 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content54% 
IMG OID 
Productxylose isomerase 
Protein accessionYP_003389164 
Protein GI284039234 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0452947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACG TTAAATTGAC CCTCGGCGAG AAAACTTACT TTCCGTTCAT AGAAAAACCA 
ATTGCCTACG AAGGCCGTGA ATCGGATAAT CCGTTGGCCT TCAAGTTCTA CGACGCCAAC
CGACTCATTC TGGGCAAACC GATGAAGGAT CTGTTCCGGT TTGCTACGGC TTACTGGCAT
ACCTTCTGCG GTACCGGAGC CGACCCCTTT GGTCCGGGTG TCAAGCATTT TCCCTGGGAT
GCCAACCCCG ACCCGCTGGC CGCTGCCCAT GATAAGATGG ATGCCGCTTT CGAGTTTATC
ACCAAAATAG GCATGGAGTT TTACTGCTTC CACGATGTAG ACGTGGCCCC CGAAGGAAAC
TCTAACAGTG AATTCGAGAA GAACTTCCGG GCTATTGTCG ACTACGCCAA ACAGAAGCAG
GCCGCCAGTG GTGTAAAACT GCTGTGGGGC ACGGCCAACC TGTTCTCGCA CGAGCGGTAC
ATGAACGGGG CCTCCACCAA CCCTGATTTT CACGTGCTCG CCCATGGTGG CTGGCAGGTG
AAAAACGCCA TCGACGCCAC CATCGAACTC GGGGGCGCAG GCTATACTTT CTGGGGAGGC
CGGGAAGGGT ACATGTCGCT GCTGAATACC AACATGAAAC GGGAGCAGGA ACACCTGGGC
AAGTTTCTGC AAATCAGCCG CGATTACGCC CGTAAGCAGG GTTTTAAAGG TTCGTTTTAC
ATCGAGCCCA AACCGATGGA GCCCACCAAA CACCAGTACG ATTTCGATGC AGCAACGGTT
GTCGGTTTCC TGAATCGCTT TGGCTTACAG GACGACTTCG AGCTAAACAT CGAAACCAAC
CACGCCACCC TAGCTAATCA TACGTTTGCC CACGAATTGC AGATTGCCGC CGATAACAAC
ATGCTCGGCA GCATCGACGC CAACCGGGGC GATTACCAGA ATGGCTGGGA TACCGACCAG
TTTCCGGTAG ATGTATACGA ACTGACGGAA GCCATGCTGG TCATTCTGGA AGCGGATGGC
CTCAAATCCG GCGGGGTTAA CTTCGACGCC AAGACGCGCC GGAATTCAAC CGACCTGGAC
GATATTTTCA TCGCCCACAT TGGCGGCATG GACACCTTCG CACGGGCAGC CATCGCGGCC
GAAGCCATTC TTGATAAGTC GCAGTACCGG AAACTCCGCG CCGAACGTTA CGCCAGCTAC
GACTCGGGCG AAGGTGCCCG TTTCGAAAAA GGTGAGTTAA CGCTGGAAGA CCTGCGCCAG
TATGCCATGA CCAATGGCGA GCCCAAACAA CTCAGCGGCA AACAGGAGCT GTATGAAATG
ATCGTTAATC AGTATATTTA A
 
Protein sequence
MSDVKLTLGE KTYFPFIEKP IAYEGRESDN PLAFKFYDAN RLILGKPMKD LFRFATAYWH 
TFCGTGADPF GPGVKHFPWD ANPDPLAAAH DKMDAAFEFI TKIGMEFYCF HDVDVAPEGN
SNSEFEKNFR AIVDYAKQKQ AASGVKLLWG TANLFSHERY MNGASTNPDF HVLAHGGWQV
KNAIDATIEL GGAGYTFWGG REGYMSLLNT NMKREQEHLG KFLQISRDYA RKQGFKGSFY
IEPKPMEPTK HQYDFDAATV VGFLNRFGLQ DDFELNIETN HATLANHTFA HELQIAADNN
MLGSIDANRG DYQNGWDTDQ FPVDVYELTE AMLVILEADG LKSGGVNFDA KTRRNSTDLD
DIFIAHIGGM DTFARAAIAA EAILDKSQYR KLRAERYASY DSGEGARFEK GELTLEDLRQ
YAMTNGEPKQ LSGKQELYEM IVNQYI