Gene Slin_3338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3338 
Symbol 
ID8727091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4037369 
End bp4038517 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content56% 
IMG OID 
Productglycoside hydrolase family 5 
Protein accessionYP_003388147 
Protein GI284038217 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0734572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.542716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGAA GGACGTTTAT CCAGAACACG GGTTTGCTTA CGGCAGCCCT CAGCACCACC 
GGAACCGAAC TTTTGGCCAG TTCGACCCTC GGCACCGCAG CCAAAAACAA ACTGCCCAAA
TGGAAAGGAT TTAACCTGCT GGACTTCTTC TCGCCCGACC CGTCCAAGAG TCGGAAAGGC
ACGACCGAAG ATCACCTCAA ATGGATGCAG GACTGGGGTT TCGATTTCGT CCGGCTGCCG
ATGGCGTATC CGTATTACCT GAAGTTCGAC CGGAGCCGCA ACATCACCCC CGACGAAGTC
TACCAGATCG ACCCGCAGGC CGTCGACCGG ATTGATGAGC TGGTCGCCAT GGCGCACAAG
CATAATCTGC ACGTGAGCCT GAATCTGCAC CGCGCGCCGG GCTACTGCGT CAATGCGGGC
TTTCACGAAC CGTATAATCT CTGGACCGAT CAGGCGGCAC TGGATGCATT CTGCTTTCAC
TGGAACATGT GGGCGAAACG GTATAAGAAT ATATCCGCCA AAAAAATCAG CTTCGACCTG
CTGAACGAAC CCGGCATGCG GGCCGACATG AACGACCAGC ACTCCAAACG GGGTTCGGTT
CCCGGCGACG TGTACCGAAA AGTGGCGCTG GCTGCTTCGG ACGCTATCTG GAAAGAGAAC
AAGAACCACC TCATTATTGC CGATGGCAAC GACACCGGCT CGTCGGTGAT TCCATCCATC
GCCGATCTAA ACATTGCCCA GAGCTGCCGG GGCTATAATC CGGGCATTAT TTCGCACTAC
AAAGCGCCAT GGGCCAACAA AGACCCGGAA AGTCTGCCCG AGCCCAAATG GCCGGGGCAG
GTGGGCGACA AGTACCTGAG CCGGACCATG CTGGAAACAT TCTACCAGCC GTGGATCGAG
TTGGTCAACA AAGGCGTTGG CGTGCATTGT GGCGAGTGCG GTTGCTGGAA CAAAACCCCG
CACGATGTGT TCCTGGCCTG GTTCGGCGAC GTACTCGACA TCCTGTCAAA AAACGGCATC
GGCTTCGCGC TTTGGGAGTT CATCGGCGAC TTCGGCATCC TGAACTCGGG CCGGGCCGAT
GTGGCTTACG AAGACTGGCA CGGTTATAAA CTCGACCGAA AACTGCTGGA GCTGATCAGA
AAGGCATAA
 
Protein sequence
MQRRTFIQNT GLLTAALSTT GTELLASSTL GTAAKNKLPK WKGFNLLDFF SPDPSKSRKG 
TTEDHLKWMQ DWGFDFVRLP MAYPYYLKFD RSRNITPDEV YQIDPQAVDR IDELVAMAHK
HNLHVSLNLH RAPGYCVNAG FHEPYNLWTD QAALDAFCFH WNMWAKRYKN ISAKKISFDL
LNEPGMRADM NDQHSKRGSV PGDVYRKVAL AASDAIWKEN KNHLIIADGN DTGSSVIPSI
ADLNIAQSCR GYNPGIISHY KAPWANKDPE SLPEPKWPGQ VGDKYLSRTM LETFYQPWIE
LVNKGVGVHC GECGCWNKTP HDVFLAWFGD VLDILSKNGI GFALWEFIGD FGILNSGRAD
VAYEDWHGYK LDRKLLELIR KA