Gene Slin_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1904 
Symbol 
ID8725641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2302570 
End bp2303883 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content52% 
IMG OID 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_003386748 
Protein GI284036818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.883513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.169794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAT CTTTTGTAGT AATTACCTGT CTGGCCACGT CTTTGTTGAC GCTGAACGGT 
GTGTCTGGGT ACCCCACCAA ACCGCCCGTA AAAGTTGCCG CTAATGCGAA GAAATCAATT
ACCGGAGCAG ATGTGAAGCT ACCCGCTGGC TTTTCGGCCA CCGTCGTAGC GGAAGAAGTA
GGAGCAGCCC GGCATATCGT CGTTACGAAA ACCGGCGATA TTTATGTGAA ACTGGCCAAG
CTAAAGGACG GTAAGGGCAT CTACCGCTTG CGCGATACCA ATAAAGATGG CGTAGTCGAT
GAGCGGACGG GCTTTGGCGA TTATCCAGGC ACGGGTATTT TCATTCGGAA TGGGTATTTG
TATACCTCGT CCAATAACAG CATTTTCCGG TATAAGCTGA ATGAAAATCA GGAAGTGGTC
AACCCGGATG CGCCCGAGAA ACTGGTATCC GGGCTGCGGG AAAAAGACCG CGATAAGTCG
AAGTCCATTG CCGTCGACAA TCAGGGCAAT ATTTACGTCA ACATCGCTTC GGATAACGAC
GCCTGCCGCG AGGCCGGAAC GGGAAAAGGC TTGATGCCCT GCCCACTGCT CGACTCGGCG
GCTGGTATCT GGCGGTTTAA AGCTGATGTG CCTGATCAGC CCTTCTCCAG CGGTGTACGA
TTTGCTACCG GCCTGAAAAA CGTTGTAGGG CTGGACTGGA ATAATAAAAC CAACTCCCTG
TTCGTACTTC AGCACGGCCG GGGTAAGTTC GATGATTTCT ACCCACAGTA TTACACGCCT
AAACAGAGCG CTGAGCTACC CGCTGAGACG ATGTATGAAG TGCATCAGGG CGACGATGCA
GGCTGGCCTT ACGTTTATTA CGACCATTTC CAGAAAAAGA AAATTCTGGC TCCGGAGTAT
GGTGGTGATG GCAAGAAAAC CGGAACGGCC AAAACGATCA ATCCGGTGGC CGCTTTCCCA
GCGCACATGG GTCCTAATGG GCTGCTATTC TATACCGGAA CGGCTTTCCC GGAGAAGTAC
CGCAATGGGG CGTTTATTGC CTTCCATGCG CAGTCGCAGG AGTTGCATAA GGGCTATTTG
ATTGGCTTTG TTCCGTTCAA AAACGGCAAG CCATCGGGTC CGTGGGAAAT CTTTGCCGAT
AATTTTGCTG GTACGGATCT GGTGAAGCCA ACGGGCCCCG TTCAGCACCG GCCCTGCGGT
CTGGCACAAG GCCCCGACGG TTCACTGTAT GTGACCGACG ACTTGAATGG AACGCTGTTT
AAGATCAGTT ACCAGGCAGC AAACCATAAA GCAACCGCGT CGTCTAAGAA GTAA
 
Protein sequence
MNKSFVVITC LATSLLTLNG VSGYPTKPPV KVAANAKKSI TGADVKLPAG FSATVVAEEV 
GAARHIVVTK TGDIYVKLAK LKDGKGIYRL RDTNKDGVVD ERTGFGDYPG TGIFIRNGYL
YTSSNNSIFR YKLNENQEVV NPDAPEKLVS GLREKDRDKS KSIAVDNQGN IYVNIASDND
ACREAGTGKG LMPCPLLDSA AGIWRFKADV PDQPFSSGVR FATGLKNVVG LDWNNKTNSL
FVLQHGRGKF DDFYPQYYTP KQSAELPAET MYEVHQGDDA GWPYVYYDHF QKKKILAPEY
GGDGKKTGTA KTINPVAAFP AHMGPNGLLF YTGTAFPEKY RNGAFIAFHA QSQELHKGYL
IGFVPFKNGK PSGPWEIFAD NFAGTDLVKP TGPVQHRPCG LAQGPDGSLY VTDDLNGTLF
KISYQAANHK ATASSKK