Gene Slin_0166 details


Gene Information

Locus tag: Slin_0166
Symbol:
ID: 8723894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 203659
End bp: 204897
Gene length: 1239 bp
Protein length: 412 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003385031
Protein GI: 284035101
COG category:
COG ID:
TIGRFAM ID:
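
The start, end, gene length and protein length fields can be cross-checked against each other. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the page does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(203659, 204897)
print(length, protein_length(length))  # prints: 1239 412
```

Under those assumptions the computed values agree with the listed gene length of 1239 bp and protein length of 412 aa.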


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 3.85676e-08
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.52095e-15
Fosmid hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCAATCAG CGATAGAAAC AACCTTGGAT ATACAGCAAA TACGCCGGGA TTTTCCCATA 
CTCGATCAAC AGGTGAACGG TCGTCCGCTG GTGTATTTAG ATAATGCCGC CACCAATCAG
AAACCGACGG CGGTCATCAA GGCCCTGACG GATTATTACG AAGGGTACAA TGCCAACATT
CACCGGGGTA TTCACCACCT GGCCGAAAAA GCGACGGCGG CTTTCGAAGC GTCGCGCCGG
GCATTTCAGG ATTTTTTGAA TGCCAAACAC TGGCAGGAGA TCATCTTCAC GTACGGCACC
ACCGATGGCA TCAACCTGGT GGCGCAAACC TACGGACGCC AGTTTCTGAA CGAAGGCGAC
GAGATCATTA TCTCGACCAT GGAGCACCAT TCCAACATTG TGCCCTGGCA GATGCTATGT
GAGGAAAAAG GCTGCATCCT GAAAGTCATT CCCGTCAACG ACGAAGGTGA ACTGCTCATT
GACGAGTACG AAAAGCTGCT GACGGAGCGC ACTAAATTCG TTTCGTGCGT CCATGTGTCG
AACTCGCTGG GCACCATCAA CCCCGTCAAA ACCATCATCG ACAAAGCCCA TGCGGTTGGC
GCGGTGGTGC TGATCGACGG TGCACAGGCC AGTTCGCACC TGGAACTCGA CGTACAGGCG
CTTGACGCTG ATTTTTATGT TCTGTCGGCT CATAAATTAT ATGGACCAAC GGGCATGGGC
GTATTATATG GTAAAAAAGA ACTCCTCGAT GCCATGCCTC CCTACCGGGG TGGTGGCGAA
ATGATTAAGG AAGTAACGTT CGCCAAAACG ACCTATAACG AGATTCCCTA TAAATTTGAA
GCGGGTACAC CCAACATTGC CGATGTGATT GCCGTCAAAA CGGCTCTCGA CTACATGGCA
GGTCTGGGTA AAGAGAACAT TGCGGCTCAC GAAAACGATC TGCTTCAGTA CGCCACCGAG
CAATTGAGCG AGTTGGACGG TCTCCGTATC ATTGGCCGGG CAACGCACAA AATTGGCGTT
GTTTCGTTTG TGCTCGACGG CATTCACCAT CAGGATACGG GCGTTATTCT GGACCAACAG
GGCATTGCCG TCCGGACGGG TCACCATTGC ACCCAGCCGC TCATGCAACG CTTTGGTATT
GCCGGAACTA CGCGGGCATC GTTCGCGGTT TATAACACCA GAGACGAAAT CGACCGGCTT
GTTCAGGGCC TTCGACGGGT TCAGAAAATG ATGTTATAA
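
The sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any calculation. The Python sketch below is one possible way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used here; the full block can be pasted in its place.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in grouped blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the whole block to check the full gene.
first_line = "ATGCAATCAG CGATAGAAAC AACCTTGGAT ATACAGCAAA TACGCCGGGA TTTTCCCATA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```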
 
Protein sequence
MQSAIETTLD IQQIRRDFPI LDQQVNGRPL VYLDNAATNQ KPTAVIKALT DYYEGYNANI 
HRGIHHLAEK ATAAFEASRR AFQDFLNAKH WQEIIFTYGT TDGINLVAQT YGRQFLNEGD
EIIISTMEHH SNIVPWQMLC EEKGCILKVI PVNDEGELLI DEYEKLLTER TKFVSCVHVS
NSLGTINPVK TIIDKAHAVG AVVLIDGAQA SSHLELDVQA LDADFYVLSA HKLYGPTGMG
VLYGKKELLD AMPPYRGGGE MIKEVTFAKT TYNEIPYKFE AGTPNIADVI AVKTALDYMA
GLGKENIAAH ENDLLQYATE QLSELDGLRI IGRATHKIGV VSFVLDGIHH QDTGVILDQQ
GIAVRTGHHC TQPLMQRFGI AGTTRASFAV YNTRDEIDRL VQGLRRVQKM ML
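
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below builds a codon table by hand rather than using a library; the amino acid string is assumed to match the sense-codon assignments of NCBI translation table 11, and all names in it are illustrative. It does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
# Assumed to match translation table 11 for sense codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAATCAG CGATAGAAAC AACCTTGGAT"))  # prints: MQSAIETTLD
```

The result matches the first ten residues of the protein sequence, and translating the last nine bases (ATGTTATAA) gives ML followed by a stop, matching its final two residues.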