Gene Slin_5965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5965 
Symbol 
ID8729746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7232715 
End bp7233911 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content51% 
IMG OID 
ProductPSP1 domain protein 
Protein accessionYP_003390726 
Protein GI284040796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.565053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTGCA AATCTTGTGC AACCGGCGGG TGCGGAACCC AGCGTGGCGC ATCTGATGGC 
GAGAAAACAG CCACTAAAGG ATGCGGTAGC GGAGGGTGTA GCACTGGAGG CTGCAATAAA
CTGAATTCGT TCGACTGGCT GAGTGATATC GCCATGCCGG GCCAACGATA CGATGTAGTT
GAGGTTAAAT TTAAGGGTGG ACGAAAAGAA TATTATCGCA ATGTCCATCT GCTCGATTTG
ACTACAGGCG ATTATGTGGT GGTCGAAATG CAATCGGGTT TTCACATTGG CGCGGTTTCC
CTACAGGGCG AACTGGTTCG GCTACAGGTG AAAAAACGAG GGGTTAAAAT TACGGACGAT
ACCAAGGTTA TTCATCGGAT TGCTACGCCG AAAGATATGG AACGGCACGA ACAGGCCGTT
TTGCGCGATT TGCCAGCTTT GTACCGTTCC CGCGAAATAG CCCGGGAACT GAAGCTGAAT
ATGAAATTAT CTGATGTTGA ATTTCAGTCC GATAATACGA AAGCAACCTT CTATTACTCG
TCTGAAGAGC GGGTCGATTT TCGTGAGTTG ATCAAGATGC TGGCCGGTGA GTTTAAGGCT
CGTATCGAAA TGCGCCAGAT CAGCCTCCGT CAGGAAGCGG GTCGTCTGGG GGGCATTGGC
TCCTGCGGGC GCGAACTCTG CTGCTCAACC TGGCTTACCG ATTTCAAGAA TATTGCGACA
TCGGCCGCCC GGTACCAGAA TCTGTCCCTG AACCCGGCTA AATTATCCGG TCAGTGTGGC
CGGTTGAAAT GCTGCCTGAA TTACGAGCTG GATACGTACA TGGATGCCCT CCGGGATATT
CCAACGATTG AGAAACCGCT GGAAACCCAA AAGGGACGGG CTTATTTACA GAAGACCGAC
ATCTTCCGGA AGCTCATGTG GTTTGGCTAC AGTGCCGAAA GTACCTGGCA TCCGTTGCCC
ATTATTCGGG TACTGGAAAT CGTTGAGCTG AACAAACGAG GTATCATTCC CGAATCGTTC
GAGGTGTTGA CCCCGATAGG AGAGAAGGAG CCCACTACGG CAGCCGCCCT CAATAGCGAT
CTGCAAAAGC TGGACGCCAA GTACACGACC CGGACGCCCT CCAAGAAGAA AAAGAAAAAA
TCGAGAGGCC CCGAAGGTGG TGCGCCAAGA CCGGCTCAGA ATGGTAAAGT CAATTAA
 
Protein sequence
MSCKSCATGG CGTQRGASDG EKTATKGCGS GGCSTGGCNK LNSFDWLSDI AMPGQRYDVV 
EVKFKGGRKE YYRNVHLLDL TTGDYVVVEM QSGFHIGAVS LQGELVRLQV KKRGVKITDD
TKVIHRIATP KDMERHEQAV LRDLPALYRS REIARELKLN MKLSDVEFQS DNTKATFYYS
SEERVDFREL IKMLAGEFKA RIEMRQISLR QEAGRLGGIG SCGRELCCST WLTDFKNIAT
SAARYQNLSL NPAKLSGQCG RLKCCLNYEL DTYMDALRDI PTIEKPLETQ KGRAYLQKTD
IFRKLMWFGY SAESTWHPLP IIRVLEIVEL NKRGIIPESF EVLTPIGEKE PTTAAALNSD
LQKLDAKYTT RTPSKKKKKK SRGPEGGAPR PAQNGKVN