Gene Slin_3793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3793 
Symbol 
ID8727551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4558100 
End bp4559386 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTPR repeat-containing protein 
Protein accessionYP_003388587 
Protein GI284038657 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.085128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTA CTGCTCAACA GACTGCGGTT GAGTTCTTTG AAAGTGGAGT TAACCGCAGT 
AAATCAAGTG ATTTTACCGG TGCTCTCCAG GCCTTCAGTA TGGCCATTAC CATGAACCCC
GAAAATGCGC CCAGCTATTA CAACCGGGGG TTAGCCAAAG CTACTTTAAA AGATCACCGG
GGCGCCATTC TCGATTACGA CAAGGCAATC GAACTCAACA GTAAAGACGC CCTGGCTTAC
CTCAGTCGGG GCGTTAGCAA AAGTCGGCAG GAGGACCATC GTGGTGCTAT ACTCGACTTC
GGCCGTTCCA TTGAACTTAA CCCAGACGCC CCCCAGGCCT ATTATAACCG GGGAATTAGC
CGGAGCCGCC TTGACCAATA CCAGGGGGCT CTGACCGACT TCTCAAAAGC CATCGAACTG
GAGCCGGTCA ATGCCTATGC CTATTACGCT CGTGCGGTCA CTAAACAGAA ATTGAACGAC
TTCGCCGGAA GTATACTTGA TTTTACGAAA GTTATTGAGA TTAGCCCAAA GCGGGCACAA
GCCTATGCAG GCCGGGGCAC ATCGAAAGTT GAACTGAATG ACTTTACTGG TGCCATTACC
GACCTTAACA AAGCGATCGA GTTAAGCCCG CAAGACAGCG AATCCTATTT CCATCGGGGT
TATGCAAAAG GAAAGCTCGA CGATTATAAA GGTGCACTGC CTGATTATGA GCGGGCGCTG
GCGTTAAAGC CGGATCATTA CCGGGCCTAT TATGGGCGGG GTTTTTGCCG TAGCAAACTT
GGCGATCAAA AAGGGGCCGT TCAGGATTTC AACCAGGCTA TTGAGGTAAA CAATGTTTAT
GTCGAGACAA AAGTGGTTTA CAACGGCCGG ATAAGCCATG CTATCCTCGA TAACCTGCGG
AACGTCGTTC AGGAACGTGA CAAAATTAAC GAGTTAGGCA GCGAACGCGC CGAAGCCTAC
TTCAGCCGGG GCGTCAGTAA ACACAGGCAG GGTGACTCAA AAGCGGCCAT TATAGACCTT
ACTAAGGCCA TTGAACTCAA TCCGGCTTAT GCCGAAGCGT ATTTCACCCT GGGCCTAATC
AAGTCGGCCC AGGGTGACCA GAAGGGCGCC CTCACCGATT GCAACAGCGC GATCAAACTA
AAGCCCGGCT ATGCTGAAGT ATTCTATGTA CGCGGGCTCA TCAAGCATAG TCTGGGCGAC
GAAAACGGCG GCTGCCTGGA TCTGTCCAAA GCCGGTGAGC TAGGTTACAC CCCGGCATAT
AAGGTGATTA GTGAGTATTG TAATTAG
 
Protein sequence
MTVTAQQTAV EFFESGVNRS KSSDFTGALQ AFSMAITMNP ENAPSYYNRG LAKATLKDHR 
GAILDYDKAI ELNSKDALAY LSRGVSKSRQ EDHRGAILDF GRSIELNPDA PQAYYNRGIS
RSRLDQYQGA LTDFSKAIEL EPVNAYAYYA RAVTKQKLND FAGSILDFTK VIEISPKRAQ
AYAGRGTSKV ELNDFTGAIT DLNKAIELSP QDSESYFHRG YAKGKLDDYK GALPDYERAL
ALKPDHYRAY YGRGFCRSKL GDQKGAVQDF NQAIEVNNVY VETKVVYNGR ISHAILDNLR
NVVQERDKIN ELGSERAEAY FSRGVSKHRQ GDSKAAIIDL TKAIELNPAY AEAYFTLGLI
KSAQGDQKGA LTDCNSAIKL KPGYAEVFYV RGLIKHSLGD ENGGCLDLSK AGELGYTPAY
KVISEYCN