Gene Slin_3166 details


Gene Information

Locus tag: Slin_3166
Symbol:
ID: 8726919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3835438
End bp: 3836574
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: protein of unknown function DUF692
Protein accession: YP_003387976
Protein GI: 284038046
COG category:
COG ID:
TIGRFAM ID:
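
The reported lengths follow from the coordinates. The sketch below is a quick consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3835438
end_bp = 3836574

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1137 378
```

Both values agree with the Gene Length and Protein Length fields.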


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.681272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGAC TATACTCATC CCTTGCCTGC AATCTGGATA CCAACAGCTT ACAGGCATCG 
CTTCCTCTTT TTGAAGCTGA AAAAGTTCAG GCCATTGAGT GGTCGTTCGA TACACTGTTT
GCCCTTGGCG AAATGCCCGA CTGGTTTGCG GAATTGCTCC GCGCTTACAG TCAGGAAGGT
CGATTGATTG GGCACGGCGT TTTCTTTTCT CTCTTCTCCG GCAAATGGAC GCCAGACCAG
CAGCAGTGGC TGAAGCAGCT CAAGGCATTG TCGGCCGACT TTACTTTCGA TCACCTCACT
GAACATGTCG GTTTTATGAC CGGTGAGGAT TTTCATAAGG GTGCCCCCAT CAGCATTCCT
TTTACGACCT CAACGCTGGC GATAGGCCGC GACCGGCTGC TACGGCTCCA GGATGCGGGT
AACTGCCCGG TTGGTCTGGA GAACCTGGCA GTCGCTTACT CGCTCGACGA TGTAAAACGG
CAGGGTGACT TTCTGGCTCA ACTGCTCGAA GCAGTCAATG GATTCATTCT TTTAGACTTA
CATAATCTGT ATTGCCAAAG TCAGAACTTC GACCTAGGCA TTGCTGATAT ACAAGCGCTG
TACCCCCTCG ACCGTGTCCG CGAAATTCAT ATATCGGGTG GAAGCTGGGT GCCATCCACC
GTCAATCCGA CAAAACAGAT CCGGCGGGAC ACGCACGATG AGTCAGTACC AGCAGCGGTT
TTCCACGCGC TGCAACAAGT TATCGGCCAA TGCCCTAACC TGAAATATGT AGTGCTTGAG
CAACTGGGCA CTGGCCTCAC GACAGATGTA AGTCGTCAGC ATTTTCGGGA GGACTTCTAT
ACGATGGATG CCCTTATTGA AGCTACCAAC CAGCTAAACA GTCATTCGCC GATCAACTCT
TTCCTGCCTT TATCTGAAAC AAGCATTCCC GAAACGCCAA TGGAGAATCC ATTGCTTAAC
CAACAACAGA CTGAACTATC GGCCATACTG GAAACCGCTA CGGATTATGG TCAGGCTCAG
TTGTTTCTGA ACGCATCAAG TCTGGCGAAT TCAGATTGGA ATATCGAGAA CTGGCAACCG
GAAATGCTCG AAACAGCCCT TGCCATCGCT CAGAAATGGA AAGATGGGTT GGTGTAG
 
Protein sequence
MSRLYSSLAC NLDTNSLQAS LPLFEAEKVQ AIEWSFDTLF ALGEMPDWFA ELLRAYSQEG 
RLIGHGVFFS LFSGKWTPDQ QQWLKQLKAL SADFTFDHLT EHVGFMTGED FHKGAPISIP
FTTSTLAIGR DRLLRLQDAG NCPVGLENLA VAYSLDDVKR QGDFLAQLLE AVNGFILLDL
HNLYCQSQNF DLGIADIQAL YPLDRVREIH ISGGSWVPST VNPTKQIRRD THDESVPAAV
FHALQQVIGQ CPNLKYVVLE QLGTGLTTDV SRQHFREDFY TMDALIEATN QLNSHSPINS
FLPLSETSIP ETPMENPLLN QQQTELSAIL ETATDYGQAQ LFLNASSLAN SDWNIENWQP
EMLETALAIA QKWKDGLV
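
The protein sequence can be checked against the gene sequence by translating codons with translation table 11. The following is a minimal sketch in plain Python, not the database's own method. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases and the last line of the gene sequence are used as input. The sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order, as listed for NCBI
# translation table 11 ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and newlines used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1; trailing partial codons are dropped."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and the last line of the gene sequence.
gene_start = "ATGTCCAGAC TATACTCATC CCTTGCCTGC"
gene_end = "GAAATGCTCG AAACAGCCCT TGCCATCGCT CAGAAATGGA AAGATGGGTT GGTGTAG"

print(translate(gene_start))  # MSRLYSSLAC
print(translate(gene_end))    # EMLETALAIAQKWKDGLV*
```

The two outputs match the first ten and last eighteen residues of the protein sequence, with the trailing `*` marking the TAG stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.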