Gene Slin_4180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4180 
Symbol 
ID8727939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5032765 
End bp5035737 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content54% 
IMG OID 
Productconserved repeat domain protein 
Protein accessionYP_003388965 
Protein GI284039035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.245505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTC CTTTCCGATG GATTGGGGTT ATTGTCTTAC AACTTTTCTC TGGTCTGGCG 
TTCTCACAAC TCACCATCAC CAGCCCTGTA CCAAGAATGG TCTTCCAGCG CAATTTAGCA
AATGAAGCCA GTGTTAGTAT AACCGGAATT GCCTCTTCTT CAGCAACAAC CATAGAAGCT
CGTTTTGTCC CCATGGCCGT TGGTCAGGGA AATGTTACGG ACTGGAAGCC GTTGAAATTT
CTACCTCAGT CGACGGCCTT TCACGGGCAG GTGACCGTAT CGGCAGGCTG GTACCGACTG
GATGTGCGAT CGCGGTCAGG GACAACTATC ACCGCCCAAA CCCAGGTCAA CCGCGTCGGT
GTGGGCGAGG TGTTTATTAT TGCCGGGCAG TCGAATGCCG AAGGTGGGTT TCAACGGCCG
CCCAGTTCGG TAGATGACCG GGTCATGTGC GTCGACTTCC GACAGGATAG CCTCAGTGAG
CAGTTGATCC CCTTACAATT TAGCCACATC AGTTACGGCA CCAGCATTGG TCCCAGCCAG
CCTCCCCATA TTTACAGCAT TTTAGGCGAT AAGCTCGCCC AGCGGCTTAA TGTTCCCATT
CTGTTTTTAG GCGCGGCTCT AGGCGGGACC AGCAGCGCCG ACTGGCAGCA ATCCGCAGCC
GGCAATATGG GCACCGGTCG TAATTCAGCC GTTTATCGGC GAATGGGGGC GGTTTTACTT
CATTATGTCA CCCGAACGGG GGCCCGGGCT GTTCTCTGGC ATCAGGGAGA AAGTGATCTT
CATTCGTCAA CACAGACGTA TTTCGACAAC ATCAAGTATG TCATTGAAAA AAGTCGGCAG
CAGCTTGGCG GGAAGCCTCT GGCATGGGCA GTTTCACGGG CAAGTTATAT CTTCGGGCAA
ACCAGTTCAT CTGTTATCGC GGCACAGAAC CAGCTCATCA ACAGTGTCTT TAACGTTTTT
GCCGGACCCG CTACCGATGG TATTACCGGC CCCGACAACC GGTTCGACGA TCTGCATTTT
GGCGGCAATG GGCTCTATCG GTTTGCCAGT GCCTGGGATG AGAGCCTGAC AGCCTCCTTT
TTTCAAAATG CACTGCCCGT TATGCCAATC GACTCGGTGT CGCTGATCAC GAGCGGTTAC
ACTATACCGC TTACCCGACG ACCGGGTGAG ACGGTTGCGG TCGCTTCTGT CCGCAGTGAT
GCGCATGAAT CGGACAATCA GTATGTTGCT CAGATTATCC GGGCCAGTGA TGGGGCGCTG
ATGGCCGAAT CCAGTCCAAC TACTGACAAC CCGATCTCGA TGGTGTTACC ATTTGCACTG
GCAAACGGGC AGTATCGTCT GCGTACCCGC TCTACTCATC CCGTTGTGCT GGGCACTCTG
GGCGAACCAT TCAATGTACA GCAGGATGCC ACCCCCCAGA CTCAACCCCC TATACAGCGA
CTACCTGTTA GCGGAGGAAC AGCCGATACA ACCATCAGAC GTTTTGCGTA TCGTTTTGAA
ACAGGCTCCC ATTCGTTTTA CGGGCTCATT CAGGCTACTT CGCCGGTAGA AGTTCGACTA
CAGAGTCTCG ACGGCAGCGG TTTCAATGAT TCTGACTGGC ACCTCGCTCC TCCCAGTTCT
CAGGCACCTG ATTATGATCA GTTCGCTGAC TTCAACTATA TCCGTAATTA CCCCCCTATT
GCCGGGGGAG TTGGTGGCGT GATTCCGGGG AGATACCGTT TTTCGATCCG TCGGCAGGGC
AACACCGGGC CGGGCTTATG GTATGAAATG ACGCTACTCA ACGGCCGAAA TATCCTATAC
TATCCCATGG AGCCCATTGG TACTGTTCCA CCAGTACTTA CCATCACCAA CTCGGTAACG
CCATGTCTGG TTGGCTCCTT TGCCGTAGCC GTCGACGTCG CCGAAAGTGC CCCGCAGGCA
GGCAATGTTT TCAGTGTCAA ACTGTCGGAT GCCAACGGCT CGTTTACCAA TGAAACAACC
ATTGGCACCG GTACAACCAG TCCTATTGCT GTCACCCTGT CGCCAACGCT TCCTGTGGGG
TCTAATTACC GTATTCGGGT TATTGCCAGC AATCCGGCGG TAGCCAGCGC CCCCAGCCAG
CCGTTTTCCA TTTGTGCCGG GGCCGATCTG TCCATGCAAA TGGCCATCAG TAATCGGGCA
TCGTTAACCA GTCAGCCCGT TACGTTAACA GTTGTACTAA CCAACGCAGG GCCAATGGAC
GCGACAAATG TAAAAGCCAG CAGCATACTT CCCGACGGGA TGAGTTTTGT CGATGCGGCA
TCAGGAGCCG TTAGTACAGC CGCAAACACC GTTTCCATCA ATGCGGGTAA TTTGCTCAAT
GGAGCCAGCA AATCGTTTGC GTTTCGGGTT AAACCTACTA AAAACGGCAC CTTTTTTACG
TCTGCCCAAA TTACGGCCAG TGATCAGTTC GACCCCGACA GCCAGCCCAA CTCAGGCACT
GGCGACGGGC AGGATGACGA AGGCAGCGTC GATTTGCGTA CACCCGACTC TGGTACGTTC
GTCAGCATAT CCCCCAATCC AGGCCAGGTA CCACTACCCC CCGTTCAGTC CAGCCAACCG
CCAGTAGACA ATACCAAAGC CGATCTGAGT TTGGCTATCG CCACCAACTC GCTGGTAGTT
TCCGCCAATC AGGTCGTAAA CATTCCTTTA ACCGTAAGCA ATTTGGGCGG GGCAAACGCG
ACAAATGTAT CCGTACAGGC GCTTTTACCC ACGGGCTGGC AATTAACGAC AACAGCGGGG
CTGACGGTCA GCGGGCAAAC TGTAAGCGGT ACCATTGGGT CCGTTGCAGC CGGTAGCACA
GGCACACTGG TACTTGTGGT GAAGATAACC CAGGCGGGTA CGCTGCAAGC CCAGATAGCT
GGCGCATCGC CTTCTGACCC CGACTCCACA CCGGGCAATG GCTACACGAA AGGCGAAGAT
GATGAGGCCA GATTAAGTCT GCGGATCAAG TAA
 
Protein sequence
MPFPFRWIGV IVLQLFSGLA FSQLTITSPV PRMVFQRNLA NEASVSITGI ASSSATTIEA 
RFVPMAVGQG NVTDWKPLKF LPQSTAFHGQ VTVSAGWYRL DVRSRSGTTI TAQTQVNRVG
VGEVFIIAGQ SNAEGGFQRP PSSVDDRVMC VDFRQDSLSE QLIPLQFSHI SYGTSIGPSQ
PPHIYSILGD KLAQRLNVPI LFLGAALGGT SSADWQQSAA GNMGTGRNSA VYRRMGAVLL
HYVTRTGARA VLWHQGESDL HSSTQTYFDN IKYVIEKSRQ QLGGKPLAWA VSRASYIFGQ
TSSSVIAAQN QLINSVFNVF AGPATDGITG PDNRFDDLHF GGNGLYRFAS AWDESLTASF
FQNALPVMPI DSVSLITSGY TIPLTRRPGE TVAVASVRSD AHESDNQYVA QIIRASDGAL
MAESSPTTDN PISMVLPFAL ANGQYRLRTR STHPVVLGTL GEPFNVQQDA TPQTQPPIQR
LPVSGGTADT TIRRFAYRFE TGSHSFYGLI QATSPVEVRL QSLDGSGFND SDWHLAPPSS
QAPDYDQFAD FNYIRNYPPI AGGVGGVIPG RYRFSIRRQG NTGPGLWYEM TLLNGRNILY
YPMEPIGTVP PVLTITNSVT PCLVGSFAVA VDVAESAPQA GNVFSVKLSD ANGSFTNETT
IGTGTTSPIA VTLSPTLPVG SNYRIRVIAS NPAVASAPSQ PFSICAGADL SMQMAISNRA
SLTSQPVTLT VVLTNAGPMD ATNVKASSIL PDGMSFVDAA SGAVSTAANT VSINAGNLLN
GASKSFAFRV KPTKNGTFFT SAQITASDQF DPDSQPNSGT GDGQDDEGSV DLRTPDSGTF
VSISPNPGQV PLPPVQSSQP PVDNTKADLS LAIATNSLVV SANQVVNIPL TVSNLGGANA
TNVSVQALLP TGWQLTTTAG LTVSGQTVSG TIGSVAAGST GTLVLVVKIT QAGTLQAQIA
GASPSDPDST PGNGYTKGED DEARLSLRIK