Gene Slin_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3083 
Symbol 
ID8726836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3738334 
End bp3740808 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content31% 
IMG OID 
ProductTIR protein 
Protein accessionYP_003387893 
Protein GI284037963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.846458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAG CCTTTCTATC ACACAGTAGC GTTCAGAAAC AGTTTGTCAC ACGAATTGCC 
AATGAGTTAG GCTCATCAAC TTCAATAATT GATTCGAGAT CTTTTGAAGA AGGAATGAGC
AACATAGAAG AAATTCAAAA TGCTCTCGAC CAAACGGACA TTTTCGTACT TTTTTTATCA
AATGAAGCAT TAAATTCTAA ATGGGTAAAA GACGAGATTT TGATAGCTCA CGAGAATGTA
AAGAGAAATA TCATAAAGAG AATATACCCG ATAATAATTG ACGAAACAAT TACTCACAAT
GATTTAAGAA TTCCCGACTG GTTAAAGGAA CTAAATATTA AAACTGTTTT AAGACCAGGT
AAGGTTACTA GATTAATAAA TAGAAGGCTA AAAGAAGTAA GCTGGCAACT CCATCCCGTT
TTAAAGGAGA AAGAGTTACT TTTTGTTGGA AGAACTGAAC AAATAAAGAC CCTACAAGAA
CGGCTGTATG ACTTCACGAA GCCTACACCT TTTTGTATAA TAGCGTCAGG TATACCGAAA
ATAGGAAGGA AAAGTTTCCT AAAAAATGCT TTATTAAATA CCCATACTTT AACTAGAGAA
TCTCACGATC TACCAACTAT CACAATAAGT AGGCGAGAAA GTATAGAGGA TTTTATATAC
AAAGTCTATG ATCTTGGAAA TTCTCCAGAA CAAGTTTTCC CCGATTTGTT GAGTACTACT
CTAGCAGATA AAGTTACTCT AGCAAGTAAC TTATTAGCTG ATTTTCAGGA GGCAAGCGAA
ATATTATTAA TAGAAGACTT AGGAGGTATT ATTCAAGCAG ATCGTTCTAT TTGCAATTGG
TTCAAAGATA TACTATTAAA TAGCCAAGAG AATTCCTTAG AGAGTCGAAT TACAATTTGT
GTTGCTTCAA GGAATAGAGC CTTTGGTCAC TTGGCAAATA ATATACCGCA AATATTTCAA
ATTGAAATTC CAGAGTTGAC TATAAGTGAC CGGAATGGTT TACTAAAAAG ATATTCAACA
ATAAATAAAC TATCTATAAA TAACGAAAAT CTAATTTTTT TCTCTAACCT TTTGAATGGT
TTTCCAGAAC AAGTACATTT CTGCGTAGAT TTAATCATAA ATGAATCACT TGCATTTGCA
AAAGCAAACA CTTATTTAAT AAAGAACTAT CAAAGCGAAA TTTTTTCACA AATTATTTCT
GATATTGATA CAGATATAAA TACAAAAAAC ATATTGTGCC TTTTAGCTGA TTTTGAATTT
ATTAGTTTAG AGTTAATTTT AACATTTATC AATAAAGATG AAATAGATTT AGTACAGGGT
ATAATCAATA AACTTGTTGG CTGGTCAATA ATAGAGTTCC TAGGTGTAGA TAAAGAATAC
TTCAGGTTAA ACGATGGCAT AAAAGACTAT TTACAAAGGG CTAAGTATTC TTTGCCAGAA
TCCTATAAAT CTAAGTTAAA AAAACACATT CTTGAATTTG TCCAAACAGA TCAATTAGTT
AGTGAAGATA TCTCGGATTT TTTCTTTTCT ATGAGAGGTG CACTCTTAAG TAATTATAAC
TTCAAGAGTA GTTATTTGAT TCCTTCTCAC TATCTGCAAA CAATGATACA CTTGTATGAA
AAAGAACAGG ATTATTTAAA TGTAATTGTG CTTGCTGATA GAGTACTACA AAAAGCTGAC
AAATTAGATT ACAGTATAGT TAGAGAAATA AAATACTGGT TATGTTTATC ATTAGCTAGA
AAAAAACGAG AAAGATTTAA AGACGAAGTG CAATTCTTCA AAGGTGCAGA TTATCAATTT
TTATTCGGCT TTTTCAATAG AATTATTGGT AAATCTGGAT TTGCATTAAA TAACTTCAAC
AAGGCACTTG AGGAACGTCC ATCTTTTCAT AGAGCTAAAA GAGAGTTAGT TAAAGTCTAT
ATTAATTTAG GCATGTATGA TGATGCATAC GATTTAGCTA AATCCAATTA CATCAATGAT
AAAAACGACC CCTACCATGC ACATGCATAT TTTGATTGTT TAGTAAGAAG AAAAGATGCG
ACTAGCCATG CTAAAACTCT TAGAGACCTG ATTGAAAACC TGGAAAAAAT AAATACGAAT
AAAGCAAAAG AGATGGCACT GTTATCTCAA GCTGACTATG ATTATTATAT TAATAATGAT
CCTGAAGAAG CATTAAATAC AATAAGAAGA TTTATATATT TATACGGATT AACATACTAC
GTCATAACAA AAAAATTTGA TATTGCTGAA AAAGAAAGAA ATGTTGACCT AATGCTAAAA
GTAATAGACG AAGTAAAGAA CAGTCATATT AATTATAAAT TTGACGAAAA CGTGATAAAT
ATAATGGAGG CTAAATGTCT TTCACATCAA GGTAGAATAG ACGACGCACT ATATATAATA
GATAACTTAT TAGATGTCCC TATTTCGTAT GTAATTTACT TAAAAGAGTA CGTCAATAAA
TTTACTATTA AATAG
 
Protein sequence
MPKAFLSHSS VQKQFVTRIA NELGSSTSII DSRSFEEGMS NIEEIQNALD QTDIFVLFLS 
NEALNSKWVK DEILIAHENV KRNIIKRIYP IIIDETITHN DLRIPDWLKE LNIKTVLRPG
KVTRLINRRL KEVSWQLHPV LKEKELLFVG RTEQIKTLQE RLYDFTKPTP FCIIASGIPK
IGRKSFLKNA LLNTHTLTRE SHDLPTITIS RRESIEDFIY KVYDLGNSPE QVFPDLLSTT
LADKVTLASN LLADFQEASE ILLIEDLGGI IQADRSICNW FKDILLNSQE NSLESRITIC
VASRNRAFGH LANNIPQIFQ IEIPELTISD RNGLLKRYST INKLSINNEN LIFFSNLLNG
FPEQVHFCVD LIINESLAFA KANTYLIKNY QSEIFSQIIS DIDTDINTKN ILCLLADFEF
ISLELILTFI NKDEIDLVQG IINKLVGWSI IEFLGVDKEY FRLNDGIKDY LQRAKYSLPE
SYKSKLKKHI LEFVQTDQLV SEDISDFFFS MRGALLSNYN FKSSYLIPSH YLQTMIHLYE
KEQDYLNVIV LADRVLQKAD KLDYSIVREI KYWLCLSLAR KKRERFKDEV QFFKGADYQF
LFGFFNRIIG KSGFALNNFN KALEERPSFH RAKRELVKVY INLGMYDDAY DLAKSNYIND
KNDPYHAHAY FDCLVRRKDA TSHAKTLRDL IENLEKINTN KAKEMALLSQ ADYDYYINND
PEEALNTIRR FIYLYGLTYY VITKKFDIAE KERNVDLMLK VIDEVKNSHI NYKFDENVIN
IMEAKCLSHQ GRIDDALYII DNLLDVPISY VIYLKEYVNK FTIK