Gene Slin_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3801 
Symbol 
ID8727559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4568471 
End bp4571983 
Gene Length3513 bp 
Protein Length1170 aa 
Translation table11 
GC content58% 
IMG OID 
ProductIg family protein 
Protein accessionYP_003388593 
Protein GI284038663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000269985 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.432666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA ACTTTACCCG GTTTCGTATG AAGAATTCCG CTCACCTCTC CGCCTGTCTG 
CTGGGAGTGG CCCTGTGGCT GATGAGCGCC CTGAGTCTGC GGGCTCAAAC GGTGTATGTT
ACCCAAGAGG GAGCCGGTCA GCAAAGCGGT GCCGACTGGG CCAACGCCCT GCCCGGCAGT
CAACTCCAGG CTACCCTGGC CTCGGCCTCC GAAGGAGCTG AGTTTCGATT GGCGGGAGGA
ACCTACAAGC CCAGCCAGAC GGGTGATCGT GGCTTAACCT TTACTATCCC TTCGGGGGTA
AAGGTGCTGG GGGGTTATCT GGGCAACGGT GCCAATCCCG ACCAGCGTAT AGACTTTGCC
AGTACCGACC AGCCCAGCAG CACAACCCTC TCAGGTGATA TTGATAACAA CAACCAGTTG
GATGCGGGGA ACAGCGAACA CGTCGTTAAT TTTAACAAGG CCAGTGAGCA GACCCGTCTG
GATGGTGTGG TCATTACAGG AGGAAATGCG ACCAACGGTG GTGGAGGAAT CTACAACAAT
GGCAATCGAG GAGTCAGCAG TCCAACTATT CAGAATTGTG TAGTAAGTCA GAATCGTACG
AATGGGAAGG GCGGAGCCAT CTTAAACGAT GGAAGTTCGG GCCAGGCCAA TCCCGTGCTT
ATTAATTGCC GCTTCGTCAG TAATCAGGCC AATCAGGGAG GAGCTATTTA CAATGACGCG
TTTCTGGGCA CCTGTAAGCC TACTTTCACG AATTGCTCCT TTCTGAACAA CTCAGCCAGC
AATGGCGGAG CGATATACGA TTATGCCGAA GCGAATGGTC CGTTTGGCCC CCCTGGTCCA
GGTGACAATC GTCCTCTTCT GGTAAACTGT GTGTTACAGG CAAATACCGC TTCTGCCACT
GGTGGCGCCA TGGTTAATGA AACCAGAGGA GCCGGTCTTC CAACCACTGA GCCAACCCTG
ATCAACTGTA GCCTGGTGAG TAATTCGGCC CCCCAGGGAG GGGCTTTTTA TAACATTGCC
ACGCCGAATA CAATTAATAA TACCCAAATA TTTGCCAAGC CCAGGTTGTA CAATAGCTTC
TTATGGAACA ACGGGGGGGG CAATACCACG GTAAACATCA AGTTTAAGAA CAACTCCGGT
GGAACCAGTA GCGGAGAAGG GCAGATTCTA TTTTATAACT GTCTGGGCGA TCCGGGGGTG
AATAACGCCC TGAACACGGA CCCCAAAGCC CAAATCATCA CCACATCCCC GTTTGTGAGC
GCGTCGAACT TACAGCTCAA TCCCTGCTCC CCGGCCATTA ACACCGGCAA CACCAGCTAC
TACACCGACC GATCGAGCCA GCAAACAGAC CTGGCCGGTA ACCCCCGGAT GGTGGGTGCC
ACCATCGACA TCGGTGCCTT CGAGTTCCAA GGCACGCCTG CTATCCCCCT GGCCATCACC
CAGCCACCCG CCAGTCAGTC CAGCGTCGTG GCCGGTGCGA CCGTCGAGAC AACCGTCGGC
CTCAATGCAC CGGCCGACAG CTACACATGG TACAAAGACG GAATCGTCGT GACGGGGCAG
ACCTCAGGTA GCCTCCGGCT CACTAATGTG CAACTCGCTC AGGCGGGTTC CTACTCGCTG
GTGGCCACCA GTGCCTGCAA CAGCGTTACC TCCACCGCCT TTAGCCTCTC CGTTACCCTG
TCACGGCTCA TCACCCTCAG TGGCCTCTCG GTCAGTCCCA ATCCCGTCTG TGCGGGCCAG
TCTATCTCGG TGGAGGCCAG CGTTGGCAAC CTCAGTGGCA GCTACAGCTA CACGCTCACC
AATGGGATCA ACCCCATCAG CGGCACGGCC ACCACCTCAG CCTTCAGTCA GTCGCTGACG
GCCACAGGCT CAGGGGTGCA GAGCTTCACC CTGACGGTGA GCAGCGGTGA ACAAGTAGCC
ACGGCCACGA CCAGCTTAAC CGTCAATGCG CCCCCCAGCC CGACGCTCAT CAGCAGTGGT
ACCTTAAGTT GTGGGCAGAC TTCCCTCACC CTGACAGCCA GTCCGGGTGA GCAGAGCTAC
CGCTTTAGTG GTCCCAGTGT GGTGAGCCAG AGTGGCAACA CGGCCCTCGT CAACGCACCG
GGCACCTACT CGGTGACCAG CACGAATGCC AGTGGGTGCG TGAGTACGAC CAGCACGACC
GTCTTCAGCA ACACGGCGGT GATCACCATG AATAATCCCC CTACCAGCAC TGCAACCCTC
AATGCGCCCT TCAGCCAAAC CTTAACGGCC ACGGGTGGGG CAACGCCCTA CTTCTACAGC
CTGGCTAGCG GCAGTCTACC TGCCGGACTG AGTTTGACCC CAAGTGGTCT GCTCAGCGGC
ACGCCAACCC AGGCCGGCAG TTTCACCCTG GTAGTGCGGG GCCAGGACGC CAATGGCTGT
TTCGGTCTGG GACCTGCCTA CGTCTTGACG GTTAATGCTA CGGCTGTCAT CAGCGGCTTT
ACTTCGCTGG AGAACACGGT CTGCGTGGGC AGTCCGGTAA CCTTTACCGC CACCGTGGGT
AACGTAACGG CCCCCTACAC GTATACGCTT ACCAATGGCA CCAGCACCAC CACTGGCACA
ACCAGTGGTG GTTTCAGCCA GCACCTGACG GCCACAGGCT CGGGGGCGCA GAGCTTCACC
CTGACGGTGA GCAGCGGTGA ACAAGTAGCC ACGGCCACGA CCAGTGTGAC GGTGATGCCT
ACGCCACCAA CACCCACCAT CGCTACGCAG AGTGGGCAGT CCTACCCCGG TGGTCAATCG
GCCCTCACCG TGGCTCAGTA CTCGGGTACG GTCACCCTGC TCATCAACGG CTGTAGCGGT
ACCATCAACT GGCAGGGGCC CAATGGCAGC AGCGGCAGCA CCACCAGTAT CCCGGTAGCG
ACCTCCGCCA CCGGTACGTT TGTCTACCAG GCTACCTGTC AGCAAACGGG CTGCTTGAGT
GCGCCCGCCA GCGCTACAGT CACCGTGCAG GGCGCACCCC TGCGGGTGAT CACTCCGCTG
TTCGACTGTG CCAGCGGCAA ACTGACCCTG CGCACCACAG GTGGTAACGG CCAGCCCATT
GAGTACCAGA TTCCCTCGGT GACCACTGGC TGGGAAGCTA CCAACCCGGT GAGCATCCAG
GCCAAAGATT TCAAAAAGAG CCTCAAGCTA CGGGCTCGTC AGCGAAGCGT GGGTAAGGGC
GGCTTTGAGA GTGACGAGCT GGACTACCAG TTACCGGCCT GTCCGGGAGC CCGTGTAGCT
TCGCCAGAAA CAGACACTGA GCTGAGGGTG GTGGTTCTGG ACAATCCCAT AACGGGTCAG
GCAGTGGTGG TAGAGGTGCG GGGTGCCGAA GGCCAACCCT TGCGATTGGC CTTAACTAAT
TTACAGGGTC AACCGATCAG CGAGAAGACC CTAGAACAGG CCAGGGGAGT AGAGACTCAA
AGCTTATCCG TAGGTTCCCA GGGGGCAGGT GTGCTGTTGT TGCGGGTGAG CACCGCTGGT
CAAACGAAGA CGCTCAAAGT GCTTAAGCTC TAA
 
Protein sequence
MKNNFTRFRM KNSAHLSACL LGVALWLMSA LSLRAQTVYV TQEGAGQQSG ADWANALPGS 
QLQATLASAS EGAEFRLAGG TYKPSQTGDR GLTFTIPSGV KVLGGYLGNG ANPDQRIDFA
STDQPSSTTL SGDIDNNNQL DAGNSEHVVN FNKASEQTRL DGVVITGGNA TNGGGGIYNN
GNRGVSSPTI QNCVVSQNRT NGKGGAILND GSSGQANPVL INCRFVSNQA NQGGAIYNDA
FLGTCKPTFT NCSFLNNSAS NGGAIYDYAE ANGPFGPPGP GDNRPLLVNC VLQANTASAT
GGAMVNETRG AGLPTTEPTL INCSLVSNSA PQGGAFYNIA TPNTINNTQI FAKPRLYNSF
LWNNGGGNTT VNIKFKNNSG GTSSGEGQIL FYNCLGDPGV NNALNTDPKA QIITTSPFVS
ASNLQLNPCS PAINTGNTSY YTDRSSQQTD LAGNPRMVGA TIDIGAFEFQ GTPAIPLAIT
QPPASQSSVV AGATVETTVG LNAPADSYTW YKDGIVVTGQ TSGSLRLTNV QLAQAGSYSL
VATSACNSVT STAFSLSVTL SRLITLSGLS VSPNPVCAGQ SISVEASVGN LSGSYSYTLT
NGINPISGTA TTSAFSQSLT ATGSGVQSFT LTVSSGEQVA TATTSLTVNA PPSPTLISSG
TLSCGQTSLT LTASPGEQSY RFSGPSVVSQ SGNTALVNAP GTYSVTSTNA SGCVSTTSTT
VFSNTAVITM NNPPTSTATL NAPFSQTLTA TGGATPYFYS LASGSLPAGL SLTPSGLLSG
TPTQAGSFTL VVRGQDANGC FGLGPAYVLT VNATAVISGF TSLENTVCVG SPVTFTATVG
NVTAPYTYTL TNGTSTTTGT TSGGFSQHLT ATGSGAQSFT LTVSSGEQVA TATTSVTVMP
TPPTPTIATQ SGQSYPGGQS ALTVAQYSGT VTLLINGCSG TINWQGPNGS SGSTTSIPVA
TSATGTFVYQ ATCQQTGCLS APASATVTVQ GAPLRVITPL FDCASGKLTL RTTGGNGQPI
EYQIPSVTTG WEATNPVSIQ AKDFKKSLKL RARQRSVGKG GFESDELDYQ LPACPGARVA
SPETDTELRV VVLDNPITGQ AVVVEVRGAE GQPLRLALTN LQGQPISEKT LEQARGVETQ
SLSVGSQGAG VLLLRVSTAG QTKTLKVLKL