Gene Slin_3960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3960 
Symbol 
ID8727718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4752627 
End bp4756055 
Gene Length3429 bp 
Protein Length1142 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388749 
Protein GI284038819 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAC GAGTACTTTA TCAAAAGACT TTACAGAAAA TCATGCGCAT TGGCATCTAC 
CAGGCCTTTC TTTCTGTCGC TTTTGCCACC TTCTCCTACG CCGTCGAGGT TAGTGGGCAG
AAAGTGCTGG AGCAAAAAGT AACCCTTCAA CTGTCCAACG CCGATGTTGA AAAAGTGCTG
GACAAAATTG AGACCGTTAC GAACGTCAAA TTTCTGTACA ACCCGCAAAT CTTCGGGAAC
GACACCAAAG CCACCTATAA ATTTCGCAAC GAGCCGCTCT CCGATGTTCT GAACAAGATT
CTAAACCGGT ATCAGGTTAC CTACGAAGTA CTCCAGGACC GGATCATTCT GAAACGGCTG
GAGTCGTTCG AAATCAGAGT ACCCGTTCAG CAGGAAGCCC CCAAACGGAA AGTAGCGGGT
ATCGTTCTCG ACGAAAACGG GGCCGGATTG CCGGGCGTGA GCGTCGTAAT CAAAGAAGCC
CAGAAAGGGA CCACCACGGG TGCTGACGGC CGTTTTTCGC TCGATGTCCC AGACGATAAC
GCTGTACTGG TGTTCAGCTT TGTGGGCTAC AAACGGCAGG AAGTTACCCT AGGTAACCAA
AGCAACCTCT CGGTAACGCT GGCTCCGGAA GCTAGCACCC TCGGCGAAGT GGTGGTAACG
GCCCTCGGTA TTGCGCGGGA GAAAAAAGCC CTCGCCTATG CCGTGTCGGA AGTGAAAGGC
AGCGAGTTTA CACAGGCCCG CGAAAACAAC GTGGCCAACG CCCTGACGGG TAAGATTGCC
GGGGTCAACG CAACGGGTAT GGCGACCGGC CCCGGTGGAT CAAGCCGCAT CATCATCCGG
GGTAATGGCT CCCTGAACGG CAACAACCAG CCGCTGTACG TCATCAACGG GATGCCGATG
GATAACAGCA CCCCCGGCGG CACACAAGCC GACGGCAACG GCATGAACGT TGACCGGGGT
GACGGTATCG GCGGTATCAA CCCCGACGAT ATCGAGTCCA TCAGCGTACT CAAAGGTGGT
CCCGCTGCGG CTCTGTACGG AGCCCGTGCC TCCAATGGCG TTATTCTGAT TACGACAAAA
AAAGGCCGCG CTCAGAAAGG TGTCGGCGTA GAGATTAACA GCAATACGAC CTTCGAAGAC
ATCGCCGTGA TTCCAAATTG GCAGTACGAA TACGGCCAGG GACTCGATGG CAGAAAACCA
ACCACGGTAA CGGAGGCCAA AAGCACCGGC CGACTGTCTT ACGGGGCCAA AATGGATGGG
CTACCCACAA TTCAGGTGGA TGGGCAAATG CACCCCTATT CGCCCCAGAG AAACAATCTG
AAAAACTTTT ACCGCACAGG CACCAACTAC ATTAACTCGC TGGCCTTCAC CGGCGGCAGC
GAAACGGTAA ACTTCCGGCT GGGACTCAAC AACACCCAGT CGAACAGCAT CGTACCCAAC
TCGTCGTTCT CCCGGCGGAT CGCCAACCTG AACCTGAACG CGTTTCTGGG CAAGAAACTG
AGCGTTGAAA CGGTATTCCA GTACAACGTG GAAGAGGGCA TCAACCGACC GAAAGTTGGG
TATGCCGACT TCAACCCGCA CTGGGCCACT TACCTAATCG CCAACGTGGT CGACATTCGT
AGTCTGGCAC CGGGATACGA CCCCGTGACG GGCAAAGAGA TGGAATGGAA CCCTGTTCCG
GCCGCGCCGA ACCCGTATTT TGTGATCAAC AAGTTTAAGA ATAACGACAC CAAACACCGG
TTTATCAGCC AGGGAAGCAT CCGCTACGAT ATTCTGGACA ACCTGTTCCT CAAAGGCAGC
GTCAGCCAGG ACTTTTACAG CTTTTCGTCG GAATACGTCC AGCCTACCAA TAACGCCTAC
CAGCCGCTGG GCACCTACGA AGCCCGTAAA ACAAGCTCCT CGGAAACCAA TGGTATGCTG
ACGCTGAACT ACAACACGAC CTTTTTTAAG GACCTTACCT TTTCAGCCCT GCTGGGCGGC
AATGTCCAGA AAGCGATCTT CGACCAGACA ACCATAGCTG GCAGTGAATT TACGGTACCG
TATTTCTACA GCTACACCAA CCTCGCGACA TCGACCACAA CGCCAACCTA CCTGAAAAGC
GCCATCAATT CGGTGTTTGG CTCGGCCGAT TTCGGGTATA AAAACGTCGC TTATCTGACG
CTGTCGGGTC GGCAGGACTG GTTCTCGGTG CTGAACCCGA AGAGTAACCA CATCTTCTAC
CCATCTGTGG GCGGCTCGTT CATTCTGTCT GATGCGTTCC AGTTGCCCAA GGCGGTGAGC
TTTGCGAAGT TGCGGGCGTC GTGGGCGCAG GTGGGTGGCG CTACGGTCAA CGCCTATCAG
ATTTACCAGT ACTATTCCAT GCAGCAGGGC GGTCACAACG GTCGGCCGGT GCAGGTTTTA
TCGTCCTCGC AGGTACCCAA CCCCGACCTG AAACCGCTGA CCTCGACTAC GTACGAGGGG
GGTATTGAAG CCAAGTTCCT GAACAACCGG CTAGGTATCG ACCTCACGCT CTACAACCGC
AAAACCACGG ACGACATCGT GACGACGAAC ATCGCCCTGT CGTCGGGCTA CACCTCGGCG
CTGTTGAACG TGGGTGCGTT GAGTAACAAA GGCGTTGAGC TGCTACTGAC TGGCACGCCC
GTCAGCAAAG GGCCTTTCTC CTGGGATGTT AGCTACAACA TGGCCTACAA TAAGAGCAAG
ATCGAACAGC TGGCCGCAGG CATCACCGGT ATTGATGTTG GTGCGGGCGT AGGCGGTGGT
CTGGTTCGGA ACGTACTCAA CCGGCCTTAC GGCACCGTTT GGGGCTACAA CAAGAAGACC
GACGCCAACG GCAATGTGGT CTTTAACACA GCCAGCGGGT ATGCGCTTCG GGGCGATTTG
CAGGAAATCG GGCAGGGCAC GCCCCCACTC ACGATGGGGA TCACCAACAA CTTCCGATAT
AAGAACTTCT CGCTGAACAT CCTGGTCGAC GGTAAATTTG GCAGCATCGT TTACTCGAAC
CTATACCAGT ATGCCTACCG CTTTGGTCTG CCGCAGGAAA CCCTGCCCGG CCGCGAAACC
GGCATCACCG TCACGGGCGT AACCCCCGAA GGCAATCCGT ACAGCAAAAC ATGGAGCAAG
GAAGAGGTCG ATACGTACTA TGACAACGAC AAGAACTACA CCGCCATGTT CATGTTCAAC
AACGATTTCG TGAAGCTGCG TCAAGTGATC CTCAGCTACA ATCTGCCCGT TGCCAAACTG
CCCTTCCTGA AGCTACAATC GGCCACGATC TCGTTTGTAG CGCGTAACCT GGCTATTCTC
TACAAGGATA AAAAGAATCA GTATTTCGAT CCGGAGTCGG GCTATACGAG CACCAACGCA
CAGGGGCTGG AGGCTTTCGG CGTACCCAGA ACCCGCAGCC TGGGTGTGAA CTTAATGGTG
AAATTCTAA
 
Protein sequence
MSKRVLYQKT LQKIMRIGIY QAFLSVAFAT FSYAVEVSGQ KVLEQKVTLQ LSNADVEKVL 
DKIETVTNVK FLYNPQIFGN DTKATYKFRN EPLSDVLNKI LNRYQVTYEV LQDRIILKRL
ESFEIRVPVQ QEAPKRKVAG IVLDENGAGL PGVSVVIKEA QKGTTTGADG RFSLDVPDDN
AVLVFSFVGY KRQEVTLGNQ SNLSVTLAPE ASTLGEVVVT ALGIAREKKA LAYAVSEVKG
SEFTQARENN VANALTGKIA GVNATGMATG PGGSSRIIIR GNGSLNGNNQ PLYVINGMPM
DNSTPGGTQA DGNGMNVDRG DGIGGINPDD IESISVLKGG PAAALYGARA SNGVILITTK
KGRAQKGVGV EINSNTTFED IAVIPNWQYE YGQGLDGRKP TTVTEAKSTG RLSYGAKMDG
LPTIQVDGQM HPYSPQRNNL KNFYRTGTNY INSLAFTGGS ETVNFRLGLN NTQSNSIVPN
SSFSRRIANL NLNAFLGKKL SVETVFQYNV EEGINRPKVG YADFNPHWAT YLIANVVDIR
SLAPGYDPVT GKEMEWNPVP AAPNPYFVIN KFKNNDTKHR FISQGSIRYD ILDNLFLKGS
VSQDFYSFSS EYVQPTNNAY QPLGTYEARK TSSSETNGML TLNYNTTFFK DLTFSALLGG
NVQKAIFDQT TIAGSEFTVP YFYSYTNLAT STTTPTYLKS AINSVFGSAD FGYKNVAYLT
LSGRQDWFSV LNPKSNHIFY PSVGGSFILS DAFQLPKAVS FAKLRASWAQ VGGATVNAYQ
IYQYYSMQQG GHNGRPVQVL SSSQVPNPDL KPLTSTTYEG GIEAKFLNNR LGIDLTLYNR
KTTDDIVTTN IALSSGYTSA LLNVGALSNK GVELLLTGTP VSKGPFSWDV SYNMAYNKSK
IEQLAAGITG IDVGAGVGGG LVRNVLNRPY GTVWGYNKKT DANGNVVFNT ASGYALRGDL
QEIGQGTPPL TMGITNNFRY KNFSLNILVD GKFGSIVYSN LYQYAYRFGL PQETLPGRET
GITVTGVTPE GNPYSKTWSK EEVDTYYDND KNYTAMFMFN NDFVKLRQVI LSYNLPVAKL
PFLKLQSATI SFVARNLAIL YKDKKNQYFD PESGYTSTNA QGLEAFGVPR TRSLGVNLMV
KF