Gene Slin_1911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1911 
Symbol 
ID8725648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2312400 
End bp2315594 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content50% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003386755 
Protein GI284036825 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.420637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATA ACTACTATGT ACGTTGGGGA TCCCATCTGC TGTGGATTAC CTTACTACTG 
ATCCACACAG CAGTTTTAGC CCAGGACCGT ACCATTACCG GGCGTATCAC GTCTAAAGGA
GAGGGCAGTG CACTTCCCGG TGTGAACGTT TCGATTAAAG GCACTTCACG CGGAGTTGTT
AGTGACGCCA ACGGTGGATA CAGTATTGTA GCTCCTCCCA GAACTACGCT TGTTTATTCA
TTCATTGGAT TCAAAGCGCA GGAAGTGGTT GTCGGGAATC AGTCCGTAAT TAACGTGACC
CTGTCGGAAG ATGTCTCCAC ACTCAATGAA GTTGTTGTTA CGGGCTATAG TGCCCAGTCA
AAACGGGACA TCACCGGAGC CGTTTCGACT GTGAATACCA AAGAACTGCT GTCGATCCCG
GCAACGGACG TAGCCCAGCA GTTGCAGGGC CGTGTAGCTG GGGTAACGGT AACGAACGAT
GCTACGCCGG GCGGTTCGGC CACGGTACGG ATTCGTGGAT TTGGTACGAT TGGTAACAAC
GACCCACTCT ACATTATTGA TGGTGTTCCA ACCCAAAATC TTGGCACCAT CAACCAGAAT
GATATTGAGA CCATTCAGGT ACTTAAAGAC GCTTCAGCCT CGTCTATATA TGGTTCCAGA
GCGGCCAATG GCGTTGTGAT CGTAACGACC AAAAAAGGGA AAGCCGGTGT ATCGCGGATC
ACGTTCGATG CCTATTATGG GTCGCAGCAG TGGGCCAAAA AAGGGGAAGT CCTCAACGCT
ACTGAACTGG GCCAGTACTT ATATCTGGCC GATGTTAACG CAGGTAAGGT CCCGTCACAC
GGGCAGTATA CGTATGGTGC TAACGGGCAG GTAACCATCC CAGCCTATGT ATTCCCCAGT
AAAGGAGCAG AAGGTACAGC AGCGGTTGAT CCGAGCAAAT ACTCTCTCAC TCCAGACAAT
ATTTACGCGA TCACCCGCTC GGCCAATACA AACTGGTTCG ATGAAGTTTC TCGGACAGCT
CCTATCCAAA ACTATCAGCT TGGCGCATCT GGAGGCTCAG AAACGGGCCG TTATGCCTTA
TCGGTTGGCT ATTTCAACCA GCAGGGAACT GTTAGGGATA TCAGCTACGA TCGCTACTCT
ATCCGGGCTA ACACGGAGTT CAATGTTAAA AAACGTATTC GTGTTGGCGA AAACCTGACA
GCCGCTTACA GCAGCCGTAA AGGAGGTTTT AACAACAACG AGGAACAGAA CGCAGTATCC
GGTGCCTACA AGCACCATCC ACTACTTCCT GTTTACGATA TCGCCGGAAA CTTTGCCGGT
AGCCGGGGAC TTAACCTGGG TAACAACTCC AATCCGGTAG CTACGCTGTT CCGTGAACGC
GATAACCGGT ATAACAGCCT TCGCGTATTT GGTAATGCGT ATGCTGAAGT CGACATCATC
GAAGGCCTGA CGGCTCGTAC ATCGATGGGT CTCGATGCCA ACGGAGACCG CGCCAAATAT
TTGGGACGGG CAAACCCGGA ATATATAGAG GGTAGCTTCA ACAACAGCCT GACCGACCAG
AACCGCTACT TCTACCAGTG GGTATGGACC AACACGCTGA ACTACTCCAA GACGTTCAAG
AACGTTCATA AAGTAGATGC TTTTGTTGGT ACCGAAGCCA TCCGTCAGTA TCAGGAATTC
TTCGGAGCCG CTCGCAGTGG CTACTTCACT GAGCAGAAAG ACATTCAAAG CTACCTTGAC
CTGGGTACGC AGTCATCAGC CAGCAACGAA GGACGCATTG AGCAGGATTA CTCCCTGTTC
TCGGTCTTCG GTAAACTGAA CTACGCTTAT AGCGACAAAT ACCTGTTTCA GGCCATTATT
CGCCAGGACA AGTCGTCACG CTTTCTGTCG GCTTCGAACA GTGCTCTGTT CCCGGCTGTG
TCAGCGGGCT GGCGTATCTC GCAGGAAGAT TTCTTTAAGA ACAACCTGAC ATTCGTAAGC
GATATGAAAC TGCGGGCAGG TTGGGGTAAA ACAGGAAACC AAGCCATCGG AGATTACAAC
GCCTATACTA CCTATCGTTC CAATACCTCA ACGAACGGCT ATCCGATTGA CGGAAGCATG
TCGACAGCAA CAGCCGGTTT CAGCCCACAG CGTTTTGGCA ACCCGAATGC TAAATGGGAA
GCTACAGCTT CGACCAACTT TGGTTTCGAC CTGGCCATGC TGTCGAACAA GTTGGATGTG
AGCTTCGATG TGTGGAGCCG GAAAACGACC GATATGCTCT TCACCTCACC GTTTACGTTC
ACTGCGGGCG ATGCAGATAT TCCGGCTTAT AACGTAGGTA GTATGCAAAA CCGGGGTATT
GACCTCGCTA TTGGCTACAA GGATCGCAAA GGTGATTTCC GTTATGGTGC CAGCATCAAC
TTCGCTACGT ATCGCAACAA AGTATTGAAA CTGGATGAAA GCGAGAATAC CCGTTACTTC
GGTTATGGCT CGCGCGTTCC GGCGGTTACT CTGACACAGG CCGGACTCCC CATTTCATCG
TTCTTTGGCT ACAAGGTACT TGGTATCTTC CAGACAGCCG AAGAAGCAAA AGCCTGGGCT
CCCTATGGTG ATTACAATGC TGTCGGTAAA TTCAAAATGG CCGATATCAA TGGTGACGGC
AAAATTGATG ATGCGGACCG AACCATCATC GGTAACCCCC ACCCCGATTT CACCTATGGC
ATAAATGTGA ACCTTGGTTA TAAAAACTTC GATCTGACCA TCTTTGGTAA CGGATCCCAG
GGCAACGACA TTTATAACTA CACCCGTTAT TTTACGGATT TCAACACCTT CCAGGGTAAC
CGTTCACGTC GGGCGTTATA CGATGCCTGG TCGAAAACGA ACCCAGGTGG CACAGTACCC
GTCCCGGATG CCAACGACCA GATCAGCAGC CGTCCCTCCT CTTACTTCAT TGAGGATGGT
TCATACTTCC GGATCAAAAA CGTTCAGTTG GGCTATACCC TGCCAGCCAG TTTGCTCTCC
AAACTGGGCT TAGCTTCCTG CCAGATCTAT GTGCAGAGCC AGAACCTGCT CACATTCACC
AAATATCAGG GACTCAACCC GGAGATTAGC ATTTCGAACA ACTACAATAG CGACAAAAAC
CGGAACCTCG GCTTCGACGG CGGTTACCTG CCCGCTTCCC GTACGCTGCT TTTTGGCCTA
AGTGTTGGAT TTTAA
 
Protein sequence
MKNNYYVRWG SHLLWITLLL IHTAVLAQDR TITGRITSKG EGSALPGVNV SIKGTSRGVV 
SDANGGYSIV APPRTTLVYS FIGFKAQEVV VGNQSVINVT LSEDVSTLNE VVVTGYSAQS
KRDITGAVST VNTKELLSIP ATDVAQQLQG RVAGVTVTND ATPGGSATVR IRGFGTIGNN
DPLYIIDGVP TQNLGTINQN DIETIQVLKD ASASSIYGSR AANGVVIVTT KKGKAGVSRI
TFDAYYGSQQ WAKKGEVLNA TELGQYLYLA DVNAGKVPSH GQYTYGANGQ VTIPAYVFPS
KGAEGTAAVD PSKYSLTPDN IYAITRSANT NWFDEVSRTA PIQNYQLGAS GGSETGRYAL
SVGYFNQQGT VRDISYDRYS IRANTEFNVK KRIRVGENLT AAYSSRKGGF NNNEEQNAVS
GAYKHHPLLP VYDIAGNFAG SRGLNLGNNS NPVATLFRER DNRYNSLRVF GNAYAEVDII
EGLTARTSMG LDANGDRAKY LGRANPEYIE GSFNNSLTDQ NRYFYQWVWT NTLNYSKTFK
NVHKVDAFVG TEAIRQYQEF FGAARSGYFT EQKDIQSYLD LGTQSSASNE GRIEQDYSLF
SVFGKLNYAY SDKYLFQAII RQDKSSRFLS ASNSALFPAV SAGWRISQED FFKNNLTFVS
DMKLRAGWGK TGNQAIGDYN AYTTYRSNTS TNGYPIDGSM STATAGFSPQ RFGNPNAKWE
ATASTNFGFD LAMLSNKLDV SFDVWSRKTT DMLFTSPFTF TAGDADIPAY NVGSMQNRGI
DLAIGYKDRK GDFRYGASIN FATYRNKVLK LDESENTRYF GYGSRVPAVT LTQAGLPISS
FFGYKVLGIF QTAEEAKAWA PYGDYNAVGK FKMADINGDG KIDDADRTII GNPHPDFTYG
INVNLGYKNF DLTIFGNGSQ GNDIYNYTRY FTDFNTFQGN RSRRALYDAW SKTNPGGTVP
VPDANDQISS RPSSYFIEDG SYFRIKNVQL GYTLPASLLS KLGLASCQIY VQSQNLLTFT
KYQGLNPEIS ISNNYNSDKN RNLGFDGGYL PASRTLLFGL SVGF