Gene Slin_0801 details


Gene Information

Locus tag: Slin_0801
Symbol:
ID: 8724532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 969997
End bp: 973347
Gene Length: 3351 bp
Protein Length: 1116 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003385663
Protein GI: 284035733
COG category:
COG ID:
TIGRFAM ID:
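
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that check. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(969997, 973347)
print(length)                     # 3351
print(protein_length_aa(length))  # 1116
```

Both values agree with the Gene Length and Protein Length fields reported for Slin_0801.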


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.383306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.353517
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAA AATCTACTTT GTACTGCCAC TGCCGGGTGC ACATTGGCCA ATTCAGGTCA 
GCGTTCGGTC TGTTGCTTCT AACCTGCCTG TTGATGGGAG GTTCTGCTTT TGCCCAAAAT
CGGGTCGTTA GCGGGAAGGT AACGGATGCT AAAACCAATG GCCTGCCGGG TGTTAGTATC
ATCATCAAAG GAACCACAAC CGGAACCACA ACCGATGCGA ACGGCGATTA TTCGCTCAGC
GTACCGTCTG CTGAAGCGAC CTTGACATAT TCCTACATTG GGTTTGATGC ACAGTCGAAA
ACGATCGGTA GTCAGTCGGT TATCAACATA ACACTTGTTG AAAATACGGC GCAGCTCAAC
GAGGTTATTG TAACGGCTCT GGGTATCAGG AAAGAAGCCC GAACAATTGG CTATACCACG
CAGGATGTAG CGGGCGATCA GCTCGTGAAG GCGCGTGAGC CAAACCCGGT TAACTCGCTG
ACCGGTAAAA TCGCCGGTCT GACGGTCGGG CCTTCGGCCG AGATGCTGTC GAAACCCAAG
CTCCTGCTGC GGGGTAACAG CGATCTGTTA TTTGTCGTCG ATGGTGTTCC CATCAACTCG
GATACCTGGA ACGTGTCGGC CGATGACATT GAAACGTACA CCGTCTTGAA AGGTCCTAAC
GCAGCTGCTC TTTATGGCTT CCGGGGGCAG AACGGAGCGA TCATGATTAC GACCAAGAAG
GGGACGAAAG ACAAACGCAA AATTGCTGTC GACTTCAACA CGAGTACCAT GTTTGAATCG
GGTTTTCTGG CTTTGCCCGA CCGTCAGAGT GAATATGGAT ACGGGAATAA CTTCAAGTAT
GCCTATGGCA ATAAGCTCTA TGACGAAGAC GGTGGCTACC GCCGGACAAA CCTGTGGGGC
CCTCGTTTCG AAGGACAAAA TGTGCCGCAA TACAATAGCC CGGTAAATCC AACAACGGGT
ATCCGTCAGG GAACGCCCTG GCTAAATGTT GGTAAGGACA ACTTCAGGAA TTTTGTACAG
ACCGGTATCA TTTCGACCAA TAACGTTTCG GTATCGTCGA GTGGTGAGAA GTATGACCTG
CGGATGTCGG TATCGAACAA CTACCAGCGG GGTATCTACC CAAACACACG GCTGAACATT
ACCAATTTCA ACCTGACCAC GGGTATCAAT TTTACCGACA GGCTGCGGTT TGATGGTAGC
TTGAATACGA ATATCCAGGC ATCGCCAAAT ATTCCAGAAT ATAGCTCAGG TCCCGAGAGT
TATGTGTACG CCTTTCAAGT ATATGGTTCC AGTAGCTGGG ACCTCGCCGA TATGCGCGAT
TATTACAAAG GGCCACAGGG TAAGCAGGGT GTGCAGCAGT ATTACGCGGA ATATGGCCGG
GAGAATAACC CTTATTTCGT TGCTTATGAA TGGCTACGTG AGCATCGCAA AACGGATATT
TACGGCTATA CCCGGTTGAG CTACAAAATC AATGATTTCC TGAACCTATC TCTCCGGACA
CAGATAACCA CCTGGAATCA GCTGCGGACC GAAAAATTGC CCTATTCCAT GATCACTTAC
AAATCACCTG ATTTGCGGCA GGGTGATTAT CGCGAAGATC GTCGGAACAT GCTCGAAAAC
AATACGGACC TGCTGCTGAC CTTCAACAAG GACGTAGCCA AAGATTTTCA TATTAACGCA
TCGGCCGGTG CCAACGCGCG CACGTTTACT TACAATTCGA ACTGGACCAC AACCGACTTC
CTGATTGTGC CGGGTGTGTA TGCGTTTACC AACTCGAAAA ACCCTGTTCG GGCCTACAGC
TTCCGCTCGG ATATGCGGGT TCTGAGCGCC TACGCAACCA GTGACTTTAC CTACAAAAAC
CTGGTGACAC TGGGTGTAAC GGGCCGGTTT GATAAACTCT CGACTTTGCC AAAAGAGAAC
AACACGTATT TCTATCCGTC TGTGGCCCTT AGTACAGTCG TATCAGACTA TGTGAAAATT
CCGGAAGCGA TTTCATTCCT GAAACTACGA GGTTCCTATG CCAACGTGCG TGGTGGGTTG
ACGCAGTCGG AAATTGGTAC GGCCTACCGG GCGGTAACCG GTAGCGGTAC CGATGCCTTA
ATAGGCTACG GTACCGACCT GACCTCCTCG TATGACGGTC CAAGCTACGC CAACCAGAAC
ACCTACAGCA TCTCAACGGG CCTTTATAAC AACACCCCAA TGGCCAACTA TTCGGGAACG
CTGGCCAACA AATCGCTGAA GGCGTATACC GTTAGCTCGT ATGAGTTTGG TTTTGATGCC
AAATTTTTAG GCAATCGACT GGGCTTTGAC CTGACTCATT TTACCGCCGT GAACGGTCCA
CAGATTTTTG CCTTACCGGT GCCAAGTTCA ACCGGATTCT ACAATGAAAA CGTAAACGGT
CTGGTAACGA AACGGGACGG CTGGGAAGTG TCGGTAACGG GATCGGCCCT TAAAAATCCA
AACGGTCTGA ACTGGGATGT GTTGGCAAAC TGGTCGACGT TTAAAGAGCG GCTGAAAGAG
ATATACGGTA ACGAAACAAG TATATATCTC AGCGGCCCTG ACCACGTCTT TACCATCGGT
GACCGGCTTG ACGGCTATTA TAGCTACAAT TTCCTGCGCG ATCCAAACGG TAATATCATT
AACTCAGCTA CCGGACAGCC ACTAACACGT CCTTCTGGAA CGAACACCAA GCAGCTACTG
GGCTATACGA ACCCTGATTT TGTGTGGTCG CTCAATAACC GCTTCAGTTA TAAAAATTTC
AACTTCAGTT TCCAGTTCGA CGGCCGTGTG GGCGGTGTTA TTCGCGATCA GGTATATGCC
TATGCCATGA ACGCGGGTAA CCAAAAAGAT CTGGTAACCG GAGCCTTTGG CGAAGCTCGT
TTGAAGGAAT GGCAGAGTAC AAATACTAGC ACCGTAGCGG CAACCCCCGC TTACGTTGGC
CCAGGTGTGG TGACAACGGG TCAGGTTAAG TTCGACGGTC AGGGCAACAT CAGTAACATG
AGTGAGTTAA CCTTCTCGCC GAACACCAAA GCGGTAACGG TGCAGTCATA TGCTCAGGGT
GTTTATAACA GCGGTATCGA AGAATCCTAT ATGGTTAGCA AAACATACGC TAAACTACGG
GAGGTCATCA TTGGCTACAC GGTGCCCGTT ACGGTGTTAC CCCGGTTTAT TCGGGCGGCT
TCGGTATCCG TAGTAGGTCG TAACCTGCTC TATTTCGCTC AGCGTAAGGA TTTCGACCTG
GACCAGTTCC CGGAAGGCTA CAACGCCACA TCTAACTCCA CCCTGCGTAA CCCTGGTTTG
CAGTCGTCGA CGTTACGCCG ATTTGGCGTG AATCTAAATC TGACATTCTA A
 
Protein sequence
MKQKSTLYCH CRVHIGQFRS AFGLLLLTCL LMGGSAFAQN RVVSGKVTDA KTNGLPGVSI 
IIKGTTTGTT TDANGDYSLS VPSAEATLTY SYIGFDAQSK TIGSQSVINI TLVENTAQLN
EVIVTALGIR KEARTIGYTT QDVAGDQLVK AREPNPVNSL TGKIAGLTVG PSAEMLSKPK
LLLRGNSDLL FVVDGVPINS DTWNVSADDI ETYTVLKGPN AAALYGFRGQ NGAIMITTKK
GTKDKRKIAV DFNTSTMFES GFLALPDRQS EYGYGNNFKY AYGNKLYDED GGYRRTNLWG
PRFEGQNVPQ YNSPVNPTTG IRQGTPWLNV GKDNFRNFVQ TGIISTNNVS VSSSGEKYDL
RMSVSNNYQR GIYPNTRLNI TNFNLTTGIN FTDRLRFDGS LNTNIQASPN IPEYSSGPES
YVYAFQVYGS SSWDLADMRD YYKGPQGKQG VQQYYAEYGR ENNPYFVAYE WLREHRKTDI
YGYTRLSYKI NDFLNLSLRT QITTWNQLRT EKLPYSMITY KSPDLRQGDY REDRRNMLEN
NTDLLLTFNK DVAKDFHINA SAGANARTFT YNSNWTTTDF LIVPGVYAFT NSKNPVRAYS
FRSDMRVLSA YATSDFTYKN LVTLGVTGRF DKLSTLPKEN NTYFYPSVAL STVVSDYVKI
PEAISFLKLR GSYANVRGGL TQSEIGTAYR AVTGSGTDAL IGYGTDLTSS YDGPSYANQN
TYSISTGLYN NTPMANYSGT LANKSLKAYT VSSYEFGFDA KFLGNRLGFD LTHFTAVNGP
QIFALPVPSS TGFYNENVNG LVTKRDGWEV SVTGSALKNP NGLNWDVLAN WSTFKERLKE
IYGNETSIYL SGPDHVFTIG DRLDGYYSYN FLRDPNGNII NSATGQPLTR PSGTNTKQLL
GYTNPDFVWS LNNRFSYKNF NFSFQFDGRV GGVIRDQVYA YAMNAGNQKD LVTGAFGEAR
LKEWQSTNTS TVAATPAYVG PGVVTTGQVK FDGQGNISNM SELTFSPNTK AVTVQSYAQG
VYNSGIEESY MVSKTYAKLR EVIIGYTVPV TVLPRFIRAA SVSVVGRNLL YFAQRKDFDL
DQFPEGYNAT SNSTLRNPGL QSSTLRRFGV NLNLTF
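
The sequences above are laid out in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the gene sequence into protein. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. All function names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGAAACAAA AATCTACTTT GTACTGCCAC TGCCGGGTGC ACATTGGCCA ATTCAGGTCA"
dna = clean_sequence(first_line)
print(translate(dna))  # MKQKSTLYCHCRVHIGQFRS
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through clean_sequence and gc_content gives a value to compare against the reported GC content field.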