Gene Slin_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2454 
Symbol 
ID8726198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2960841 
End bp2963915 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content52% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003387273 
Protein GI284037343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000521114 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.729289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACAGA TGGTTGTTAG TATTTCGGTT TCGGCTGGCT CAGCAGACCA GCCCGTATCC 
GGAAAGATTA CCGACGAAAA AGGCGATGGG CTACCTGGCG TAAGCGTCGT TATTAAAGGT
ACATCACGTG GTACGACGAC GGATGCATCC GGGCAGTTCA AGATCAGTGT GCCTTCCGGA
AAAGCCGTGC TGATCGTCAG TTTCGTTGGC TACGTTCGGC AGGAGGTCGA CGTAAACAAT
CGTTCAGTCA TTAACATACA GTTGCAAAAC GACGATAAGG CACTCGAAGA AGTTGTAGTT
GTTGGCTATG GCACCCAGAA AAAAGTGAAC CTGACGGGAG CCGTATCGAC AGTCGATTCC
AAGGCACTTC AATACCGGCC AACAACGAAC CTGGCCAATG CGTTGCAGGG TGTTACGCCC
GGACTCACGA TTACCCGTCA GAGCGGACAG CCCGGCAATG AATCTATTCG ACTTCAGATA
CGGGGTGTTA CGTCGGCCAA CGGGAACGTT GATCCGCTGG TCATTCTGGA TGGTGTTAGT
GTGCCTATTT CGACCATGAC TACGTTGAAC CCCAACGACA TCGAAAGTAT CAGCGTACTG
AAAGATGCAG CGGCAGCGGC AATTTATGGC GCGCAGGCTG CGGGTGGGGT TGTGCTGGTA
ACGACCAAGA AAGGTAAATC GGGTAAAGTA ACCTTTGACT ATCTGGCTCA ATACGGTACC
GACTGGTCTA TCAATGTACC GGGCCGCATG AGTTTACTGG AAGAAGCGCA ATTCTCCAAC
CTCGCACGGG CAAACTCGGG TAGCGCACCT GAATATACCG AGGACGATTT ACAGCGTATT
CGTAATAACA TCCCCTATGT GGTCAATCCG GCCGATACGG GCACCTATCT GTACTATAAT
CAACAGTCAC TGACAGACCA GTTGCTGCGG AAATACACGG CTATGAAAAC CCACAACTTT
ACCGCACGGG GTGGTAGCGA TAAAGTGAAT TTCCTGCTTT CGGCGGGGTA CTATGAGAAA
CAGGGGGTGT TTAAAGTAGG CCCGGATAAT ATGAAACGGT ATAACGTCCG GCTCAATCTG
GGGGCTCAGC TTACCAAACA TCTCTCGCTC GATACCCGCC TGTCGTACTC GCTGGAACAG
GTCAGGCAGT CATCTACCGA TGCCAATGGA AGCGGGTTGC TGTATCAGGT GTACCGGCTT
CGAACCAGAA CGCCCTTCTT CACACCCGAA GGACGATACA ATGGTGCTGG TTCGGCGGCT
ACAGCCTACG GGTTGCTCGA ATCGGGTGGG TATAACAACC AGAACAAACG CCAGTTCGAT
GGCGTGGTGA CGCTCCAGGC GGCAAACTTT GTTAAAGGGC TGACGTTACG CAGTGTGGCT
GGTGTTCAGT ATCGGCCGTC ACTCCGGCAG CAGTTTGCCC GTACCGTACC GCTCTGGGGC
AAATCCCGGA TTTTAAGCTA TGCCAATAAC CCAAACTCCT ATCAGGTAAC CAATGAACTG
GTTCGTAACA CGAATCTCCA GTTCCTGGCC ACCTACGAGT ACAAGTTAGG GGAGAAGCAC
AATTTCTCGA TCTTGGGCGG TTACCAGTGG GAAGATTATC GCGAGGAGGG GGTTGCCACG
GCTTCGTCAA ACCTGGTTAG TAATGACTTG CCAACCCTGA ACCTGGGCGA CGACCGTACA
AAAAGCAACA GCGAATACGT ACGGGCCCGC GCATTCCAGT CGGTATTCGG CCGATTCAAT
TATAATTTCG ACGGCAAATA CCTTTTTGAG GCTACACTGC GTCAGGATGA GAGTTCGAAA
CTGGCCTCCG GTTTGCGAAC CAAGATTTTC CCTTCGGCTT CGGCTGGCTG GAACCTCCAG
CGCGAAGACT GGTTCGCCAA GGCATTACCA CTGTTCACCG AATTTAAACT TCGGGCATCG
TGGGGACGTT TGGGCGGTGC TTTGGGCGAT AACATTGGAA ACTATGACTA CCTGAGCCAG
CTAAGTCGCG GTTCGGCGCT GGTACTGGGC GATGCCCGGA CATCCTATAT CTTTCAGGGC
TCAATCCCCT CGGCCGCCCT TTCGTGGGAA ACCATCGAAA CCAGCGACGT TGGAATTGAT
CTGGGCTTCT TCCAGAACCG GCTTCAACTG ACAGCCGATT ACTACGTGAA GTTTAACCGG
AATATGCTTA CTCCGCTGCA ACTACCCGGC ACGATCGGTA TCGGTACGCC AAGGCAGAAC
AACGGTGAAC TGAAATCGTG GGGTTGGGAA ACCGAAGTAA AATACCGTGA TCGGATCGGT
AAAGATTTCA CATACTCCGT AGCCGCTAAC CTGTCTGATA ACTTCAATAA ACTGGTGAGT
TACGCCGGGC GGACGGTAGT TGGTGCCGGA ACGAACAACT TGATCGAAGG CTATCCTATC
AATACCATTT GGGGGTATCA GACGGCGGGC TATTTCCAGT CGGCCGACGA GGTGAAAGGC
TGGGCCTTTC AGGATAACCG CGCGGGTGCC GGGGATGTTA AATATGTTGA CCAAAACGGC
GACAGTAAAA TCAACGTGGG CAAAGGCTCC ATTGCCGACC ATGGCGATCT GGTGCTGATC
GGTACTACCC AGCCACGTCT TCAGTTTGGC TTCACCTTAG GGGCTCAGTG GAAAGGATTC
GACCTCACCA TCTTTATGCA GGGGGTGGGC AAACGCAGCT ATCGCCCCAA TACAGAGTCG
ATTGCCCCAT TGCTTGTTAC CTGGAAACAG GCTTTGGCCA TTCACAACGA TTACTGGACC
CCCGAAAACC CCAATGCACT GTACCCACGG CCGTATGTTG GTGCAACACA TAACTACGTG
TCCTCCGATA AATGGGTGCT CAACGCCAGC TACATGCGGA TGAAAAACCT GCAGTTTGGC
TACACACTGC CCAGTACATT GACGCAGAAG ATACGAATCA GCCAGGCTCG TTTCTTCTTT
TCAGGACAGG ACTTATTTAC GGTGTCCGGG TTGAAGGCAT TCCAGGGATA TTACGATCCC
GAAATGCGTG ATGGTGTCGA GAATGATTAC CCATTCTTTG CCACCGCTTC GGTGGGACTG
AACGTCTCCT TTTAA
 
Protein sequence
MLQMVVSISV SAGSADQPVS GKITDEKGDG LPGVSVVIKG TSRGTTTDAS GQFKISVPSG 
KAVLIVSFVG YVRQEVDVNN RSVINIQLQN DDKALEEVVV VGYGTQKKVN LTGAVSTVDS
KALQYRPTTN LANALQGVTP GLTITRQSGQ PGNESIRLQI RGVTSANGNV DPLVILDGVS
VPISTMTTLN PNDIESISVL KDAAAAAIYG AQAAGGVVLV TTKKGKSGKV TFDYLAQYGT
DWSINVPGRM SLLEEAQFSN LARANSGSAP EYTEDDLQRI RNNIPYVVNP ADTGTYLYYN
QQSLTDQLLR KYTAMKTHNF TARGGSDKVN FLLSAGYYEK QGVFKVGPDN MKRYNVRLNL
GAQLTKHLSL DTRLSYSLEQ VRQSSTDANG SGLLYQVYRL RTRTPFFTPE GRYNGAGSAA
TAYGLLESGG YNNQNKRQFD GVVTLQAANF VKGLTLRSVA GVQYRPSLRQ QFARTVPLWG
KSRILSYANN PNSYQVTNEL VRNTNLQFLA TYEYKLGEKH NFSILGGYQW EDYREEGVAT
ASSNLVSNDL PTLNLGDDRT KSNSEYVRAR AFQSVFGRFN YNFDGKYLFE ATLRQDESSK
LASGLRTKIF PSASAGWNLQ REDWFAKALP LFTEFKLRAS WGRLGGALGD NIGNYDYLSQ
LSRGSALVLG DARTSYIFQG SIPSAALSWE TIETSDVGID LGFFQNRLQL TADYYVKFNR
NMLTPLQLPG TIGIGTPRQN NGELKSWGWE TEVKYRDRIG KDFTYSVAAN LSDNFNKLVS
YAGRTVVGAG TNNLIEGYPI NTIWGYQTAG YFQSADEVKG WAFQDNRAGA GDVKYVDQNG
DSKINVGKGS IADHGDLVLI GTTQPRLQFG FTLGAQWKGF DLTIFMQGVG KRSYRPNTES
IAPLLVTWKQ ALAIHNDYWT PENPNALYPR PYVGATHNYV SSDKWVLNAS YMRMKNLQFG
YTLPSTLTQK IRISQARFFF SGQDLFTVSG LKAFQGYYDP EMRDGVENDY PFFATASVGL
NVSF