Gene Slin_6433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6433 
Symbol 
ID8730217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7796368 
End bp7799409 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content57% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003391189 
Protein GI284041259 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.234793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTA AATGGCTACT TTTTTGTGGC GTCCTGCTTC CCGCACAGGG AGTACTCGGC 
CAGCAAATTG CCAGCAATAG CAGTCCGCTA TATGCTTCTG TCAGATCAAC GGTTACCGGA
CTACGTGCGT CGGAGACGGC GGTGATTACC GTGACGGGAA AAGTGACCGA CGAGAAAGGC
GACGCGCTGC CCGGCGCTAC CGTTTCGCTG AAAGGCGGCT CCGTCGGGGC AAATACCGAC
GCCGACGGCA ACTACACCCT CCGTATCCCC GACGGCACCC CGAATCCGGT GCTGGTCTTT
TCGTTTATCG GCTACACGTT GCAGGAAGTT GCCATTGGCA ACCAGACCGT GGTGAACGTA
CAACTCAAAG GTGATGCAAA ATCCCTGAAC GAAGTCGTCG TGGTCGGTTA CGGTACCCAG
AAACGTTCAG ACATTACCGG TTCGGTGGCG TCTGTACCCA AAACCCGTTT GTCGCAGTTA
CCCGTTACCA ACGTGTTGCA GGCCATTCAA GGCTCAGTGG CGGGTGTCAA CATCTCGCAG
TCGTCGTCGG TACCGGGAGC CGCGCCGTCA ACGACCATCC GCGGGCAGAA CTCCATCAAC
GCCAACTCCG GCCCCTATGT GGTTGTCGAC GGTATTCCGC TGAGCAAAAC GGGCGGCTCG
CTGAGCGACA TCAACCCCAA CGACATCGAG TCGATGGAAG TACTGAAAGA TGCCTCGGCG
GTGGCTATTT ACGGTACCAA CGGCGCCAAC GGCGTTATTC TTGTGACAAC CAAACGGGGT
AACACCGGCA AGCCGACCAT CCGGTATAAC AACTACGTGG GTGTCGAAAA CTTCGCACAT
ATGCTGCGTC CGCGCAACGG CGCTGAGTAC GTGCAGAAGT ACGCCGATTA CATGGCCCAG
ACCGGCCAGA AACTGGTCAA TCCGGTGCCT AACTACGACG AGTTGGCCAA CTACAACGCG
GGCATCACCA CCGACTGGAT GAAAGAAGCG ACCCAGACGG GCGTGTTGCA GGACCACAAC
CTGAGCATTT CGGGTGGATC GCCCAATGTG CGGTACTTCA TCTCGGGTGA GTTTCTGGAT
CAGAAAGGCG TTATCAAAGG CTATCAGTAC AAGCGGGCCA GTTTCCGCTC CAATCTGGAC
GTTACCCTGA CGGATTACCT GACGGTGGGC ACCTCGCTGT TCATTGCCAA CAGCAACCGC
GACGGCGGCC GCGCCAACAT GCTCAACGCA TCGGCCATGA GCCCCTACGG GCAGGAGTAC
AACGCCGACG GGACCTACCG CATTTACCCC ATGTTCCCGG AGCAGTTGTA TACCAACCCA
ATGATTGGCC TGACCGTCGA CCGCGTTGAC CGCAACACCA ACCTGAACGG GAACGCCTAC
CTCGAACTGA AACTGCCGGG CAAACTGAAC GGGCTGAAAT ACCGCATGAA CCTCGGTTAC
TCGTACATTC CGGCCCGCAC GGCGAGTTAT AATGGCCGGG CGGCCAACGA CCTGCTCGGT
ACGGCCAACA CGTTCTTCTC CGAAACCAAC AGCTTCACCC TAGAAAATAT CCTGTCGTAC
AGCCGGGATT TCGGCAAGAA CCACTTCGAC TTCACGGGTC TGTACAGCGC CCAGCAGCGG
AAATACGCCA CCGCAACCGG AACGGCAACG GGCTTTGTCA ACGACCAGTT GTCGTTCAAT
AACCTGGGGG CCGGTGCCAC GCAGTCGAGT AACTCCTATG CCGACCGCTA CGGCCTGAAC
TCGCAAATGG GTCGGGTCAA CTACTCGTAC GACAGCCGGT ATCTGTTCAC CGTTACGGCC
CGTCGCGATG GTTCGTCAGT GTTCGGAGCC AATACTACCA AGTACGGCCT GTTTCCCTCG
GCGGCTATCG GCTGGAACAT CAGCAATGAG GCTTTCATGA AAAACGTGAA CCTGGTGAGC
AACCTGAAAC TGCGTTTCTC GTACGGCAAA TCGGGTAACG AAGCCATCAG CGTATACCGG
ACCATTACGA CCGACAACAC CGTTCGGTCG CCCTTCAACG GCGTGAGCAC CATCGGCGCA
CAGCCGGGCA ACCTGGGCAA CGCCAATCTG CAATGGGAGA CGACCCTCAG CCGCAACATC
GGCGTAGATT TCGGTATCCT CAACAACCGC ATCAACGGTA GCCTTGATCT GTACAAGAAT
AACACCAAAG GATTGCTCCT ATTGCGCAGC CTACCCATCC TGACCGGCTA TTCGAGCGTA
TACGATAACC TCGGCGAAAC GTCGAACACC GGTATCGAAC TCACGCTGAA CACCCGAAAC
GTAACGAATG GCGATTTCAA ATGGGAAAGC ACCGTTGTAT TTGCCTCCAA CCGCAACCGC
ATTCTGGACC TCTACGGCGA CAAGAAAGAC GACCTCGGAA ACCGCTGGTT CATTGGTCAG
CCCATCAGCG TGGTGTACGA CTACAAACTG GCCGGTGTCT GGCAAACGGG CGAAGACGCG
TCCTCGCAGG ACCCCGGCGC GGTGGCCGGT GACCTGAAAT TTGCCGACCT CAACGGCGAC
AAGAAGATCA CCGCCGACGG CGACCGGATG ATTCTGGGCC AGACGGCTCC CAAGTGGACC
GGTGGCCTGA CGAACACCTT CCATTACAAG AATTTCAACC TCAACGTGTT TATCCAGACG
GTTCAGGGCA TAACCCGCAA TAACGCCGAC CTGACCTACG CCGACGAAAC CGGCAAACGG
AACACGCCCA TCGACGTGGG GTACTGGACG GCCAACAACA AGAGCAACAC CCGCCCGTCG
CTGGCGTTCA AGAACCCACG GGGCTACGGC TATGCGTCGG ATGCGAGCTA CACCCGTATC
AAGGACGTAA CGCTGAGCTA CGTCTTCGAT CAGAAACTGC TTGATAAACT GCACCTGGGC
AGCCTGACAG TTTATGCCAG TGGTCGTAAC CTGTACACCT TTACCAACTG GATTGGCTGG
GACCCCGAAG CGGTGCAGTC TTCCCGCGGC TCCGGCGACT GGACGAACAA CTACCCGCTG
ACCCGCTCTT TTGTGATGGG CCTTAACATC AGCCTTCGCT AA
 
Protein sequence
MDIKWLLFCG VLLPAQGVLG QQIASNSSPL YASVRSTVTG LRASETAVIT VTGKVTDEKG 
DALPGATVSL KGGSVGANTD ADGNYTLRIP DGTPNPVLVF SFIGYTLQEV AIGNQTVVNV
QLKGDAKSLN EVVVVGYGTQ KRSDITGSVA SVPKTRLSQL PVTNVLQAIQ GSVAGVNISQ
SSSVPGAAPS TTIRGQNSIN ANSGPYVVVD GIPLSKTGGS LSDINPNDIE SMEVLKDASA
VAIYGTNGAN GVILVTTKRG NTGKPTIRYN NYVGVENFAH MLRPRNGAEY VQKYADYMAQ
TGQKLVNPVP NYDELANYNA GITTDWMKEA TQTGVLQDHN LSISGGSPNV RYFISGEFLD
QKGVIKGYQY KRASFRSNLD VTLTDYLTVG TSLFIANSNR DGGRANMLNA SAMSPYGQEY
NADGTYRIYP MFPEQLYTNP MIGLTVDRVD RNTNLNGNAY LELKLPGKLN GLKYRMNLGY
SYIPARTASY NGRAANDLLG TANTFFSETN SFTLENILSY SRDFGKNHFD FTGLYSAQQR
KYATATGTAT GFVNDQLSFN NLGAGATQSS NSYADRYGLN SQMGRVNYSY DSRYLFTVTA
RRDGSSVFGA NTTKYGLFPS AAIGWNISNE AFMKNVNLVS NLKLRFSYGK SGNEAISVYR
TITTDNTVRS PFNGVSTIGA QPGNLGNANL QWETTLSRNI GVDFGILNNR INGSLDLYKN
NTKGLLLLRS LPILTGYSSV YDNLGETSNT GIELTLNTRN VTNGDFKWES TVVFASNRNR
ILDLYGDKKD DLGNRWFIGQ PISVVYDYKL AGVWQTGEDA SSQDPGAVAG DLKFADLNGD
KKITADGDRM ILGQTAPKWT GGLTNTFHYK NFNLNVFIQT VQGITRNNAD LTYADETGKR
NTPIDVGYWT ANNKSNTRPS LAFKNPRGYG YASDASYTRI KDVTLSYVFD QKLLDKLHLG
SLTVYASGRN LYTFTNWIGW DPEAVQSSRG SGDWTNNYPL TRSFVMGLNI SLR