Gene Slin_2991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2991 
Symbol 
ID8726742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3616965 
End bp3620213 
Gene Length3249 bp 
Protein Length1082 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003387801 
Protein GI284037871 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.40723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCATG CTTTACGACA GAGTTACCAA AAACTGCCGC TGTTTATTTT CTGCTGGCTT 
TTTTGCCTGG GAGCTTTTGC TCAGGAGAGA AAAATCACGG GGCGAATTAC AGATGGTAAT
GACAATAGCG CACTTCCCGG TGCAAACGTA GTTGTCAAAG GCACACAAAC GGGCGTAGTG
ACGGATGCCA ATGGGCAATT CTCCTTAAAC GTGGCAACCG GCCGGGACGT ACTCACAATT
TCGGCCATTG GCTATGCCTC ACAGGAAGTT ACCATTGGGG CGCGCACGTC GCTGAACATT
TCGTTATCCC CCGATATCAA AACGCTTAAT GAAGTTGTCG TAACGGGTTA TGGTGCACAG
GCCAAACGGG ATATTACGGG TGCCGTAGCA ACTGTCGATA CCAAACAACT CCTCTCGGTT
CCCTCAACCA ACGTTGGTCA GGCTCTACAA GGTCGTGTTG CGGGGGTTCA GGTAGGGAAT
GAAAACTCCC CCGGTGGCGG GGTCATGGTT CGTATCCGCG GTTTCGGTAC GATCAACGAT
AACTCGCCCC TGTACGTCAT TGATGGCGTG CCTACCAAAG GCAACCTGAA TACATTGAAC
CTGAATGATG TAGAGAGCAT GCAGATTCTG AAAGATGCGT CTGCTGCATC TATTTACGGC
TCACGCGCCG GTAATGGTGT GGTTATTATC ACAACCAAGA AAGGAAAAGC CGGAAAGCCC
AAATTTACGT ACGATACGTA CTACGGTTCG CAACGGCACG GTAAGCTGCT CGATATGCTG
AACACGCAGG AGTACGCCGA CCTGATCTGG GAATCCCGCA GAAACTCCGG TGTACTAGGC
CCCAATGGAA ACCCGGTTCA CTCGCAGTTT GGCAATGGTG TAACCCCGGT CATTCCGGAT
TTCGTGCTCC CTACTGGTGC CTCGGCCAAC GACCCTCGTT TAGCCCGAAA CCCGGATGGA
ACGTATGTCA ACTACAATAA CGACATCAGC TCGCCAGGTT TTCTGCTGAT TACGCCGTCC
AATAAAACAG GAACCAACTG GATGGAAGAG ATTTTTACCA CGGCCCCCAT CCAGAACCAT
CAGTTAGGCG TATCGGGAGG TAGTGAAAGC GGTCGTTACG CCATGTCGCT GAATTACTTC
AACCAGGATG GTATTATGAA GTATACGGGC TACAAACGCT ATTCGTTACG GGCTAATACC
GAGTTTAACG TCAACAAACG GGTTCGTGTT GGCCAGAACT TCCAGGTAGC TTATGGCGAG
CGCATTGGTC AGCCAAATGG TAATAACGCC GAAAGTAACC CCGTTTCGTT CGCCTACCGT
ATTCAGCCGA TCATTCCGGT TTATGACGTA GCCGGAAATT TTGCAGGTAC ACGCGGGGGT
GACCTCGACA ATGCCAATAA CCCGGTTGCC CTGCTGTACC GCAATAAAGA CAACGTTCAG
AAAGAAGTCC GGCTATTTGG TAATGCCTTC GCTGAAGTCG ACATCCTTAA AAACCTGACA
GCCCGCACTA GCTTCGGTAT TGATTACAAC CTTTATAACT ACCGAAACTA TACCATTCGG
GACATCGAAT CGGCCGAAGC ACGCGGTTCG AACCAGCTCC AGACCAACAA CAATTATGAA
TGGACCTGGA CCTGGTATAA TACGCTGACA TATAATGTTA ACCTCGGCGA CCGGCATCGT
TTCAATGTAA TCGCCGGTAC CGAGTCGATC AAAAACTATT TTGAAACCTA CGATGCTACC
CGGACAAATT TTGCGGTAGA CGACATCGAA AACCGCTACC TGAGTGCCGG TACGGGAGTT
CAGACCAACA ACGGAGGTGC GTCGAACTGG CGGCTGGCAT CGGAGTTTGC TAAAGTTAAC
TACGCGCTCG ACGATAAGTA TCTAATCGAC CTGACCGTCC GGCGTGACCG GTCGTCCCGT
TTTGCGAAGG AATTCCGGTC GGCGGTATTC CCGGCTGCGA GCGTAGGCTG GCGTGTTTCG
AAAGAAAACT TCTTTAAGCC ACTCACGCTG TTCGATGACT TGAAATTCCG CGCAGGTTGG
GGCCAGACGG GTAACCAGGA GATTGGTAAC TACAACTCGT TCACCCAGTT CAGCACAAAC
CCTATTACGT CGTTCTACGA CATCAACGGC ACGCGTACGT CGGCGGTGCC CGGTTATGAA
CTTACCCAGT TTGGTAACGC CAAAGCTAAA TGGGAAACCA CGACCAGCCT GAACATTGGT
TTCGACGCCA GCCTGCTTAA AAACAAGCTG ACCGTTGGCT TCGACTGGTA TACCCGCACC
ACCTCCGACA TGCTTTTCCC TGTTCAGGCC CCGCTCACTC AGGGCGTAGC CACGGTGCCT
TTCCAGAATA TTGGTTCGAT GCGTAACCGT GGTATTGACC TGATGATCAA TTACGGCGAT
AAGATCGGTT CCGGCGGCCT CACGTATAAT GTAGGTGCCA ACTTCAGCAC CTACCGCAAC
GTGGTAACGA AGACGAATGG TGATCCCAGC ACGCAGTACT TTGGTATTAA CGATGAGCGG
ATTCAGAACT TTGTGGTCAC CCAGCAAGAC TACGCTATTT CTTCCTTCTT TGGCTATACA
ATCGACGGCA TCTTCCAGAC CAACGAAGAA GCTAAAGCTG CGCCAATCCA GTTTGGTAAC
GCAGCCGCAG AGAACGTAGC TGGTCGCTTT AAATTCCGCG ACATCAATGG TGATGGTAAA
ATTGACACCA AAGACCTCAG CATCATTGGC AGTCCGCATC CGAAGTTCAC ATATGGCCTT
AATCTCAATC TGAACTATAA AAACTTCGGA CTAACCCTGT TTGGACAAGG GGTTGAGGGC
AATCAGATCT TCAATTACAC CAAATACTGG ACGGACTTCC CAACGTTTGG CGGTAACCGC
AGCTCCCGCA TGCTGTATCA ATCCTGGCGG CCCGGCAAAA CGGACGCTAT TCTGCCCCAG
CTTCGCTCAA GCGATCAGGT TAGTATCCAA CCGTCTACCT ATTACCTGGA AAGCGGCTCA
TATTTCCGGA TGAAAAACAT CCAGCTTACC TACCAGCTGC CACAGTCACT GCTCTCGAAA
CTGGGTGTTG GCGCTACCTC AATTTACATT CAGGGCCAGA ACATGTTCAC CATCACCAAA
TACTCCGGCA TGGACCCTGA AATTAACCTG CGTAGCTATT CGGCCGGTAA CGACCGCCAG
ATTGGCGTAG ATGGCGGCTC TTACCCGGTA GCCAAAACCG TATTAGTTGG TTTGAACCTG
TCATTTTAG
 
Protein sequence
MKHALRQSYQ KLPLFIFCWL FCLGAFAQER KITGRITDGN DNSALPGANV VVKGTQTGVV 
TDANGQFSLN VATGRDVLTI SAIGYASQEV TIGARTSLNI SLSPDIKTLN EVVVTGYGAQ
AKRDITGAVA TVDTKQLLSV PSTNVGQALQ GRVAGVQVGN ENSPGGGVMV RIRGFGTIND
NSPLYVIDGV PTKGNLNTLN LNDVESMQIL KDASAASIYG SRAGNGVVII TTKKGKAGKP
KFTYDTYYGS QRHGKLLDML NTQEYADLIW ESRRNSGVLG PNGNPVHSQF GNGVTPVIPD
FVLPTGASAN DPRLARNPDG TYVNYNNDIS SPGFLLITPS NKTGTNWMEE IFTTAPIQNH
QLGVSGGSES GRYAMSLNYF NQDGIMKYTG YKRYSLRANT EFNVNKRVRV GQNFQVAYGE
RIGQPNGNNA ESNPVSFAYR IQPIIPVYDV AGNFAGTRGG DLDNANNPVA LLYRNKDNVQ
KEVRLFGNAF AEVDILKNLT ARTSFGIDYN LYNYRNYTIR DIESAEARGS NQLQTNNNYE
WTWTWYNTLT YNVNLGDRHR FNVIAGTESI KNYFETYDAT RTNFAVDDIE NRYLSAGTGV
QTNNGGASNW RLASEFAKVN YALDDKYLID LTVRRDRSSR FAKEFRSAVF PAASVGWRVS
KENFFKPLTL FDDLKFRAGW GQTGNQEIGN YNSFTQFSTN PITSFYDING TRTSAVPGYE
LTQFGNAKAK WETTTSLNIG FDASLLKNKL TVGFDWYTRT TSDMLFPVQA PLTQGVATVP
FQNIGSMRNR GIDLMINYGD KIGSGGLTYN VGANFSTYRN VVTKTNGDPS TQYFGINDER
IQNFVVTQQD YAISSFFGYT IDGIFQTNEE AKAAPIQFGN AAAENVAGRF KFRDINGDGK
IDTKDLSIIG SPHPKFTYGL NLNLNYKNFG LTLFGQGVEG NQIFNYTKYW TDFPTFGGNR
SSRMLYQSWR PGKTDAILPQ LRSSDQVSIQ PSTYYLESGS YFRMKNIQLT YQLPQSLLSK
LGVGATSIYI QGQNMFTITK YSGMDPEINL RSYSAGNDRQ IGVDGGSYPV AKTVLVGLNL
SF