Gene Slin_0030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0030 
Symbol 
ID8723758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp33811 
End bp36171 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003384903 
Protein GI284034973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTAC GGCTACTCCT GTTTCTCTTC TTTTTTTTCC TGGGCTTTTG CTCATCTGCT 
GTAGCCCAGC AGCGACTTCG GCTGAGTATT GTTGTTCGGG ACGGCATCAC CCGAAAGCCA
ATTTCCGGCG CCAGCTTATT TATTGCTGAA ACAAAACTGG GAGGACGGGC CGATAGTGCC
GGGGTTATTT CAATTACCCA TCCACCGGGA GTACTCACGG CCTACGTGTC GGCAGTGGGC
TATTTTCGCG GTCGTGAAAC GGTCGTCCTT GATTTTAACA AACGAGTCGA GTATTACCTG
CAACCCCGCT CAACGGATCT GGATGAGGTT GATGTACGCG CCATTCGCAA GGACAAAAAC
ATCAAGGATG TGCAGATGGG ACAGATTCAA CTCAGTATGC CCGAGTTGAA ACGGATGCCG
GTTGTCCTGG GCGAACCGGA TATTCTGAAA GCACTGTCCC TACAGGCCGG TGTTACAACG
GCGGGCGAAG GAGCGGGGGG CTTTAATGTC CGGGGCGGAC GGGCCGATCA GAACCTGGTA
CTGGTTGATG GAGCCCCCCT GTTCAACACA TCGCACCTGC TGGGCTTCTA CACAAACATT
AATCCCGATA TGGTGCAGGA CGTGAGTCTA AGCAAAGGGG CTTTTTCGGC GCAGTACGGC
GGTCGGGTAT CCTCGTTGCT CCTCATGAGT ACCCGAAATG GCAATAAAGA TGGCTGGCGG
GTGTCGGGTG GGGTGAGCCC GGTGAGTGTG CGGGCGGTGG TGGACGGACC AATTACAAAA
AAGCTGACTT TACTCGCTGG CGGGCGCATT GCCTTCCCGA ACTACCTGCT ACAGTTATTT
CCAACGACAA GCGTCAAAAA CAGCCGGGCT TTCTTCTACG ATGGCAATCT TAAACTAACG
TATACACCCG ACGAACGCAA TACCATTTCG CTCTCGGCCT ATCGGAGTCA GGATAACTTT
CGATTTCCGG GTGACACCCT GTACGGCTGG CAATCGAATG TGCTGACGGG CCGCTGGAGT
TACTTGATTC GACCCAATAT GCAGCTCAAT CTGGCCGCTT TGTATAGCGG TTATTACCTC
AACGTTGACG GCGTAACGCC TAATCTGGGT TTCCGCTTCA CCTCGCATAT CGAGCAGCGG
GAAGGGAAAG CGGATGTCTT TTATACGATT GCAAAGAAGC ATAAAGTTCA GGTCGGTGTC
AACGGCATTC TGTACGGTAT CCAGACGGGG GCAATTCAGC CAGCGGGCAA TCTGTCCAGT
ATTAATCCCA AACAGGTGAA CCCCGAACGG GGGCGTGAGC TGGGGGCTTA CGTGAATACC
GAGTGGGAGT TGATGCCCGC CGTGACTCTG ATGGCTGGGC TTCGCTATTC GGCCTTTGCC
ATGCTGGGGC CCCAAACGGT TTACGGTTAT GCGGAGAATG TGCCCGTCTC TCCGGAAACC
GTAACCGACT CGGTGCTCTA CCGTTCGGGT CAGGTGGCGC AGGCGTATGG CGGTTTTGAG
CCCCGGCTGG CCCTGCGCGT GCAACTGGCT AAACATACGG CCATCAAAGC CAGTGCCGGT
CGGACGCGCC AGTATCTGAA CCTTATCTCC AATACAACCG CCATTACACC CCTCGATTTC
TGGAAACTGA GCGACCGCTA CCAGCGCCCT CAGATTGCCG ACCAGGTATC GCTGGGGATA
TTTCAGAACT TTCTGGATAA CGGGCTCGAG CTAAGTGTGG AAGGGTATTA CAAACGGCTT
CAGAATCAGA TCGAATACCG ATACGGAGCT GACCTCATCC TTAATCCGAA ACTCGAAACA
GCCCTGGTGC AGGCCGCCGG AAAAGCCTAC GGAGTTGAAC TGGGCCTGAG CAAAACCAGC
GGACGACTGA CCGGACAGCT TAACTATACG TACGCCCGCT CGCTGATTGC GGTGCAAACG
GCGTTCGATG CCCTCCGCAT CAATGGTGGT GCGTATTATC CTGCCTACAT CGACCGGCCC
CATACGGTCA ATGTGCAGGC CCGCTGGTCG ATGTCGCACA ACTGGTCGTT TTCGAGCAAT
TTTGTTTATT ATACCGGTGT ACCGGCCACC TATCCCGACG GGCAGTATAC GTATAACGGT
GAGCCGGTAC AGGACTATTC CCGCCGGAAT GCTGACCGGA TTCCCGATTA CCACCGGCTG
GACGTTGCCT TCTCCAAAGA TACCCGCTTC AATAAAGCGC AAAAGCGGTA CGGAATCTGG
ACGCTGGGTA TCTACAACCT GTACGCGCAC AAGAATCCGT ACTCCATTTA TTTTACCCGG
TTTAACCAGC GCACCGAATC GTACCGGCTG TCGGTATTTG GTACGATGAT CCCATCCATC
GCTTACAACT TCTTCTTTTA G
 
Protein sequence
MGLRLLLFLF FFFLGFCSSA VAQQRLRLSI VVRDGITRKP ISGASLFIAE TKLGGRADSA 
GVISITHPPG VLTAYVSAVG YFRGRETVVL DFNKRVEYYL QPRSTDLDEV DVRAIRKDKN
IKDVQMGQIQ LSMPELKRMP VVLGEPDILK ALSLQAGVTT AGEGAGGFNV RGGRADQNLV
LVDGAPLFNT SHLLGFYTNI NPDMVQDVSL SKGAFSAQYG GRVSSLLLMS TRNGNKDGWR
VSGGVSPVSV RAVVDGPITK KLTLLAGGRI AFPNYLLQLF PTTSVKNSRA FFYDGNLKLT
YTPDERNTIS LSAYRSQDNF RFPGDTLYGW QSNVLTGRWS YLIRPNMQLN LAALYSGYYL
NVDGVTPNLG FRFTSHIEQR EGKADVFYTI AKKHKVQVGV NGILYGIQTG AIQPAGNLSS
INPKQVNPER GRELGAYVNT EWELMPAVTL MAGLRYSAFA MLGPQTVYGY AENVPVSPET
VTDSVLYRSG QVAQAYGGFE PRLALRVQLA KHTAIKASAG RTRQYLNLIS NTTAITPLDF
WKLSDRYQRP QIADQVSLGI FQNFLDNGLE LSVEGYYKRL QNQIEYRYGA DLILNPKLET
ALVQAAGKAY GVELGLSKTS GRLTGQLNYT YARSLIAVQT AFDALRINGG AYYPAYIDRP
HTVNVQARWS MSHNWSFSSN FVYYTGVPAT YPDGQYTYNG EPVQDYSRRN ADRIPDYHRL
DVAFSKDTRF NKAQKRYGIW TLGIYNLYAH KNPYSIYFTR FNQRTESYRL SVFGTMIPSI
AYNFFF