Gene Slin_0955 details


Gene Information

Locus tag: Slin_0955
Symbol:
ID: 8724685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1154932
End bp: 1158171
Gene Length: 3240 bp
Protein Length: 1079 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003385805
Protein GI: 284035875
COG category:
COG ID:
TIGRFAM ID:
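
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (the listed gene length is consistent with that) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1154932, 1158171)
    print(length)                  # 3240
    print(protein_length(length))  # 1079
```

Both results agree with the Gene Length and Protein Length values in the record.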


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.14605
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.53847
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTGTC TGTTTTCAAG CAGCTACACT CAGGCTAAAC CCTTTAATCA TAACACTATG 
AAAAAGCTCT ATCAGTTAGT TGCCTTTAGT GTATTGGCTG GCTATTGTAC AAGCCATGCC
TTTGGACAGG GTGTCAGTAA AGGCAGCCCA CCGCCCCGTA TAATGGAGCT GATGGCGTTG
GCCAGTTCTG CGCAGCACTC AACGAGTAGC GATCCGAAAG TCGCTGATGT TGTTGTGACA
GGGAGAGTGA TTGATGAAAA GGGAAGTGGA TTGCCGGGCG TTAGTGTCGT TCTGAAAGGA
TCAACACAGG GCACAACGAC CGATGGTTCC GGGGGGTATC GGATTTCGGC CCCCAACTCG
GCGGCAACGC TTGTTTTTAG TTTTGTAGGC TATCAAAAAA AGGAAGTACT GATCGGGAGC
CAAACGTCGA TAACCGTTTC CCTGATTCCG GACGATCAGA CGCTCAGCGA AGTAGTTGTG
GTGGGCTACG GTTCGCAACG TCGGCAGGAT TTAACGTCGG CCGTTTCGGT CATTAATATG
CGGGATATTG GCGAACAGCC CGCCAACAAC CCTAACCAGA TTCTTCAGGG ACGCGCACCG
GGCGTTGTTG TCAAACAAAA AAGTGGTACA CCCGGTGGTG TGTTTGAGGT TCGGGTTCGG
GGTATTGGCT CGCTGGGTGC CGGTAGTAAC CCGCTGTATG TTATCGATGG ATTTGCGGTT
GGTACAACAG TTGGACAAAA CCTCAACCCA AATGATATTG AGAGTGTCAC GGTGCTGAAA
GACGCAGCGT CTACGGCTAT TTATGGTGCC AGAGGTTCTA ATGGGGTAGT GCTCATTACG
ACCAAACAGG CTAAAGAAGG GAAAGTTAAT GTCAACCTGT CCCTCGATTA CGGGATTCAG
GCAGTGCCCC GGTCCAGACG GGTAACCATG CTGACCGGTC CTGAATTTGC CCAGTTCAAG
AAGGACATTT TTATGGATCA GATTCGCATT CTTCAGAATC GGGAACCCGC CGAAAGTGAA
GTGCCTATTG GTTACCGGTT TCCGGAGCAG ACCAAGTACT CGACCAACTG GTTTGACCTC
ATCATGCACG ATAATGCGCC GTATTCGGAC CTTAACATGA CTGTCTCGTC TGGCTCGGGT
CCGTTAAAAT CGTTGATTTC GGTGGGGTAT TACAAGGAAG ACGGCATTAT CAAAAATACA
AATTATGATC GTATCTCCGT TCGCTCCAAT CTGGGTGGGC AAGTCAATAA ATTCATTAAC
GTTGGACTGA ATATCAATGG TACCTACACC CGCCAGAACC TGGCCAACAC CGATGGTCGA
AGCGCCCTGG TAGGTGGAGC CTTATTGATG GACCCTCGGG CGACGCCTTA CAATCCGGAT
GGGTCGCTAA TTCCCTACAT CAACGGAGTC GATGGAGTTT TTGGCTTCCC GAACCCGCTT
TTTGTGCTGC AAAACGTATA CCGCAAACGG AACATCGCCG ATCTGTTGAC CAATGGGTTT
GTCGAGCTGT CCTTCCTGCG GAATTTCAAG TTCAGGACCT CCGCAAACGT AAAACTGACC
AACAACACTT ACAAAGAGTA TGTGCCATCG ACTATCGGCT TGTCGGTAGC GTCCGGTACG
GCCGGTGCGC CCCCACGAAT TGCTACCCAA ACGGATAATA CCGAAGAGCT GACGAATTAC
TCACTGGATC AGTTGCTTAC CTACAAGCCG CAATTGTCGG CAAATCATAG TCTGGACGTT
CTGGTAGGCT ATACCGCTCA GCAGGAGAAA GTACGCGGCT TTACGGGTAC TGGTAATACG
TTTCCCGACG ACCTGGTTCC GTTTCTAGGA GCGGCATCTA TCCGGTCGGC TTCTTCCACT
GAATTTGGCT GGAGTATGCT GGCCTATTTG AGTCGCGTCA ATTACTCGTA CAAAGACAAA
TACTTGCTCT CGGCGTCTTT CCGGCGGGAG GGAAGTTCGA GATTCGGGGC GAAGAATAAG
TATGGTGATT TCCCGGCTGC TTCTATCGGC TGGCGCATCA CCGAAGAATC GTTCATGCCC
AAAACAGCCT GGCTCAGCGA CCTGAAACTG CGGGCGAGCT GGGGCGTAAC GGGTAACAAC
GATATTGGTA ACTACCCAAG TCTGGCATTC GTTGGCGCCA ACAACTACAT TCTGGGAAAC
TCATTTGCAG CGGGTAAAGT CGTCAGCTCA TTTGCTAACT CGGAGTTGAA ATGGGAAAAA
TCGAATCAGC TGGATATCGG TATGGATCTG GCCCTGTTCA ACAGCAAGCT GATTTTTAAC
GCCGAATATT ACCGGAAGAT TACCAACGAC ATGCTGCTGC CGGTATCCAT TCCGTCGGTA
TCGGGCTTTA CAACCAGTCT GGATAACATC GGTAAAGTAG AGAACCACGG GTTCGAACTG
GGCGCTGAAT TCCGGACCAA CATTGGTCAG GTCAACTTTC GCACCAACGC CAACATCAGC
TTTAACCGGA ACAAGATTCT GGCTATTAAA GGCGCCAATG ACGCACTGTA TTATGGCAGC
TTCTATGGAG GCTATAACGT TCAGAAGGTA GGACGCCCAA TTGGTATGAT TTACGGCTAC
AAAAAGCTGG GTATTTTTAA CACGCAGGCC GAAATCGATG CAGCACCCAA GCAGGACGGG
GTTATTCCGG GCGGTATGAA GTTCGCGGAT ACCAACGGTG ATGGCGTTAT TTCGTACGAT
ACGCAGGACA TGGTTGAGAT TGGCAACCCG AATCCGGCCT TTACCTGGGC CTGGACATTT
GCCGCAGACT ATAAGAAGTT CGATATCAAC CTGATGTTCC TCGGTGCGCA GGACTTCGAT
ATTTACCGGA ACATTGAGGC TTCGACTATG AATATGGATG GTGTATTCAA CGTGCTCGAT
AAGGCCAAAG ATCGCTGGCG GTCGGCTAGT AATCCGGGGA CAAACCCATC TGACGTACAC
GCGCAGGGCG GTACCAGCTA CTTCAAATGG TCTCGCGAAA GTAGCGACCG GTATGTGTAT
GATGGCAGTT ATGTATGGCT GAAAACCGTT ACCATTGGCT ACAACTTCCC CAAGTTCAAA
TCCATCCTGA GCAATGCCCG GGTTTTCGTA ACGGGTAACA ACTTATTGAT CTTCACAAAG
TATCCCGGCA ATAACCCGGA TGCAGGTGTC CGGAACTCGA ACTCGGTCGA ATTAAACAAT
GACGACGAGT CTTATCCGGT TCCCAGAACC TATGCTGCCG GTATCAAACT GAACTTCTAA
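
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any calculation. The sketch below shows one way to strip the spacing and compute the GC fraction, which can then be compared with the GC content reported in the record. The helper names `clean_sequence` and `gc_content` are hypothetical, and the short input in the usage lines is a toy string, not the gene itself.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    toy = clean_sequence("ATGC GGCC\nAT")  # toy input, not the gene above
    print(toy)              # ATGCGGCCAT
    print(gc_content(toy))  # 0.6
```

To apply it to this record, paste the lines of the gene sequence into a string and pass that string through `clean_sequence` first.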
 
Protein sequence
MSCLFSSSYT QAKPFNHNTM KKLYQLVAFS VLAGYCTSHA FGQGVSKGSP PPRIMELMAL 
ASSAQHSTSS DPKVADVVVT GRVIDEKGSG LPGVSVVLKG STQGTTTDGS GGYRISAPNS
AATLVFSFVG YQKKEVLIGS QTSITVSLIP DDQTLSEVVV VGYGSQRRQD LTSAVSVINM
RDIGEQPANN PNQILQGRAP GVVVKQKSGT PGGVFEVRVR GIGSLGAGSN PLYVIDGFAV
GTTVGQNLNP NDIESVTVLK DAASTAIYGA RGSNGVVLIT TKQAKEGKVN VNLSLDYGIQ
AVPRSRRVTM LTGPEFAQFK KDIFMDQIRI LQNREPAESE VPIGYRFPEQ TKYSTNWFDL
IMHDNAPYSD LNMTVSSGSG PLKSLISVGY YKEDGIIKNT NYDRISVRSN LGGQVNKFIN
VGLNINGTYT RQNLANTDGR SALVGGALLM DPRATPYNPD GSLIPYINGV DGVFGFPNPL
FVLQNVYRKR NIADLLTNGF VELSFLRNFK FRTSANVKLT NNTYKEYVPS TIGLSVASGT
AGAPPRIATQ TDNTEELTNY SLDQLLTYKP QLSANHSLDV LVGYTAQQEK VRGFTGTGNT
FPDDLVPFLG AASIRSASST EFGWSMLAYL SRVNYSYKDK YLLSASFRRE GSSRFGAKNK
YGDFPAASIG WRITEESFMP KTAWLSDLKL RASWGVTGNN DIGNYPSLAF VGANNYILGN
SFAAGKVVSS FANSELKWEK SNQLDIGMDL ALFNSKLIFN AEYYRKITND MLLPVSIPSV
SGFTTSLDNI GKVENHGFEL GAEFRTNIGQ VNFRTNANIS FNRNKILAIK GANDALYYGS
FYGGYNVQKV GRPIGMIYGY KKLGIFNTQA EIDAAPKQDG VIPGGMKFAD TNGDGVISYD
TQDMVEIGNP NPAFTWAWTF AADYKKFDIN LMFLGAQDFD IYRNIEASTM NMDGVFNVLD
KAKDRWRSAS NPGTNPSDVH AQGGTSYFKW SRESSDRYVY DGSYVWLKTV TIGYNFPKFK
SILSNARVFV TGNNLLIFTK YPGNNPDAGV RNSNSVELNN DDESYPVPRT YAAGIKLNF
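
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is a minimal translator, assuming the sense strand is given 5' to 3' and that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons, which this sketch does not model). The function name `translate` and the table layout are illustrative choices.

```python
from itertools import product

# Codons enumerated with bases in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases and last 9 bases of the gene sequence shown above.
    print(translate("ATGAGTTGTC TGTTTTCAAG C"))  # MSCLFSS
    print(translate("AACTTCTAA"))                # NF
```

The two fragments reproduce the first seven residues (MSCLFSS) and the last two residues (NF) of the listed protein.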