Gene Slin_1702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1702 
Symbol 
ID8725439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2037812 
End bp2040862 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003386547 
Protein GI284036617 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.784403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAA AATTACTCCT ATTCGTATGG CTGTTTTTTT GTGTTACCGG TTTGGTATTC 
GCTCAGGAAC AGTCTGTTAC GGGAAAGGTG ACGGATGCGG ATGGTAATCC GATTCCGGGG
GCCAGTGTAG TGCTAAAGGG ACGTACAGTA GGTACAAATA CTGACTCAAA TGGGGCCTTT
AAATTGAATG TGCCGGCTAA TGGTACACTT ATTTTCAGCT TTATTGGTTT CGCTACACAA
GAGGTAGCGA TTGGTAACCG TTCTGTTGTA AACGTACAAT TAGTCGATGG TAATCAGCAA
CTCGATGAGG TAATCGTAAC AGGACTTGCA ACAAGTGTAA AGCGTTCCAA TTCGGCCAAT
GCCGTCGCGA GCTTATCGGC CAAACAACTT ACCGGTGCTA CCACACCGGT AACTACCGAT
GGCGCTATGC AGGGTAAACT TGCCGGAGCT AACATTCAGG CCAACGGTTC TGTACCGGGC
GGAGGCTTTA ACGTACAGCT ACGGGGTGTG TCCACGTTGG GTTCGTCGGC GTCGCAGCCA
CTGTACATTG TGGATGGTGT TTATATTGAC AATGGCCAGT ATTCAAGTGG TCGTTCGGAG
GCCAACAAAG CCGGGGCCGG TTCGGCGACC GCTTCGCAGG ATAACAACGG TAACCGCCTG
GCGGACCTGA ACCCCGACGA CATTGAGAGT ATGGAAGTCC TGAAAGGCTC ATCGGCAGCG
GCTATCTACG GAACCCGCGC CAACGCCGGG GTTATTATCA TCACTACCAA GCGTGGTAAA
GGCGGACGGA CCAATGTATC GTTTGGACAG GATTTGGGCA TCTCCAAAGC GTATAGTTAT
TATGGTGGAG CGGACTGGAC AGCCGATAAA CTGACCAATT ATTTTTCCGC AGCAGACATA
TCAAAACTAC AGGCTGCCAA GCAGAATGGT ACCTATACGG ATTGGGAACG GGTAATTTAT
GGTGAAACCG GTTCTATCAA AAATACCCAC CTGAGCGTAA CGGGTGGGAA TGAAAAAACC
AAATTCTACG TAAACGGAAG CGCATCGAAT GAAACAGGTA TCATTAAAAA TACGGGTTTT
ACGCGTTATT CGATCCGGGC TAACATCGAT CATAAATTGA ATAACTGGAT CGACTTTGGT
ATCTCGACAA ACTACGTTAA TTCGAACAAC GACCGGGGCT GGACAGGTAA CGATAATTCG
AACATCAACT ACGGGTACTC ATTGCCCTAC ACCAAACCGT ACACTAACCT GTACCCGGAT
GCAACCGGTG TTTACCCCGA TAATGACCCG AGCGTAGGCG AAAATCCGCT GGCTATTCGC
GACCGGGCCG TTAACAACCA GAAAGTGAAT CGCTTTATTC AGGGCTTCAA CGCCAACTTC
CGACTCATCA ACAATGCAAC GACTTCGCTG ACCATTAAAG TAAATGGTGG TCTGGACTAT
CAGAGCGGTT TTTCCCGAAT CTGGTTACCA ACCGATCTTC AGTCGCAGCG GCAGGAAGCC
AATCCGGGCT TCGCGCAGGA TACCCGTACG GAAGTGTACA ACTCCAATAT TCAGGTGGCC
GGTGTGTTTA CCCATGCCGC GATGGGTGGT AAGCTTAACC TGACATCGTC GGCGGGGGCT
GTGCGTCTGC ATCGGGATTT CAACTATAAT TACGTTCGGG GGCAAAAATT GCCGGTGGGG
GTTTCCAACC CGGCACGTGG TGGTGTACAG TCGATTGCTG CCGAATATCA GCTTAATACC
GACGTTGGCA TTTTTGCCCA GCAGGAAGCT AACTACGACG ATAAAGTTAT TGGAACCGTC
GGGATTCGTT TCGATAAATC GGATTTGAAC GGCAACAACT TCGGTAAATA CTACGCATTC
CCGAAAGCAT CGCTGGCCGT TAACCTGACC CGTTTCGGTA ACTGGGCGAT TGCTTCGGGT
GCCATAAGTG CCCTTAAGCC GCGTATTGCT TATGGTTCTA CGGCTGGTTT GCCAAGCTGG
GGAACACCCT ACTCGCAGTT AGGCTCTACG GGTATTGGCG GGTTGAGCGG CTTACAGCCA
TCAACGGTAT TGGGTAACAA CCAGATCAAG CCGGAACGCG CTACCGAACT GGAGTATGGT
CTGGACTTTG GCCTGTTCAA CAACCGCATT ACCGGCGAAT TTACCTTGTA CAACAAGAAA
GTGTTCGACC TGATTCAGCC GTTGACCACG GCCCCAACCA CGGGGGTTAC ATCCACCAAC
ATCAACGCGG CCGATTTGGT AAACCGGGGC CTTGAGTTGA CCATTGGCGC TGAGGTTATC
CGCAGTAAAG CTATTACCTG GTTCGTCCAG CCGATCTTCT GGTTCAACCG CTCCGAAATC
ACCCGCCTGG ACATTCCCGA GCGCCTGACC GGTGGCTTTG GGGCTACGTT TGGTCAGTGG
CGGGTAAAAC AGGGGTACTC ACCAACCCAG ATTGTGGGTC AGCCCCGTAC GCTGGCGGCA
AGTGATCCGG GCTATGCCTC GTCGTGGACG AACTATGGTG ATCAGCAGCC TAAGTACGAA
TTCTCGCTTA ATCAGCGCAT TACCTTCCTG AAGAACTTCG AGTTTTCGGC TCTGTTGCAT
TACCGGCATA AGTTCACGGT TGTTTCGCTG CAACGCGTTC TGTGGGATGA AGGCGGGAAC
ACCTCCGACT GGAACAGTAC AAGTCTGGGT CTGACGGATG GTGGTAAAGT AGCTGGTTCG
GGCGATCAGG TGGCACCAAA TGGTATTGCC CGCCAGAATG TGAATGGACT GGACGCCAAT
GGTGTTCCCC GCGAAGGATA CAACCCATCA ATTGCCAGCT TCCTGAAAAT GCGTGAAGTT
TCGCTGTATT ACCGGGTGCC AAAAGCGGTG CTTGGTTCGG CTTTCCGCAA TGTTATTCAA
GGGGTACGCG TTGGTGTTTC GGGAACGAAC TTACTCCGCT GGACGAATTA CAAAGCAGGT
TACGATCCGG AGAACTCGAA CTTTGGCTCA CTGGCACTGG GTAGCGGTGT CGACATTGGT
AGCGCGCCAT TGGCCCGTCG GATGATGTTC CACATTGCCA TTGACTTGTA G
 
Protein sequence
MNRKLLLFVW LFFCVTGLVF AQEQSVTGKV TDADGNPIPG ASVVLKGRTV GTNTDSNGAF 
KLNVPANGTL IFSFIGFATQ EVAIGNRSVV NVQLVDGNQQ LDEVIVTGLA TSVKRSNSAN
AVASLSAKQL TGATTPVTTD GAMQGKLAGA NIQANGSVPG GGFNVQLRGV STLGSSASQP
LYIVDGVYID NGQYSSGRSE ANKAGAGSAT ASQDNNGNRL ADLNPDDIES MEVLKGSSAA
AIYGTRANAG VIIITTKRGK GGRTNVSFGQ DLGISKAYSY YGGADWTADK LTNYFSAADI
SKLQAAKQNG TYTDWERVIY GETGSIKNTH LSVTGGNEKT KFYVNGSASN ETGIIKNTGF
TRYSIRANID HKLNNWIDFG ISTNYVNSNN DRGWTGNDNS NINYGYSLPY TKPYTNLYPD
ATGVYPDNDP SVGENPLAIR DRAVNNQKVN RFIQGFNANF RLINNATTSL TIKVNGGLDY
QSGFSRIWLP TDLQSQRQEA NPGFAQDTRT EVYNSNIQVA GVFTHAAMGG KLNLTSSAGA
VRLHRDFNYN YVRGQKLPVG VSNPARGGVQ SIAAEYQLNT DVGIFAQQEA NYDDKVIGTV
GIRFDKSDLN GNNFGKYYAF PKASLAVNLT RFGNWAIASG AISALKPRIA YGSTAGLPSW
GTPYSQLGST GIGGLSGLQP STVLGNNQIK PERATELEYG LDFGLFNNRI TGEFTLYNKK
VFDLIQPLTT APTTGVTSTN INAADLVNRG LELTIGAEVI RSKAITWFVQ PIFWFNRSEI
TRLDIPERLT GGFGATFGQW RVKQGYSPTQ IVGQPRTLAA SDPGYASSWT NYGDQQPKYE
FSLNQRITFL KNFEFSALLH YRHKFTVVSL QRVLWDEGGN TSDWNSTSLG LTDGGKVAGS
GDQVAPNGIA RQNVNGLDAN GVPREGYNPS IASFLKMREV SLYYRVPKAV LGSAFRNVIQ
GVRVGVSGTN LLRWTNYKAG YDPENSNFGS LALGSGVDIG SAPLARRMMF HIAIDL