Gene Information |
Locus tag | Slin_1702 |
Symbol | |
ID | 8725439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2037812 |
End bp | 2040862 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003386547 |
Protein GI | 284036617 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
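The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 2037812
end_bp = 2040862
reported_gene_length = 3051      # bp
reported_protein_length = 1016   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the table: the span covers 3051 bp, which corresponds to 1017 codons, one of which is the stop codon.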
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.784403 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAA AATTACTCCT ATTCGTATGG CTGTTTTTTT GTGTTACCGG TTTGGTATTC GCTCAGGAAC AGTCTGTTAC GGGAAAGGTG ACGGATGCGG ATGGTAATCC GATTCCGGGG GCCAGTGTAG TGCTAAAGGG ACGTACAGTA GGTACAAATA CTGACTCAAA TGGGGCCTTT AAATTGAATG TGCCGGCTAA TGGTACACTT ATTTTCAGCT TTATTGGTTT CGCTACACAA GAGGTAGCGA TTGGTAACCG TTCTGTTGTA AACGTACAAT TAGTCGATGG TAATCAGCAA CTCGATGAGG TAATCGTAAC AGGACTTGCA ACAAGTGTAA AGCGTTCCAA TTCGGCCAAT GCCGTCGCGA GCTTATCGGC CAAACAACTT ACCGGTGCTA CCACACCGGT AACTACCGAT GGCGCTATGC AGGGTAAACT TGCCGGAGCT AACATTCAGG CCAACGGTTC TGTACCGGGC GGAGGCTTTA ACGTACAGCT ACGGGGTGTG TCCACGTTGG GTTCGTCGGC GTCGCAGCCA CTGTACATTG TGGATGGTGT TTATATTGAC AATGGCCAGT ATTCAAGTGG TCGTTCGGAG GCCAACAAAG CCGGGGCCGG TTCGGCGACC GCTTCGCAGG ATAACAACGG TAACCGCCTG GCGGACCTGA ACCCCGACGA CATTGAGAGT ATGGAAGTCC TGAAAGGCTC ATCGGCAGCG GCTATCTACG GAACCCGCGC CAACGCCGGG GTTATTATCA TCACTACCAA GCGTGGTAAA GGCGGACGGA CCAATGTATC GTTTGGACAG GATTTGGGCA TCTCCAAAGC GTATAGTTAT TATGGTGGAG CGGACTGGAC AGCCGATAAA CTGACCAATT ATTTTTCCGC AGCAGACATA TCAAAACTAC AGGCTGCCAA GCAGAATGGT ACCTATACGG ATTGGGAACG GGTAATTTAT GGTGAAACCG GTTCTATCAA AAATACCCAC CTGAGCGTAA CGGGTGGGAA TGAAAAAACC AAATTCTACG TAAACGGAAG CGCATCGAAT GAAACAGGTA TCATTAAAAA TACGGGTTTT ACGCGTTATT CGATCCGGGC TAACATCGAT CATAAATTGA ATAACTGGAT CGACTTTGGT ATCTCGACAA ACTACGTTAA TTCGAACAAC GACCGGGGCT GGACAGGTAA CGATAATTCG AACATCAACT ACGGGTACTC ATTGCCCTAC ACCAAACCGT ACACTAACCT GTACCCGGAT GCAACCGGTG TTTACCCCGA TAATGACCCG AGCGTAGGCG AAAATCCGCT GGCTATTCGC GACCGGGCCG TTAACAACCA GAAAGTGAAT CGCTTTATTC AGGGCTTCAA CGCCAACTTC CGACTCATCA ACAATGCAAC GACTTCGCTG ACCATTAAAG TAAATGGTGG TCTGGACTAT CAGAGCGGTT TTTCCCGAAT CTGGTTACCA ACCGATCTTC AGTCGCAGCG GCAGGAAGCC AATCCGGGCT TCGCGCAGGA TACCCGTACG GAAGTGTACA ACTCCAATAT TCAGGTGGCC GGTGTGTTTA CCCATGCCGC GATGGGTGGT AAGCTTAACC TGACATCGTC GGCGGGGGCT GTGCGTCTGC ATCGGGATTT CAACTATAAT TACGTTCGGG GGCAAAAATT GCCGGTGGGG GTTTCCAACC CGGCACGTGG TGGTGTACAG TCGATTGCTG CCGAATATCA GCTTAATACC GACGTTGGCA TTTTTGCCCA GCAGGAAGCT AACTACGACG ATAAAGTTAT TGGAACCGTC 
GGGATTCGTT TCGATAAATC GGATTTGAAC GGCAACAACT TCGGTAAATA CTACGCATTC CCGAAAGCAT CGCTGGCCGT TAACCTGACC CGTTTCGGTA ACTGGGCGAT TGCTTCGGGT GCCATAAGTG CCCTTAAGCC GCGTATTGCT TATGGTTCTA CGGCTGGTTT GCCAAGCTGG GGAACACCCT ACTCGCAGTT AGGCTCTACG GGTATTGGCG GGTTGAGCGG CTTACAGCCA TCAACGGTAT TGGGTAACAA CCAGATCAAG CCGGAACGCG CTACCGAACT GGAGTATGGT CTGGACTTTG GCCTGTTCAA CAACCGCATT ACCGGCGAAT TTACCTTGTA CAACAAGAAA GTGTTCGACC TGATTCAGCC GTTGACCACG GCCCCAACCA CGGGGGTTAC ATCCACCAAC ATCAACGCGG CCGATTTGGT AAACCGGGGC CTTGAGTTGA CCATTGGCGC TGAGGTTATC CGCAGTAAAG CTATTACCTG GTTCGTCCAG CCGATCTTCT GGTTCAACCG CTCCGAAATC ACCCGCCTGG ACATTCCCGA GCGCCTGACC GGTGGCTTTG GGGCTACGTT TGGTCAGTGG CGGGTAAAAC AGGGGTACTC ACCAACCCAG ATTGTGGGTC AGCCCCGTAC GCTGGCGGCA AGTGATCCGG GCTATGCCTC GTCGTGGACG AACTATGGTG ATCAGCAGCC TAAGTACGAA TTCTCGCTTA ATCAGCGCAT TACCTTCCTG AAGAACTTCG AGTTTTCGGC TCTGTTGCAT TACCGGCATA AGTTCACGGT TGTTTCGCTG CAACGCGTTC TGTGGGATGA AGGCGGGAAC ACCTCCGACT GGAACAGTAC AAGTCTGGGT CTGACGGATG GTGGTAAAGT AGCTGGTTCG GGCGATCAGG TGGCACCAAA TGGTATTGCC CGCCAGAATG TGAATGGACT GGACGCCAAT GGTGTTCCCC GCGAAGGATA CAACCCATCA ATTGCCAGCT TCCTGAAAAT GCGTGAAGTT TCGCTGTATT ACCGGGTGCC AAAAGCGGTG CTTGGTTCGG CTTTCCGCAA TGTTATTCAA GGGGTACGCG TTGGTGTTTC GGGAACGAAC TTACTCCGCT GGACGAATTA CAAAGCAGGT TACGATCCGG AGAACTCGAA CTTTGGCTCA CTGGCACTGG GTAGCGGTGT CGACATTGGT AGCGCGCCAT TGGCCCGTCG GATGATGTTC CACATTGCCA TTGACTTGTA G
|
Protein sequence | MNRKLLLFVW LFFCVTGLVF AQEQSVTGKV TDADGNPIPG ASVVLKGRTV GTNTDSNGAF KLNVPANGTL IFSFIGFATQ EVAIGNRSVV NVQLVDGNQQ LDEVIVTGLA TSVKRSNSAN AVASLSAKQL TGATTPVTTD GAMQGKLAGA NIQANGSVPG GGFNVQLRGV STLGSSASQP LYIVDGVYID NGQYSSGRSE ANKAGAGSAT ASQDNNGNRL ADLNPDDIES MEVLKGSSAA AIYGTRANAG VIIITTKRGK GGRTNVSFGQ DLGISKAYSY YGGADWTADK LTNYFSAADI SKLQAAKQNG TYTDWERVIY GETGSIKNTH LSVTGGNEKT KFYVNGSASN ETGIIKNTGF TRYSIRANID HKLNNWIDFG ISTNYVNSNN DRGWTGNDNS NINYGYSLPY TKPYTNLYPD ATGVYPDNDP SVGENPLAIR DRAVNNQKVN RFIQGFNANF RLINNATTSL TIKVNGGLDY QSGFSRIWLP TDLQSQRQEA NPGFAQDTRT EVYNSNIQVA GVFTHAAMGG KLNLTSSAGA VRLHRDFNYN YVRGQKLPVG VSNPARGGVQ SIAAEYQLNT DVGIFAQQEA NYDDKVIGTV GIRFDKSDLN GNNFGKYYAF PKASLAVNLT RFGNWAIASG AISALKPRIA YGSTAGLPSW GTPYSQLGST GIGGLSGLQP STVLGNNQIK PERATELEYG LDFGLFNNRI TGEFTLYNKK VFDLIQPLTT APTTGVTSTN INAADLVNRG LELTIGAEVI RSKAITWFVQ PIFWFNRSEI TRLDIPERLT GGFGATFGQW RVKQGYSPTQ IVGQPRTLAA SDPGYASSWT NYGDQQPKYE FSLNQRITFL KNFEFSALLH YRHKFTVVSL QRVLWDEGGN TSDWNSTSLG LTDGGKVAGS GDQVAPNGIA RQNVNGLDAN GVPREGYNPS IASFLKMREV SLYYRVPKAV LGSAFRNVIQ GVRVGVSGTN LLRWTNYKAG YDPENSNFGS LALGSGVDIG SAPLARRMMF HIAIDL
|
| |
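The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input, so the result does not reproduce the whole-gene GC content listed in the table.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group residues into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence shown above (sample input only).
first_blocks = "ATGAACAGAA AATTACTCCT ATTCGTATGG"

print(len(clean_sequence(first_blocks)))
print(round(gc_fraction(first_blocks), 3))
```

Applied to the full gene sequence, `len(clean_sequence(...))` would be expected to match the Gene Length field, and `gc_fraction(...)` the reported GC content after rounding, assuming the record is internally consistent.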