Gene Information |
Locus tag | Slin_2412 |
Symbol | |
ID | 8726156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2909307 |
End bp | 2912396 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003387231 |
Protein GI | 284037301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
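The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon, which is not counted in the protein length. Neither convention is stated in the record. The function names are hypothetical and used for illustration only.

```python
# Values copied from the Gene Information table above.
START_BP = 2909307
END_BP = 2912396
GENE_LENGTH_BP = 3090
PROTEIN_LENGTH_AA = 1029


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one, assuming a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 3090
    print(expected_protein_length(GENE_LENGTH_BP))  # 1029
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.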
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.525644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTTT CTACCACTAG TTCAGCCCAG TCAAAAATCG GGTGGCTCGT AGGCCTGCTG CTCGTACTGG AGCTGTCTGC GCTGGCCCAA TCGAACCGGC CCGTTACGGT CAGCGGTCTG GTTACCTCAG CCGAAAGTAG CCAGGGAATA CCCGGCGCCA ACGTTATGGT TAAAAACACC CAGCAAGGCA CGACGACCAA CGCCAACGGG GAGTTTACGC TGGCAGCCCC GGCGGGTTCA CTTGTGCTGA TCGTATCGTC GATTGGCTTC CAGACACAGG AAGTACCGGT TTCGGGCCAG AGCAAGCTAA CCATCGCCCT GCAAGCCGAT AATCGTTCAT TGAACGAGGT CATCGTTGTC GGCTACGGTA CGGTAAAGAA GAGCGACTTA ACCGGCTCGG TTTCATCAGT GAGAGCCGCT GAACTAAAAC AAACTCCGAT TGCCAACTTC GTTCAGGGTC TTCAGGCCCG GGCTTCGGGG GTGCAGGTTA CACAGAACTC AGGCGCACCG GGGGGCAGCA TCAGCGTTCG CATTCGGGGC AATAACTCGA TTAGCGGCAG CAGCGAACCG CTCTATGTAG TCGACGGATT CCCCATTGCC GGGGGCGACA ACCCGGTGGC GGGGGGCGGC AGCGGACTCG GAAATGACAA TGGCAACCGG CTCTCGGTGC TTTCTACGTT GAACCCCAAC GATATTGAGT CGATGGAGGT GCTGAAAGAT GCCTCGGCCA CCGCTATTTA CGGCACGCGT GGGGCCAATG GCGTAGTGTT GATTACGACC AAACGGGGTA AATCCGGCAA AACCCGCGTG AGCTATGACG GCTATTACGG GCAGCAGCAA ATCCGAAAAA CGCTGGATGT AATGAACGCC ACGCAGTTTG CCAAGTACGA AAACGAAATT ACCGGTACGC AACTTTATCC CAATCCCGAT CAATTAGGGC AGGGTACCGA CTGGCAGTCG CTTATTTTCC GCAAAGCCCC TATGCAGAGC CACCAGCTAT CCGTATCGGG CGGCAACGAA CGCTCGCAGT TCGCGCTGTC GATGAATTAT TTCGATCAGG ACGGGATCAT CATTAACTCC AACTTCAAGC GGGGATCGGT GCGGGTCAAT CTGGATAATA CGATCAGTAA AAACCTTAAA ATAGGCACGA GCCTGACCTA TACCTACTCC GTCAACAATG GAGCCATTAC CGCCACGCTG GGCGATGGTG GTCCGGCGGG GGGAATTATT TTGTCGGCAC TTACGGCTCC GCCCGTCTTT TCCCCCTACA ATGCCGACGG TTCACCGACT ATTTTCACAA ACCGCTACCT GGACCTCAAC AACCCCGTTG CGCTGGCTAC GGAGGTGATG AACCGGAACA CAACCCGCCG TTTTCTGGGT AACATCTTTG CCGACTGGAC CATTACCAAT GGCTTAACTT ACCGGGCTTC GTTTGGGGGC GATCTCGTTA CGGACACCCG CGACTCATAC GTTACCCGCA ACATCCGGGC GGGTTCGCAG GTGAACGGCA TTGGCGGAAA AGGGAATGCC AACACCAATA CGGTACTGCA CGAGAGTCTG CTGAATTACC ATCGGCTGTT TGGTGTGCAT GATGTAAACG TGACCGGCGT GTTTTCGACC CAGGGGCAGC TACAAACTGC CGATGCCATG ACCGGTCAGC AATTTCCCAA CGACCTTGTG CTGAACAATA ACCTCTCGCA GGCATCCATT TTAACCATTG CCAGCAACAA ACAGGCGTGG CGTTTAGACT CGTACACGGG GCGGATCAAC TACAATTACA AAAGCAAATA CCTGCTCACG 
CTGACGGGCC GGGTAGATGG CTCAAGCCGG TTTGGCGATA ATAACAAGTA CGGATTTTTC CCCTCGGTAG CGGGGGCATG GCGAGTATCG GAAGAAGGAT TCATGCAGGG GCAGCAGGTG TTGAGCGACC TGAAACTCCG GGCCAGTTAT GGCATAACGG GCAACGCCGA TATTCCACTC TACAACTCTT TATCGCGACT CAACTCGGTT GGAAATTACA ACTTCAATAA CGTGCGTACC ATCGGTATTG CCGCAGCCAA CATCAGCAAT CCCGACCTGA AATGGGAGAA AAGTGCACAG GCCGATATTG GCCTGGATTT CGGCTTGCTC AACAACCGGA TTCAGGTAAC AGCCGATGTG TATTACAAAA AAACGACCGA TCTGCTGCTG TCGCGTACCA TTCCACTCTC GTCGGGTTTC GGGTCGGTAT TCGGCAACTT TGGTTCGGTC GAGAACCGTG GCATCGAAGT AACCGTCAAT GCGGGCGTAC TGAATGGTCC GCTGAAGTGG GATATTAACG GCAACATCTC GGCCAATCGC AACAAGCTCA CGCTCATCGA CGGGACCCGT ACCGAAATCA TTCCCGGTGG TGGCGATGCC TCTATTGGTG CTTTTACCAA CAACAGCATC CTGCGGGTAG GTGCGCCCAT CGGCTCGTTC TATGGCTATG TGTTCGATGG CATTTACCAA ACCGGCGACA ACATCCCGAC GGGGCGTATT CCGGGCAACA TCCGGTATCG TGATCTGAAT GGCGACGGGG TCATTTCGGG AGCGGATCAG ACCATTATCG GTAACCCGAA CCCGAGCTAT ATTTTCGGAA TCAACAACAC CCTGAAGTAC AAAGGCTTCG ACTTAAGCTT ATTCGTGCAG GGTGTACAGG GCAACCAGAT CTTTGCCGTT TCGAGAGTCC GGCTGGAAGC CGGGGCGGGT GCCATCAATC AGTATGCCAC CTACGTAAAC CGCTGGACAT CCACCAATCC ATCGAACCAG TACCTGAAAG CCTCGACGGG GCAGCGGGTA AACCAGTCGG ACATTCACAT CGAAGATGGT TCGTTTGTCC GCTTCAAGAA CATAACCCTC GGTTACACGA TTCCTGCCGC TGGCAAACTG GCCTGGCTGG CGAATTCGCG GGTGTATGTG AGTGCCAACA ATTTTGCTAC CCTGACCAAC TATTCGGGCT ATGATCCGGA GGTGAACACC GCAGGGCAGA ACAACCTCAA CCTGGGTGTA GACAACATCG GATTCCCCGT ATCCAAGTCG TTCATTGCCG GTCTTCAACT CAACTTCTAA
|
Protein sequence | MPFSTTSSAQ SKIGWLVGLL LVLELSALAQ SNRPVTVSGL VTSAESSQGI PGANVMVKNT QQGTTTNANG EFTLAAPAGS LVLIVSSIGF QTQEVPVSGQ SKLTIALQAD NRSLNEVIVV GYGTVKKSDL TGSVSSVRAA ELKQTPIANF VQGLQARASG VQVTQNSGAP GGSISVRIRG NNSISGSSEP LYVVDGFPIA GGDNPVAGGG SGLGNDNGNR LSVLSTLNPN DIESMEVLKD ASATAIYGTR GANGVVLITT KRGKSGKTRV SYDGYYGQQQ IRKTLDVMNA TQFAKYENEI TGTQLYPNPD QLGQGTDWQS LIFRKAPMQS HQLSVSGGNE RSQFALSMNY FDQDGIIINS NFKRGSVRVN LDNTISKNLK IGTSLTYTYS VNNGAITATL GDGGPAGGII LSALTAPPVF SPYNADGSPT IFTNRYLDLN NPVALATEVM NRNTTRRFLG NIFADWTITN GLTYRASFGG DLVTDTRDSY VTRNIRAGSQ VNGIGGKGNA NTNTVLHESL LNYHRLFGVH DVNVTGVFST QGQLQTADAM TGQQFPNDLV LNNNLSQASI LTIASNKQAW RLDSYTGRIN YNYKSKYLLT LTGRVDGSSR FGDNNKYGFF PSVAGAWRVS EEGFMQGQQV LSDLKLRASY GITGNADIPL YNSLSRLNSV GNYNFNNVRT IGIAAANISN PDLKWEKSAQ ADIGLDFGLL NNRIQVTADV YYKKTTDLLL SRTIPLSSGF GSVFGNFGSV ENRGIEVTVN AGVLNGPLKW DINGNISANR NKLTLIDGTR TEIIPGGGDA SIGAFTNNSI LRVGAPIGSF YGYVFDGIYQ TGDNIPTGRI PGNIRYRDLN GDGVISGADQ TIIGNPNPSY IFGINNTLKY KGFDLSLFVQ GVQGNQIFAV SRVRLEAGAG AINQYATYVN RWTSTNPSNQ YLKASTGQRV NQSDIHIEDG SFVRFKNITL GYTIPAAGKL AWLANSRVYV SANNFATLTN YSGYDPEVNT AGQNNLNLGV DNIGFPVSKS FIAGLQLNF
|
| |
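The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned and summarised follows. The helper names are hypothetical, the short input string is a toy example rather than the gene from this record, and the start-codon check is a simplification, since translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: whole codons, ATG start (simplified), and a terminal stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


if __name__ == "__main__":
    toy = clean_sequence("ATGGCC TAA")  # toy input, not taken from the record
    print(toy, round(gc_fraction(toy), 3), looks_like_complete_cds(toy))
```

To apply this to the record, the full Gene sequence field would be passed to `clean_sequence` first, and the rounded result of `gc_fraction` compared with the GC content field.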