Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0277 |
Symbol | |
ID | 8724005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 363888 |
End bp | 367178 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003385140 |
Protein GI | 284035210 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.97163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.971813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAT CGTTCTACAG ATTTCTGCAG ACCGCTTTTC TGGGTACAGT ATTATTACTG TGGAGCTTGA ACGCTTCCGC CCAGGACCGT CGCCTAACGG GTAAAATCAC GGGCGTTGAT GGACCGGTAC CCGGGGCCAA CGTTGTACTT AAAGGTACCC AAACGGGTAC TGCCACTGAT GCTGACGGCA ACTATGCGCT GAACATTCGG GGTGCTAGCC CGGTGCTGGT GATATCAGCC ATTGGCTTCA AAACTCTGGA AGTAGCTGTA GGTAACCGCA CGTCGGTTGA CGTAAAGCTC GAAGACGACG CCACCGCCCT GAGCGAGGTT GTTGTAACAG GTTATTCGAC AGAAAACCGT CGGGATGTAA CCGGCGCAGT ATCTACCGTA AAGCCCGCCC AACTGAAAGT TGTGCCATCA ACAAACGTTG AACAGCAATT GCAGGGCCGG GTAGCTGGTG TAACCGTTAT TACAAACGGC CAGCCTGGTA CGACTAGCCA GGTTCGGGTA CGGGGTTTTG GTTCGTTTGG TGGTAACCAG CCTTTGTACG TTGTTGATGG TGTACCAACC CAGAGCATCC AGTACATTGC TCCGGATGAT ATCGAGTCAA CTACGGTTCT TAAAGATGCG GCTTCGGCAT CCGTTTATGG AGCACGTGCT GCATCGGGCG TTATTGTATT GACAACCAAG AAAGGCCAGC GCCGGGCTCA GAAACTGAGC ATTAGCTACG ATGGCTTGTA CGGTGTTACC GACCCAGGTC ATGGTCAGAA AATTCTGACG CCACAGGAAC AGGCTGACTG GACCTGGCAG GCCCGCAAAA ACGATATTTT TCAGGCTGGC GGAACAGTTG GTCCAACTAG CTTTACGGGT ATTGCTAATG GTCAGTACGG ATCGGGACAA ACACCTGTTC TGCCAGATTA TTTGCTGGTA GGCAACCAGA CAGGCGTATC AGCATCTGCA GTAAACCTGG AAGCAGAGCG TGCAAAGTAT AACATTAACC CTGCCAATGG AGCCATTTAC AATGTTATTC CCGCCAACAA GGCAGGAACA GACTGGTATG GTGCCATCAC TCGCGTAGCT CCCCTGATGC GTCACACACT GGGCTTCTCC GGTGGCACAG AGTCGAGCCG GTTTTACCTA AGTCTGGGTA TGCAGAAACA AGCTGGTATT ATCACCTACA ACGACTTCTC GCGTTATACG TTACGCGTGA ACACGGAATT TGATATTACA AAGAAACTGC GGTTTGGCGA GAACGTGCAG TTGGCCTATG TTTCGGCAAC GGGTCTGCAG GGTAGCACAG GAAGCACACT TGGTAATGGC ACCAACAATA ACTCCAGCGT TGCGGCCGAT GAAAACGATA TCCTCCTGGC GTTTCGTCAG GCACCGATCA TCCCTGTATA CAACGCATTC GGTGGTTATG CAGGTACAGC AGCATCGGGC TTTAACAACG CCCGGAACCC GGTGGCTAAC CGTATCGGAG CGAAGGACAA CATCAACTAC AACCTGATTG CGTTTGGTAA TGCTTACCTG GAGTATGATG TCATTCCGGC GTTAACACTG CGCAGTAGCC TGGGCGGTAC CTATTTCAGC AACTACAACA ATGCGTATAA CCGGTCTCAA TACGAAAACT CGGAGAACAA TACGAACTAT GTATATAACG AATCGTCGAA CGTTGGGTTA GCCTGGACGT TCACCAATAC GGCTCAGTAC AAACAGAAGT TTGGCATTCA TGATGTTAGC GTTCTGGCAG GTATCGAAGC GTTGAACACA GGAAGCGGTC GGGGTATCAG TGGCTCCGGA CTGAACCCAT TCACGACAGA TCCAAACTAC GTAACCATCG GCACGACAAC GCCGGGCGCT ACGCGTAGTG TAAATAGCTA CTACGGCAAG GGCAATAACT TCTATTCTTT GTTTGCTCAG GCGCGTTATA CGTTCAATGA CAAGTATATC CTTACGGGTG TAGTTCGTCG GGATGGTTCG TCGCAATTTG GTTCTCAGAA CCGCTACGGG GTATTCCCCG CTGTTTCGGC GGCCTGGCGT CTGTCATCCG AAGATTTTAT GAAGAACCTG CCGTGGGTAT CTGATTTGAA AGTACGTGGT GGCTATGGCT TGATGGGTAA CTCGAACTAC CTGAGTTCGA CAAACCAGTA CAATCTGTTT GGTTCCAATG CAGGTAACAG CTACGACATT ACAGGGGCCA ACACATCGGT ACAGGCCGGT TACTACCGTA GCCAGATTGG TAATGCTGCC GCCAAGTGGG AAACCAGTAT CACGTCAAAC GTTGGTATAG ACGGTGCTTT CTTTAACAAC AAGTTAGAAG TAGTTCTTGA TTTCTGGCAG AAAGACACAA AGGACCTGCT CTATCCGCTG GCTCTCCCGG GCGTAGTTGG TGTTCGTGCC AATGCTCCTT ACGTGAACGT AGCCAGCATG CGCAACAAAG GTATTGACCT CCTCATTACA ACGCGGGGTA ACGTAGTTGG CGATCTGGGC TATGAGGTTA CAGCCGTTGG AAGTATCCTG GATAACAAGA TCACAGCCAT TGCGCCATCG GTACCTTACT TCACCGCCAA TGGTCAGCGC TTAAGCTCTC CTGTTGTTCG TAACCAGCCT GGTCATGATC TGTCGTCTTT CTATGGCTAC AACGTAATTG GTCTCTTCAA CAGCAAAGCA GAAGTTGATG GTGCTCCTAC GCAACCGGGT GCGGCTCCGG GTCGTTTCCG TTACCAGGAT ATTAACGGTG ATGGTAAAAT CGACGACGCC GACCGTACAT TCCTGGGAAG CCCGATTCCT AAGTTTACGG GTAGCCTGAC GCTGACGCTG AAGTACAAAG GATTCGACCT GAACACACAG GTGTATGCGT CGCTTGGTAA CAAGATCTTT AACAACTCGA AATGGTACAC CGACTTCTAT CCTTCGTTCC CGGGGGCGGC TGTTAGCCAG CGTGTGAAAG ATTCGTGGTT GCCAACGCAT ACCGACACGA AAGTGCCCAT TTTTGAGAAC ACCTCAAACT TCAGCACAAA TACGGAGTCG AACTCCTACT ACGTAGAAAA CGGCTCGTAT GGCCGGATGC AGTACCTGAC GCTGGGCTAT ACCTTCCCAG CCTCCGTGCT GAACCGGGCT AACCTGAGCC GGTTAAGACT GTCGCTGTCT GCTACGAACC TGTTCACAAT TACGAAGTAT TCTGGTTTGG ATCCAGCCGT TGGCGGCTCG GCTGACCAAA ACTTCGGTAT CGACATCGGT AACTATCCCG TTACCCGTGG CTATAACGTA GGCTTGAGCT TCGGCTTCTA A
|
Protein sequence | MKASFYRFLQ TAFLGTVLLL WSLNASAQDR RLTGKITGVD GPVPGANVVL KGTQTGTATD ADGNYALNIR GASPVLVISA IGFKTLEVAV GNRTSVDVKL EDDATALSEV VVTGYSTENR RDVTGAVSTV KPAQLKVVPS TNVEQQLQGR VAGVTVITNG QPGTTSQVRV RGFGSFGGNQ PLYVVDGVPT QSIQYIAPDD IESTTVLKDA ASASVYGARA ASGVIVLTTK KGQRRAQKLS ISYDGLYGVT DPGHGQKILT PQEQADWTWQ ARKNDIFQAG GTVGPTSFTG IANGQYGSGQ TPVLPDYLLV GNQTGVSASA VNLEAERAKY NINPANGAIY NVIPANKAGT DWYGAITRVA PLMRHTLGFS GGTESSRFYL SLGMQKQAGI ITYNDFSRYT LRVNTEFDIT KKLRFGENVQ LAYVSATGLQ GSTGSTLGNG TNNNSSVAAD ENDILLAFRQ APIIPVYNAF GGYAGTAASG FNNARNPVAN RIGAKDNINY NLIAFGNAYL EYDVIPALTL RSSLGGTYFS NYNNAYNRSQ YENSENNTNY VYNESSNVGL AWTFTNTAQY KQKFGIHDVS VLAGIEALNT GSGRGISGSG LNPFTTDPNY VTIGTTTPGA TRSVNSYYGK GNNFYSLFAQ ARYTFNDKYI LTGVVRRDGS SQFGSQNRYG VFPAVSAAWR LSSEDFMKNL PWVSDLKVRG GYGLMGNSNY LSSTNQYNLF GSNAGNSYDI TGANTSVQAG YYRSQIGNAA AKWETSITSN VGIDGAFFNN KLEVVLDFWQ KDTKDLLYPL ALPGVVGVRA NAPYVNVASM RNKGIDLLIT TRGNVVGDLG YEVTAVGSIL DNKITAIAPS VPYFTANGQR LSSPVVRNQP GHDLSSFYGY NVIGLFNSKA EVDGAPTQPG AAPGRFRYQD INGDGKIDDA DRTFLGSPIP KFTGSLTLTL KYKGFDLNTQ VYASLGNKIF NNSKWYTDFY PSFPGAAVSQ RVKDSWLPTH TDTKVPIFEN TSNFSTNTES NSYYVENGSY GRMQYLTLGY TFPASVLNRA NLSRLRLSLS ATNLFTITKY SGLDPAVGGS ADQNFGIDIG NYPVTRGYNV GLSFGF
|
| |