Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1474 |
Symbol | |
ID | 6270952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1343054 |
End bp | 1346092 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641725574 |
Product | fibronectin type III domain protein |
Protein accession | YP_001880080 |
Protein GI | 187730038 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGGAAA AAGACATCAC CATTAAGGGC AAAACCACCT CGCAGTATCT GGCCTCGGTG GTGGTGGATA ACCTGCCGCC GCGCCCGTTT AATATCCGGA TGCGCAGAAT GACGCCGGAC AGCACCACAG ACCAGCTGCA GAACAAAACG CTCTGGTCGT CATACACCGA AATCATCGAT GTGAAACAGT GCTACCCGAA CACGGCACTG GTCGGCGTGC AGGTGGACTC GGAACAGTTC GGCAGCCAGC AGGTGAGCCG TAATTATCAT CTTCGCGGGC GCATTCTGCA GGTGCCGTCG AACTATAACC CGCAGACGCG GCAATACAGC GGTATCTGGG ACGGAACGTT TAAGCCAGCA TACAGCAACA ACATGGCCTG GTGTCTGTGG GATATGCTGA CCCATCCGCG CTACGGCATG GGGAAGCGTC TCGGTGCGGC GGATGTGGAC AAATGGGCGC TGTATGTCAT CGGCCAGAAT TGCGACCAGT CGGTGCCGGA TGGCTTTGGT GGCACGGAGC CGCGCATCAC CTGTAATGCC TACCTGACCA CACAGCGCAA GGCGTGGGAT GTTCTCAGTG ATTTCTGCTC GGCGATGCGC TGTATGCCGG TATGGAACGG GCAGACGCTG ACGTTCGTGC AGGACCGACC GTCGGATAAG GTGTGGACCT ATAACCGCAG TAATGTGGTG ATGCCGGATG ATGGCGCGCC GTTCCGCTAC AGCTTCAGTG CCCTGAAGGA CCGTCATAAT GCCGTTGAGG TGAACTGGAT TGACCCGGAT AACGGCTGGG AGACGGCGAC AGAGCTTGTT GAAGATACGC AGGCCATTGC CCGTTACGGT CGTAACGTCA CGAAGATGGA TGCCTTTGGC TGTACCAGCC GGGGGCAGGC GCACCGCGCC GGGCTGTGGC TGATTAAAAC GGAACTGCTG GAAACGCAGA CCGTGGACTT CAGCGTGGGT GCGGAAGGGC TTCGCCATGT ACCGGGGGAT GTCATTGAAA TCTGCGATGA TGACTATGCG GGTATCAGCA CCGGCGGGCG CGTGCTGGCG GTGAACAGCC AGACCCGGAC GCTGACGCTC GACCGTGAAA TCACGCTGCC ATCCTCCGGT ACCACGCTGA TAAGCCTGGT TGACGGTCAG GGTAATCCGG TCAGCGTGGA GGTCCAGTCC GTCACCGACG GCGTGAAGGT GAAAGTGAGC CGTGTTCCTG ACGGCGTTGC AGAATACAGC GTGTGGGGGC TGAAGCTGCC GACGCTGCGC CAGCGCCTGT TCCGCTGCGT GAGTATCCGT GAGAACGATG ACGGCACGTA TGCCATCACC GCCGTGCAGC ATGTACCGGA GAAAGAAGCC ATCGTGGATA ACGGGGCGCA CTTTGACGGT GACCAGAGCG GCACGGTGAA TGGTGTCACG CCGCCAGCAG TGCAGCACCT GACCGCCGAA GTCACTGCAG ACAGCGGGGA ATATCAGGTG CTGGCGCGAT GGGACACACC GAAGGTGGTG AAGGGCGTGA GCTTCCTGCT CCGTCTGACC GTAACAGCGG ACGACGGCAG TGAGCGACTG GTCAGCACGG CCCGGACGAC GGAAACCACA TACCGCTTCA CGCAACTGGC GCTGGGGAAC TACAGGCTGA CAGTCCGGGC GGTAAATGCC CGGGGGCAGC AGGGCGATCC GGCGTCGGTA TCGTTCCGGA TTGCCGCACC GGCAGCACCG TCGAGGATTG AGCTGACGCC GGGCTATTTT CAGATAACCG CCACGCCGCA TCTTGCTGTT TATGACCCGA CGGTACAGTT TGAGTTCTGG TTCTCGGAAA AGCGGATTAC CGATATCAGG CAGGTTGAAA CCACAGCCCG CTATCTTGGT ACGGCGCTGT ACTGGATAGC CGCCAGTATC AATATCAAAC CGGGCCATGA TTATTATTTT TACGTTCGCA GTGTGAACAC CGTTGGCAAA TCGGCATTCG TGGAGGCTGT CGGTCGGGCG AGCGATGATG CGGAAGGTTA CCTGGATTTT TTCAAAGGAG AAATCGGGAA AACACATCTG GCCCAGGAGT TGTGGACGCA GATTGATAAC GGTCAGCTTG CGCCGGACCT GGCTGAAATC AGGACGTCCA TTACGAATGT CAGCAATGAA ATCACGCAGA CCGTCAATAA AAAACTGGAA GACCAGAGTG CGGCAATCCA GCAGATACAG AAAGTTCAGG TTGATACAAA TAATAACCTG AACAGCATGT GGGCCGTGAA ACTGCAGCAG ATGAAGGACG GACGCCTTTA TATTGCGGGT ATCGGAGCCG GTATTGAGAA TACGCCAGCA GGTATGCAGA GTCAGGTGCT TCTGGCTGCT GACCGGATTG CGATGATTAA TCCTGCGAAT GGCAACACAA AGCCGATGTT TGTTGGTCAG GGCGATCAGA TATTCATGAA CGACGTGTTC CTGAAACGCC TGACGGCTCC GACCATTACC AGCGGCGGTA ATCCTCCGGC ATTTTCCCTG ACACCGGACG GGCGGCTGAC GGCGAAAAAT GCCGATATCA GCGGTAACGT GAATGCGAAC TCCGGGACGC TCAACAACGT CACGATTAAC GAGAACTGTC GGGTTCTGGG AAAATTGTCC GCCAACCAGA TTGAAGGCGA TCTCGTTAAA ACAGTGGGCA AAGCTTTCCC CCGGGACTCC CGTGCACCGG AACGGTGGCC ATCAGGGACC ATCACCGTCA GGGTTTATGA CGATCAGCCG TTTGACCGGC AGATTGTTAT TCCGGCGGTG GCATTCAGCG GCGCTAAACA TGAGCGGGAG AATAACGATA TTTATTCGTC ATGCCGCCTG ATAGTACGGA AAAACGGTGC TGAAATTTAT AACCGTACCG CGCTGGATAA TACGCTGATT TACAGTGGTG TTATTGATAT GCCAGCTGGT CGCGGCCACA TGACGCTGGA GTTTTCGGTG TCAGCATGGC TGGTGAATAA CTGGTATCCC ACAGCAAGTA TCAGCGATTT GCTGGTTGTG GTGATGAAGA AAGCCACCGC AGGCATCAGT ATCAGCTGA
|
Protein sequence | MTEKDITIKG KTTSQYLASV VVDNLPPRPF NIRMRRMTPD STTDQLQNKT LWSSYTEIID VKQCYPNTAL VGVQVDSEQF GSQQVSRNYH LRGRILQVPS NYNPQTRQYS GIWDGTFKPA YSNNMAWCLW DMLTHPRYGM GKRLGAADVD KWALYVIGQN CDQSVPDGFG GTEPRITCNA YLTTQRKAWD VLSDFCSAMR CMPVWNGQTL TFVQDRPSDK VWTYNRSNVV MPDDGAPFRY SFSALKDRHN AVEVNWIDPD NGWETATELV EDTQAIARYG RNVTKMDAFG CTSRGQAHRA GLWLIKTELL ETQTVDFSVG AEGLRHVPGD VIEICDDDYA GISTGGRVLA VNSQTRTLTL DREITLPSSG TTLISLVDGQ GNPVSVEVQS VTDGVKVKVS RVPDGVAEYS VWGLKLPTLR QRLFRCVSIR ENDDGTYAIT AVQHVPEKEA IVDNGAHFDG DQSGTVNGVT PPAVQHLTAE VTADSGEYQV LARWDTPKVV KGVSFLLRLT VTADDGSERL VSTARTTETT YRFTQLALGN YRLTVRAVNA RGQQGDPASV SFRIAAPAAP SRIELTPGYF QITATPHLAV YDPTVQFEFW FSEKRITDIR QVETTARYLG TALYWIAASI NIKPGHDYYF YVRSVNTVGK SAFVEAVGRA SDDAEGYLDF FKGEIGKTHL AQELWTQIDN GQLAPDLAEI RTSITNVSNE ITQTVNKKLE DQSAAIQQIQ KVQVDTNNNL NSMWAVKLQQ MKDGRLYIAG IGAGIENTPA GMQSQVLLAA DRIAMINPAN GNTKPMFVGQ GDQIFMNDVF LKRLTAPTIT SGGNPPAFSL TPDGRLTAKN ADISGNVNAN SGTLNNVTIN ENCRVLGKLS ANQIEGDLVK TVGKAFPRDS RAPERWPSGT ITVRVYDDQP FDRQIVIPAV AFSGAKHERE NNDIYSSCRL IVRKNGAEIY NRTALDNTLI YSGVIDMPAG RGHMTLEFSV SAWLVNNWYP TASISDLLVV VMKKATAGIS IS
|
| |