Gene SbBS512_E1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1474 
Symbol 
ID6270952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1343054 
End bp1346092 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content57% 
IMG OID641725574 
Productfibronectin type III domain protein 
Protein accessionYP_001880080 
Protein GI187730038 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGAAA AAGACATCAC CATTAAGGGC AAAACCACCT CGCAGTATCT GGCCTCGGTG 
GTGGTGGATA ACCTGCCGCC GCGCCCGTTT AATATCCGGA TGCGCAGAAT GACGCCGGAC
AGCACCACAG ACCAGCTGCA GAACAAAACG CTCTGGTCGT CATACACCGA AATCATCGAT
GTGAAACAGT GCTACCCGAA CACGGCACTG GTCGGCGTGC AGGTGGACTC GGAACAGTTC
GGCAGCCAGC AGGTGAGCCG TAATTATCAT CTTCGCGGGC GCATTCTGCA GGTGCCGTCG
AACTATAACC CGCAGACGCG GCAATACAGC GGTATCTGGG ACGGAACGTT TAAGCCAGCA
TACAGCAACA ACATGGCCTG GTGTCTGTGG GATATGCTGA CCCATCCGCG CTACGGCATG
GGGAAGCGTC TCGGTGCGGC GGATGTGGAC AAATGGGCGC TGTATGTCAT CGGCCAGAAT
TGCGACCAGT CGGTGCCGGA TGGCTTTGGT GGCACGGAGC CGCGCATCAC CTGTAATGCC
TACCTGACCA CACAGCGCAA GGCGTGGGAT GTTCTCAGTG ATTTCTGCTC GGCGATGCGC
TGTATGCCGG TATGGAACGG GCAGACGCTG ACGTTCGTGC AGGACCGACC GTCGGATAAG
GTGTGGACCT ATAACCGCAG TAATGTGGTG ATGCCGGATG ATGGCGCGCC GTTCCGCTAC
AGCTTCAGTG CCCTGAAGGA CCGTCATAAT GCCGTTGAGG TGAACTGGAT TGACCCGGAT
AACGGCTGGG AGACGGCGAC AGAGCTTGTT GAAGATACGC AGGCCATTGC CCGTTACGGT
CGTAACGTCA CGAAGATGGA TGCCTTTGGC TGTACCAGCC GGGGGCAGGC GCACCGCGCC
GGGCTGTGGC TGATTAAAAC GGAACTGCTG GAAACGCAGA CCGTGGACTT CAGCGTGGGT
GCGGAAGGGC TTCGCCATGT ACCGGGGGAT GTCATTGAAA TCTGCGATGA TGACTATGCG
GGTATCAGCA CCGGCGGGCG CGTGCTGGCG GTGAACAGCC AGACCCGGAC GCTGACGCTC
GACCGTGAAA TCACGCTGCC ATCCTCCGGT ACCACGCTGA TAAGCCTGGT TGACGGTCAG
GGTAATCCGG TCAGCGTGGA GGTCCAGTCC GTCACCGACG GCGTGAAGGT GAAAGTGAGC
CGTGTTCCTG ACGGCGTTGC AGAATACAGC GTGTGGGGGC TGAAGCTGCC GACGCTGCGC
CAGCGCCTGT TCCGCTGCGT GAGTATCCGT GAGAACGATG ACGGCACGTA TGCCATCACC
GCCGTGCAGC ATGTACCGGA GAAAGAAGCC ATCGTGGATA ACGGGGCGCA CTTTGACGGT
GACCAGAGCG GCACGGTGAA TGGTGTCACG CCGCCAGCAG TGCAGCACCT GACCGCCGAA
GTCACTGCAG ACAGCGGGGA ATATCAGGTG CTGGCGCGAT GGGACACACC GAAGGTGGTG
AAGGGCGTGA GCTTCCTGCT CCGTCTGACC GTAACAGCGG ACGACGGCAG TGAGCGACTG
GTCAGCACGG CCCGGACGAC GGAAACCACA TACCGCTTCA CGCAACTGGC GCTGGGGAAC
TACAGGCTGA CAGTCCGGGC GGTAAATGCC CGGGGGCAGC AGGGCGATCC GGCGTCGGTA
TCGTTCCGGA TTGCCGCACC GGCAGCACCG TCGAGGATTG AGCTGACGCC GGGCTATTTT
CAGATAACCG CCACGCCGCA TCTTGCTGTT TATGACCCGA CGGTACAGTT TGAGTTCTGG
TTCTCGGAAA AGCGGATTAC CGATATCAGG CAGGTTGAAA CCACAGCCCG CTATCTTGGT
ACGGCGCTGT ACTGGATAGC CGCCAGTATC AATATCAAAC CGGGCCATGA TTATTATTTT
TACGTTCGCA GTGTGAACAC CGTTGGCAAA TCGGCATTCG TGGAGGCTGT CGGTCGGGCG
AGCGATGATG CGGAAGGTTA CCTGGATTTT TTCAAAGGAG AAATCGGGAA AACACATCTG
GCCCAGGAGT TGTGGACGCA GATTGATAAC GGTCAGCTTG CGCCGGACCT GGCTGAAATC
AGGACGTCCA TTACGAATGT CAGCAATGAA ATCACGCAGA CCGTCAATAA AAAACTGGAA
GACCAGAGTG CGGCAATCCA GCAGATACAG AAAGTTCAGG TTGATACAAA TAATAACCTG
AACAGCATGT GGGCCGTGAA ACTGCAGCAG ATGAAGGACG GACGCCTTTA TATTGCGGGT
ATCGGAGCCG GTATTGAGAA TACGCCAGCA GGTATGCAGA GTCAGGTGCT TCTGGCTGCT
GACCGGATTG CGATGATTAA TCCTGCGAAT GGCAACACAA AGCCGATGTT TGTTGGTCAG
GGCGATCAGA TATTCATGAA CGACGTGTTC CTGAAACGCC TGACGGCTCC GACCATTACC
AGCGGCGGTA ATCCTCCGGC ATTTTCCCTG ACACCGGACG GGCGGCTGAC GGCGAAAAAT
GCCGATATCA GCGGTAACGT GAATGCGAAC TCCGGGACGC TCAACAACGT CACGATTAAC
GAGAACTGTC GGGTTCTGGG AAAATTGTCC GCCAACCAGA TTGAAGGCGA TCTCGTTAAA
ACAGTGGGCA AAGCTTTCCC CCGGGACTCC CGTGCACCGG AACGGTGGCC ATCAGGGACC
ATCACCGTCA GGGTTTATGA CGATCAGCCG TTTGACCGGC AGATTGTTAT TCCGGCGGTG
GCATTCAGCG GCGCTAAACA TGAGCGGGAG AATAACGATA TTTATTCGTC ATGCCGCCTG
ATAGTACGGA AAAACGGTGC TGAAATTTAT AACCGTACCG CGCTGGATAA TACGCTGATT
TACAGTGGTG TTATTGATAT GCCAGCTGGT CGCGGCCACA TGACGCTGGA GTTTTCGGTG
TCAGCATGGC TGGTGAATAA CTGGTATCCC ACAGCAAGTA TCAGCGATTT GCTGGTTGTG
GTGATGAAGA AAGCCACCGC AGGCATCAGT ATCAGCTGA
 
Protein sequence
MTEKDITIKG KTTSQYLASV VVDNLPPRPF NIRMRRMTPD STTDQLQNKT LWSSYTEIID 
VKQCYPNTAL VGVQVDSEQF GSQQVSRNYH LRGRILQVPS NYNPQTRQYS GIWDGTFKPA
YSNNMAWCLW DMLTHPRYGM GKRLGAADVD KWALYVIGQN CDQSVPDGFG GTEPRITCNA
YLTTQRKAWD VLSDFCSAMR CMPVWNGQTL TFVQDRPSDK VWTYNRSNVV MPDDGAPFRY
SFSALKDRHN AVEVNWIDPD NGWETATELV EDTQAIARYG RNVTKMDAFG CTSRGQAHRA
GLWLIKTELL ETQTVDFSVG AEGLRHVPGD VIEICDDDYA GISTGGRVLA VNSQTRTLTL
DREITLPSSG TTLISLVDGQ GNPVSVEVQS VTDGVKVKVS RVPDGVAEYS VWGLKLPTLR
QRLFRCVSIR ENDDGTYAIT AVQHVPEKEA IVDNGAHFDG DQSGTVNGVT PPAVQHLTAE
VTADSGEYQV LARWDTPKVV KGVSFLLRLT VTADDGSERL VSTARTTETT YRFTQLALGN
YRLTVRAVNA RGQQGDPASV SFRIAAPAAP SRIELTPGYF QITATPHLAV YDPTVQFEFW
FSEKRITDIR QVETTARYLG TALYWIAASI NIKPGHDYYF YVRSVNTVGK SAFVEAVGRA
SDDAEGYLDF FKGEIGKTHL AQELWTQIDN GQLAPDLAEI RTSITNVSNE ITQTVNKKLE
DQSAAIQQIQ KVQVDTNNNL NSMWAVKLQQ MKDGRLYIAG IGAGIENTPA GMQSQVLLAA
DRIAMINPAN GNTKPMFVGQ GDQIFMNDVF LKRLTAPTIT SGGNPPAFSL TPDGRLTAKN
ADISGNVNAN SGTLNNVTIN ENCRVLGKLS ANQIEGDLVK TVGKAFPRDS RAPERWPSGT
ITVRVYDDQP FDRQIVIPAV AFSGAKHERE NNDIYSSCRL IVRKNGAEIY NRTALDNTLI
YSGVIDMPAG RGHMTLEFSV SAWLVNNWYP TASISDLLVV VMKKATAGIS IS