Gene Information |
Locus tag | SbBS512_E2717 |
Symbol | |
ID | 6269636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2516094 |
End bp | 2518736 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641726681 |
Product | fimbrial usher family protein |
Protein accession | YP_001881161 |
Protein GI | 187731011 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
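The length fields in this record follow from its coordinates. The sketch below is a minimal check, not the database's own method. It assumes the start and end positions are 1-based and inclusive, which is consistent with the reported 2643 bp. It also assumes the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2516094, 2518736)
print(length)                     # 2643
print(protein_length_aa(length))  # 880
```

The strand does not affect this arithmetic: a gene on the minus strand spans the same number of bases between its two coordinates.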
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.405081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGACC ATTCTCTTTT TCGATTACGG ATTCTTCCGT GGTGCATTGC GCTGGCAATG TCAGGGAGTT ATAGCAGTGT CTGGGCTGAA GACGACATTC AGTTTGATTC CCGTTTTCTG GAATTAAAAG GCGACACAAA AATTGATCTG AAGCGTTTTT CCAGCCAGGG ATATGTTGAG CCCGGAAAAT ACAATTTACA GGTTCAACTA AATAAACAGC CATTGGCGGA AGAGTACGAT ATTTACTGGT ATGCTGGTGA AGATGACGCG AGCAAAAGCT ATGCTTGTCT GACACCGGAA CTGGTAGCGC AGTTTGGTTT AAAAGAAGAC GTGGCGAAAA ATCTGCAATG GAGCCACGAT GCTAAATGCC TGAAATCCGG TCAACTGGAA GGCATGGAAA TTAAGGCTGA TTTAAGCCAG TCCGCATTAG TCATTTCACT GCCACAGGCT TACCTCGAAT ATACTTATCC CGACTGGGAT CCGCCTTCAC GTTGGGATGA CGGCATCTCC GGGATCGTCG CGGACTACAG CATCAACGCA CAAACCCGGC ACGAAGAAAA TGGCGGTGAT GATAGTAACG AGATCAGCGG CAACGGGACG GTCGGGGTTA ACCTGGGGCC GTGGCGTATG CGTGCTGACT GGCAGACTAA CTATCAACAT ACTCGCAGTA ATGATGACGA TGAATTCAGC GGCGATGAAA CTCAAAAAAA ATGGGAGTGG AGTCGCTACT ATGCCTGGCG GGCGTTACCA TCATTAAAAG CCAAACTGGC GCTGGGCGAG GATTACCTCA GATCCGATAT TTTTGATGGT TTTAACTATG TTGGTGGCAG TGTCAGTACT GACGATCAAA TGTTGCCTCC CAATCTGCGC GGCTACGCGC CAGACATTTC CGGCGTGGCA CACACCACAG CAAAAGTGAC CGTCAGCCAG ATGGGGCGTG TGATTTACGA AACGCAGGTT CCGGCTGGAC CGTTTCGTAT TCAGGATCTT GGTGATTCTG TCTCCGGTAC GTTGCATATT CGCATTGAAG AACAGAACGG CCAGGTGCAG GAATATGACA TCAGCACCGC CTCGATGCCA TACCTCACTC GTCCAGGTCA GATTCGCTAT AAGATCATGA TGGGCCGTCC GCAAGAGTGG GGACACCATG TCGAGGGTGA ATTTTTTTCT GGTGCTGAAG CTTCCTGGGG GATCGCTAAC GGCTGGTCGT TATATGGCGG CGCACTGGGA GATGAAAACT ATCAGTCTGC GGCGCTTGGC GTCGGTCGCG ATTTGTCTAC ATTCGGCGCG GTCGCGTTTG ATGTTACTCA CTCGCACACC AAACTGGATA AAGACACCGC TTATGGCAAA GGTTCGCTGG ACGGTAACTC CTTCCGTGTG AGTTATTCCA AAGACTTTGA CCAGCTCAAC AGCCGCGTTA CTTTTGCTGG ATATCGCTTC TCGGAAGAGA ACTTTATGAC CATGAGCGAG TATCTGGATG CCAGTGACAG CGGAATGGTA CGCACGGGCA ACGACAAAGA GATGTACACC GCCACTTATA ACCAGAACTT CCGCGATGCG GGTGTTTCGG TTTATCTCAA CTATACCCGC CATACCTACT GGGATCGCGA GGAGCAGATA AACTACAACA TCATGCTCTC GCACTATTTC AATATGGGCA GTATTCGTAA TGTCAGCATC TCGATGACTG GCTACCGTTA CGAGTATGAC AACCAGGCCG ACAAAGGCAT GTACATTTCG CTCAGTATGC CGTGGGGCGA CAACAGTACC GTTAGCTATA ATGGTAACTA TGGCAGTGGG 
ACGGACAGCA GTCAGGTCGG TTATTTCAGC CGTGTCGATG ACGCGACTCA CTATCAGTTG AACGTCGGCA CCAGTGACAA ACACACCAGC GTTGACGGCT ATTACAGCCA TGATGGTTCG CTGGCGCAGG TTGACCTCAG TGCGAACTAC CATGAAGGGC AATACACCTC TGCGGGCTTG TCGTTACAGG GCGGCGCAAC GCTTACTGCC CACGGTGGCG CACTTCACCG TACCCAGAAT ATGGGCGGGA CACGCTTGTT GATTGATGCC GATGGCGTTG CCGATGTTCC GGTGGAAGGT AACGGGGCTG CTGTTTATAC CAATATGTTT GGTAAAGCCG TCGTTTCTGA CGTCAATAAC TATTACCGCA ATCAGGCGTA TATCGACCTC AACAGATTGC CTGAAAACGC TGAAGCAACC CAGTCGGTGG TGCAAGCCAC GCTAACTGAA GGTGCCATTG GCTACCGTAA ATTTGCTGTC ATTAGCGGAC AAAAAGCGAT GGCGGTGCTG CGCCTGAGCG ACGGCAGCCA TCCTCCGTTT GGCGCAGAAG TAAAAAATGA TAACGAGCAG ACAGTGGGCC TTGTCGATGA TGACGGCAAT GTTTATCTGG CTGGGGTGAA ACCTGGCGAA CACATGAGTG TGTTCTGGAG TGGTGTTGCG CATTGCGATA TCAACCTGCC GGACCCGCTG CCTGCCGATC TGTTTAACGG CTTGTTACTG CCATGCCAGC ATAAAGGCAA TGTAGCACCT ATCACTTCGC CGGCGGTCAA ACCGGCGATT CAGGAACAGA CACAGCGGGT GACGCCAACG GAACCTCCGA CTTCAATTTC AGTAAACCAG TAA
|
Protein sequence | MPDHSLFRLR ILPWCIALAM SGSYSSVWAE DDIQFDSRFL ELKGDTKIDL KRFSSQGYVE PGKYNLQVQL NKQPLAEEYD IYWYAGEDDA SKSYACLTPE LVAQFGLKED VAKNLQWSHD AKCLKSGQLE GMEIKADLSQ SALVISLPQA YLEYTYPDWD PPSRWDDGIS GIVADYSINA QTRHEENGGD DSNEISGNGT VGVNLGPWRM RADWQTNYQH TRSNDDDEFS GDETQKKWEW SRYYAWRALP SLKAKLALGE DYLRSDIFDG FNYVGGSVST DDQMLPPNLR GYAPDISGVA HTTAKVTVSQ MGRVIYETQV PAGPFRIQDL GDSVSGTLHI RIEEQNGQVQ EYDISTASMP YLTRPGQIRY KIMMGRPQEW GHHVEGEFFS GAEASWGIAN GWSLYGGALG DENYQSAALG VGRDLSTFGA VAFDVTHSHT KLDKDTAYGK GSLDGNSFRV SYSKDFDQLN SRVTFAGYRF SEENFMTMSE YLDASDSGMV RTGNDKEMYT ATYNQNFRDA GVSVYLNYTR HTYWDREEQI NYNIMLSHYF NMGSIRNVSI SMTGYRYEYD NQADKGMYIS LSMPWGDNST VSYNGNYGSG TDSSQVGYFS RVDDATHYQL NVGTSDKHTS VDGYYSHDGS LAQVDLSANY HEGQYTSAGL SLQGGATLTA HGGALHRTQN MGGTRLLIDA DGVADVPVEG NGAAVYTNMF GKAVVSDVNN YYRNQAYIDL NRLPENAEAT QSVVQATLTE GAIGYRKFAV ISGQKAMAVL RLSDGSHPPF GAEVKNDNEQ TVGLVDDDGN VYLAGVKPGE HMSVFWSGVA HCDINLPDPL PADLFNGLLL PCQHKGNVAP ITSPAVKPAI QEQTQRVTPT EPPTSISVNQ
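The relationship between the gene and protein sequences can be spot-checked on short excerpts. The sketch below is a minimal illustration, not the pipeline used to build this record. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). It uses the codon assignments shared by translation tables 1 and 11, and it does not model the alternative start codons that table 11 permits. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; tables 1 and 11 share these.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
head = "ATGCCTGACC ATTCTCTTTT TCGATTACGG"
tail = "GTAAACCAGTAA"
print(translate(head))  # MPDHSLFRLR
print(translate(tail))  # VNQ
print(round(gc_fraction(head), 3))  # 0.433
```

The two translated excerpts match the first ten and last three residues of the protein sequence above. The GC value printed here applies only to the 30-base excerpt, not to the 51% reported for the whole gene.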
|
| |