Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4334 |
Symbol | |
ID | 6271988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4048619 |
End bp | 4050049 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641728142 |
Product | hypothetical protein |
Protein accession | YP_001882557 |
Protein GI | 187731290 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.478027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACGTA AGTCAGCTAC AGGTGTTATT GTTGCGTTAG CCGTAATCTG GGGTGGTGGC ACATGGTACA CAGGTACGCA AATTCAGCCT GGTGTCGAAA AATTTATTAA AGATTTTAAC GATGCTAAAA AGAAAGGTGA ACATGCCTAC GATATGACGT TAAGTTATAA AAACTTTGAC AAAGGCTTTT TTAATTCCCA TTTTCAAATG CAAATTTCAT TTGATAACGG TGCACCCGAT CTCAATATCA AGCCAGGCCA GAAGGTTGCA TTTGATGTGG ATGTTGAGCA CGGTCCGTTG CCCATCACAA TGTTAATGCA TGGTAATGTC ATCCCAGCAC TGGCAGCGGC AAAAGTGAAC TTAGTGAATA ATGAACTGAC ACAACCGCTA TTTATCGCCG CGAAAAATAA ATCGCCCGTG GAAGCGACAT TGCGATTCGC GTTTGGTGGC TCATTCTCTA CGACATTAGA TGTTGCCCCT GCAGAGTATG GAAAGTTTTC TTTTGGTGAG GGCCAGTTTA CTTTTAATGG TGATGGTAGT TCATTGTCTA ACCTGGATAT TGAAGGCAAA GTCGAAGATA TTGTTCTGCA ATTATCACCA ATGAACAAAG TAACGGCAAA AAGTTTTACC ATTGATTCTC TGACACGATT AGAAGAAAAG AAATTTCCGG TTGGTGAAAG CGAGTCGAAA TTTAATCAGG TTAATATTAT CAATCAGGGG GAAGACGTTG CCCAAATCGA TGCTTTCGTT GCAAAAACCA GGCTGGATCG CGTTAAAGAC AAAGATTATA TCAATGTCAA TCTGACCTAC GAACTTGATA AGTTAACAAA AGGGAATCAG CAACTCGGTA GTGGTGAGTG GTCATTGATT GCTGAATCTA TTGATCCCTC AGCAGTGCGC CAATTTATCA TCCAGTATAA CATTGCGATG CAGAAGCAGC TTGCTGCACA CCCTGAGTTA GCAAACGATG AAGTTGCTCT GCAAGAAGTG AATGCTGCAT TGTTCAAAGA GTATTTACCG TTATTACAAC AAAGTGAGCC GACCATTAAA CAACCGGTAA GATGGAAGAA CGCACTCGGC GAACTAAATG CCAATCTGGA TATCAGTATT GCCGACCCAG CCAAATCTTC ATCATCCACA AACAAAGATA TCAAATCGCT CAATTTTGAT GTGAAGTTAC CGCTTAATGT CGTCACAGAA ACCGCAAAAC AGCTTAATTT ATCTGAAGGA ATGGATGCGG AAAAAGCGCA AAAGCGGGCT GATAAACAAA TCAGCGGGAT GATGACCTTA GGTCAGATGT TTCAGTTAAT CACGATTGAC AACAATACCG CCCCGCTGCA ACTGCGTTAT ACACCGGGTA AAGTTGTTTT TAACGGACAG GAGATGAGCG AAGAAGAATT TATGTCTCGT GCCGGACGTT TTGTTCATTA A
|
Protein sequence | MIRKSATGVI VALAVIWGGG TWYTGTQIQP GVEKFIKDFN DAKKKGEHAY DMTLSYKNFD KGFFNSHFQM QISFDNGAPD LNIKPGQKVA FDVDVEHGPL PITMLMHGNV IPALAAAKVN LVNNELTQPL FIAAKNKSPV EATLRFAFGG SFSTTLDVAP AEYGKFSFGE GQFTFNGDGS SLSNLDIEGK VEDIVLQLSP MNKVTAKSFT IDSLTRLEEK KFPVGESESK FNQVNIINQG EDVAQIDAFV AKTRLDRVKD KDYINVNLTY ELDKLTKGNQ QLGSGEWSLI AESIDPSAVR QFIIQYNIAM QKQLAAHPEL ANDEVALQEV NAALFKEYLP LLQQSEPTIK QPVRWKNALG ELNANLDISI ADPAKSSSST NKDIKSLNFD VKLPLNVVTE TAKQLNLSEG MDAEKAQKRA DKQISGMMTL GQMFQLITID NNTAPLQLRY TPGKVVFNGQ EMSEEEFMSR AGRFVH
|
| |