Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1689 |
Symbol | |
ID | 6272386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1537287 |
End bp | 1538252 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641725770 |
Product | hypothetical protein |
Protein accession | YP_001880268 |
Protein GI | 187733577 |
COG category | [S] Function unknown |
COG ID | [COG3781] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00204582 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGTAA TCACGTTTAA TAGGGCGACT TTTCCACGCC TGAAGATAAC CATGATTGTT CGTCCACAAC AACACTGGCT GCGCCGTATT TTTGTCTGGC ACGGCTCAGT ATTATCCAAG ATATCCTCGC GCTTACTACT CAATTTTCTC TTTTCTATCG CTGTTATTTT CATGCTGCCC TGGTACACGC ATCTGGGCAT CAAATTCACC CTCGCACCGT TCAGCATTCT CGGTGTCGCC ATCGCCATTT TTCTTGGTTT TCGTAATAAT GCCGGGTACG CCCGTTACGT TGAAGCGCGA AAACTTTGGG GACAGTTGAT GATTGCCTCA CGGTCGTTAC TACGTGAGGT AAAAACGACA TTGCCGGATT CGGCAAGTGT AAGGGAGTTT GCCAGGCTGC AAATCGCCTT CGCCCACTGT TTACGCATGA CATTACGCAA ACAGCCACAG GCGGAAGTGC TGGCTCATTA TCTCAAGACT GAAGATCTTC AGCCTGTACT GGCTTCGAAC TCTCCAGCTA ACCGTATCTT GTTAATAATG GGGGAGTGGT TGGCGGTTCA GCGCCGCAAT GGGCAGCTTT CAGATATCCT GTTTATTAGC CTCAACGATC GGCTTAATGA TATTTCAGCG GTCCTGGCAG GATGCGAGCG CATTGCCTAT ACGCCAATTC CCTTTGCCTA CACCCTGATT TTGCATCGTA CTGTTTATCT GTTTTGTATC ATGCTGCCGT TCGCGCTGGT CGTGGACCTG CATTACATGA CGCCTTTTAT CTCTGTGCTG ATTTCCTACA CTTTTATTTC GCTGGATTGT CTGGCGGAAG AACTGGAAGA ACCGTTCGGT ACTGAAAACA ATGATTTACC GCTGGATGCC ATCTGCAACG CTATTGAAAT TGACCTCTTA CAGATGAACG ATGAAGCCGA AATTCCAGCA AAAATTCTTC CCGATCGTCA TTACCAGCTG ACGTGA
|
Protein sequence | MIVITFNRAT FPRLKITMIV RPQQHWLRRI FVWHGSVLSK ISSRLLLNFL FSIAVIFMLP WYTHLGIKFT LAPFSILGVA IAIFLGFRNN AGYARYVEAR KLWGQLMIAS RSLLREVKTT LPDSASVREF ARLQIAFAHC LRMTLRKQPQ AEVLAHYLKT EDLQPVLASN SPANRILLIM GEWLAVQRRN GQLSDILFIS LNDRLNDISA VLAGCERIAY TPIPFAYTLI LHRTVYLFCI MLPFALVVDL HYMTPFISVL ISYTFISLDC LAEELEEPFG TENNDLPLDA ICNAIEIDLL QMNDEAEIPA KILPDRHYQL T
|
| |