Gene Information |
Locus tag | SbBS512_E2192 |
Symbol | |
ID | 6271489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1992573 |
End bp | 1993961 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726217 |
Product | YjhS |
Protein accession | YP_001880705 |
Protein GI | 187733233 |
COG category | |
COG ID | |
TIGRFAM ID | |
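The length fields in this table can be cross-checked against the coordinates. The sketch below is only an illustration, not part of the record: it assumes Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1992573
END_BP = 1993961


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1389 462
```

Under these assumptions the computed values agree with the listed Gene Length (1389 bp) and Protein Length (462 aa).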
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00000609291 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGATCT TGCGGAAAAG CTGACACATA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT TATACTCTGA TGCAGGCGAT TGCAGCAGAA GGTGCAGTAA TCAGCGCCAC CAGCGACCCG GAGTATTACT TTGTTGTGGT TCTGGCAGGG CAGTCAAACG GCATGTCGTA TGGTGAAGGC CTTCCGCTGC CGGGGACATA TGACCGTCCG GACCCGCGTA TTAAGCAACT GGCGCGTCGC AGTACGGTGA CACCGGGCGG TGCAGCATGC AAATATAACG ACATCATTCC GGCGGACCAT TGTCTGCATG ATGTGCAGGA CATGAGCCGT CTTAACCATC CGAAAGCGGA CCTGTCAAAG GGGCAGTACG GAACCGTGGG GCAGGGGCTG CATATCGCCA AAAAACTGCT GCCGTTTATA CCGGCGAATG CGGGCATTCT GCTGGTTCCA TGCTGTCGTG GTGGTTCAGC GTTCACCACC GGAGCTGATG GCACATACAG TGACGCGAGT GGTGCCTCGG AGAATTCAAC CCGCTGGGGT GTGGACAAGC CGCTGTATAA GGACCTTATC GGTCGAACAA AAGCGGCACT GGAGAAGAAC CCGAAAAATG TGCTGTTTGC CGTGGTGTGG ATGCAGGGGG AATTTGATTT TGGCGGTACG CCGGCAAATC ATGCCGCACA GTTTGGTGCG CTGGTTGATA AATTCCGTGC AGACCTGGCG GATATGGCAG GCCAGTGCGT CGGTGGCTCT GCTGGCGGTG TTCCCTGGAT ATGCGGGGAC ACGACGTATT TCTGGAAGCA GAAGAACGAA TCCACGTACC AGACGGTGTA CGGCAGCTAT AAAAATAAAA CGGAAAAGAA TATCCATTTC GTACCGTTCA TGACCGATGA GAACGGGGTG AATGTGCCGA CGAACAAACC GGAAGAAGAC CCGGACATTC CGGGTATCGG TTATTACGGT TCGAAATGGC GTGACAGCTC AGCCACCTGG ACGTCACAGG ACAGGGCGAG CCATTTCAGT TCATGGGCTC GCCGTGGGAT TATTTCCGAC CGTCTGGCAA CGGCGATTCT GAGCTGCGCG GGTAAGTCTT CTGCGTTTGT TAATGGTACT GCCGGGGTGG TTGTTCCAGA CAGGCCGGTT ACCACCTCAG AGTCTGTAAT TTTTTACGAT GCCAAAAAAG CTACAGACAA TCAGCTGAAA CCTTATGGCT GGGACGGTAT GTATGGCAGA CGCACACTGG TTGATGACAG CGGCAATAAA GCTCTGCGAA TTGAGAAAAA TAACAGCTCG AAATCCTGGT CAATGTACTG TGAGGTGTAC TGGCAATAG
|
Protein sequence | MAFKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQAIAAE GAVISATSDP EYYFVVVLAG QSNGMSYGEG LPLPGTYDRP DPRIKQLARR STVTPGGAAC KYNDIIPADH CLHDVQDMSR LNHPKADLSK GQYGTVGQGL HIAKKLLPFI PANAGILLVP CCRGGSAFTT GADGTYSDAS GASENSTRWG VDKPLYKDLI GRTKAALEKN PKNVLFAVVW MQGEFDFGGT PANHAAQFGA LVDKFRADLA DMAGQCVGGS AGGVPWICGD TTYFWKQKNE STYQTVYGSY KNKTEKNIHF VPFMTDENGV NVPTNKPEED PDIPGIGYYG SKWRDSSATW TSQDRASHFS SWARRGIISD RLATAILSCA GKSSAFVNGT AGVVVPDRPV TTSESVIFYD AKKATDNQLK PYGWDGMYGR RTLVDDSGNK ALRIEKNNSS KSWSMYCEVY WQ
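The gene sequence above starts with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the protein sequence could be re-derived from it follows. It assumes the codon assignments of translation table 11, which match the standard code for elongation codons, and it does not handle alternative start codons (not needed here, since the first codon is ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (bases T, C, A, G);
# translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final in-frame codons of the gene sequence.
print(translate("ATGGCTTTTA AACACTATGA TGTTGTCAGG"))  # MAFKHYDVVR
print(translate("GAGGTGTAC TGGCAATAG"))               # EVYWQ
```

Both fragments agree with the start (MAFKHYDVVR) and end (EVYWQ) of the listed protein sequence. Passing the full gene sequence to translate would allow the same comparison over all 462 residues.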