Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1266 |
Symbol | |
ID | 6270194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1159112 |
End bp | 1160977 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725388 |
Product | YjhS |
Protein accession | YP_001879899 |
Protein GI | 187732889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0323184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGATCT TGCGGAAAAG CTGACACATA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT TATACTCTGA TGCAGGCGAT TGCAGCAGAA GGTGCAGTAA TCAGCGCCAC CAGCAACCCG GAGTATTACT TTGTTGTGGT TCTGGCAGGG CAGTCAAACG GCATGTCGTA TGGTGAAGGC CTTCCGCTGC CGGGGACATA TGACCGTCCG GACCCGCGTA TTAAGCAACT GGCGCGTCGC AGTACGGTGA CACCGGGCGG TGCAGCATGC AAATATAACG ACATCATTCC GGCGGACCAT TGTCTGCATG ATGTGCAGGA CATGAGCCGT CTTAACCATC CGAAAGCGGA CCTGTCAAAG GGGCAGTACG GAACCGTGGG GCAGGGGCTG CATATCGCCA AAAAACTGCT GCCGTTTATA CCGGCGAATG CGGGCATTCT GCTGGTTCCG TGCTGTCGTG GTGGTTCAGC GTTCACCACC GGAGCTGATG GCACATACAG TGACGCGAGT GGTGCCTCGG AGAATTCAAC CCGCTGGGGT GTGGACAAGC CGCTGTATAA GGACCTTATC GGTCGAACAA AAGCGGCACT GGAGAAGAAC CCGAAAAATG TGCTGTTTGC CGTGGTGTGG ATGCAGGGGG AATTTGATTT TGGCGGTACG CCGGCAAATC ATGCCGCACA GTTTGGTGCG CTGGTTGATA AATTCCGTGC AGACCTGGCG GATATGGCAG GCCAGTGCGT CGGTGGCTCT GCTGGCGGTG TTCCCTGGAT ATGCGGGGAC ACGACGTATT TCTGGAAGCA GAAGAACGAA TCCACGTACC AGACGGTGTA CGGCAGCTAT AAAAATAAAA CGGAAAAGAA TATCCATTTC GTACCGTTCA TGACCGATGA GAACGGGGTG AATGTGCCGA CGAACAAACC GGAAGAAGAC CCGGACATTC CGGGTATCGG TTATTACGGT TCGAAATGGC GTGACAGCTC AGCCACCTGG ACGTCACAGG ACAGGGCGAG CCATTTCAGT TCATGGGCTC GCCGTGGGAT TATTTCCGAC CGTCTGGCAA CGGCGATTCT GAGCTGCGCG GGTAAGTCTT CTGCGTTTGT TAATGGTACT GCCGGGGTGG TTGTTCCAGA CAGGCCGGTT ACCACCTCAG AGTCTGTAAT TTTTTACGAT GCCAAAAAAG CTACAGACAA TCAGCTGAAA CCTTATGGCT GGGACGGTAT GTATGGCAGA CGCACACTGG TTGATGACAG CGGCAATAAA GCTCTGCGAA TTGAGAAAAA TAACAGCGCG AAATCCTGGT CAATGTACTG TGATATTGCT GCAGACAAGG CAAAACTTTT ACTGGAAAAA GGCGGGGAAA TTGCTGTCCG GTTTAAAATC CCCGAAAACG TCAATCTTGA GACAACCAGA AACAAGTATG CCTTTGGTTT GTACTGGCGA ATAGCGGAAT GGCCGGGTGA GGGTGGTGAA GGCCATCTGA GTTCTTTCTT TGTCCAGACA GATAAAGCCA GTATTAATAT TGCATACCAT CATACAGTTA ATCAACAAAA AGAACTTGGC ACGTTTGGCG CATTCGACCA TGACTGGCAT ACGCTTGCAT TTAAATTTAA GGGCAGTAAC AGCATTAATG TTACTCCGGT GCTTGATGGT GTGGATGGAC AGGCGTTTGA CCTGGTGAAA TGGGCCAATA CTGCTAATGG ACTCAACAGG TTTGTCATTA CGGATATTAC AGGTAGTGCA GAAACCTACC CTGTACTTAT TGATACGGTG GAAGTTAAAG CAAACAAAGC TGGAGCAGCC GCATAA
|
Protein sequence | MAFKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQAIAAE GAVISATSNP EYYFVVVLAG QSNGMSYGEG LPLPGTYDRP DPRIKQLARR STVTPGGAAC KYNDIIPADH CLHDVQDMSR LNHPKADLSK GQYGTVGQGL HIAKKLLPFI PANAGILLVP CCRGGSAFTT GADGTYSDAS GASENSTRWG VDKPLYKDLI GRTKAALEKN PKNVLFAVVW MQGEFDFGGT PANHAAQFGA LVDKFRADLA DMAGQCVGGS AGGVPWICGD TTYFWKQKNE STYQTVYGSY KNKTEKNIHF VPFMTDENGV NVPTNKPEED PDIPGIGYYG SKWRDSSATW TSQDRASHFS SWARRGIISD RLATAILSCA GKSSAFVNGT AGVVVPDRPV TTSESVIFYD AKKATDNQLK PYGWDGMYGR RTLVDDSGNK ALRIEKNNSA KSWSMYCDIA ADKAKLLLEK GGEIAVRFKI PENVNLETTR NKYAFGLYWR IAEWPGEGGE GHLSSFFVQT DKASINIAYH HTVNQQKELG TFGAFDHDWH TLAFKFKGSN SINVTPVLDG VDGQAFDLVK WANTANGLNR FVITDITGSA ETYPVLIDTV EVKANKAGAA A
|
| |