Gene Information |
Locus tag | SbBS512_E1162 |
Symbol | |
ID | 6270752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1059091 |
End bp | 1061031 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725295 |
Product | hypothetical protein |
Protein accession | YP_001879809 |
Protein GI | 187732120 |
COG category | [R] General function prediction only |
COG ID | [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains |
TIGRFAM ID | |
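The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. The function names here are hypothetical and not part of any database API. The sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, both of which are consistent with the values in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(1059091, 1061031)
print(length)                     # 1941
print(protein_length_aa(length))  # 646
```

Both results agree with the Gene Length and Protein Length fields listed in the record.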
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACCCA CTTTATATAC TGCTACTGGT GAGTGCGTTA CGCCAGGCCG TGAACTGGGC AAAGGTGGCG AAGGCGCGGT TTATGATATC AATGAGTTTG TCGATAGCGT CGCCAAGATT TATCACACGC CGCCACCCGC CTTAAAACAG GACAAACTTG CCTTTATGGC TGCGACAGCT GACGCGCAGT TGTTGAATTA TGTCGCCTGG CCGCAGGCAA CGCTTCACGG TGGGCGAGGC GGAAAGGTTA TCGGTTTTAT GATGCCAAAA GTTTCTGGTA AAGAACCGAT TCATATGATC TATAGCCCGG CACATCGTCG CCAGAGTTAC CCTCATTGTG CGTGGGATTT TCTACTCTAT GTTGCGCGCA ATATTGCTTC ATCTTTTGCT ACGGTTCACG AGCACGGGCA CGTCGTGGGG GACGTAAACC AGAACAGCTT TATGGTAGGT CGCGACAGCA AAGTGGTGTT GATCGATAGT GACTCCTTTC AGATTAACGC CAATGGCACA CTGCATTTAT GCGAAGTCGG CGTGTCGCAT TTTACGCCGC CAGAGCTACA AACCTTGCCA TCATTTGTCG GTTTTGAACG TACCGCGAAT CACGATAATT TTGGCCTTGC GTTGCTGATT TTTCACGTCT TGTTTGGTGG GCGGCATCCT TATTCTGGTG TGCCGCTTAT CTCTGATGCG GGTAATGCGC TGGAGACGGA TATTGCCCAT TTCCGTTATG CCTACGCGTC AGACAACCAG CGACGTGGTT TAAAACCGCC GCCACGATCG ATTCCGCTGT CGATGTTACC GGGCGATGTT GAAGCCATGT TTCAGCAGGC GTTTACGGAA AGTGGCGTAG CAACCGGGCG TCCGACGGCT AAAGCGTGGG TAGCAGCACT GGATTCTCTA CGCCAACAGT TAAAGAAATG TACCGTTTCG GCAATGCATG TTTATCCCGC TCATTTGACC GACTGCCCGT GGTGTACGCT GGATAATCAA GGCGTTATCT ATTTTATTGA TCTCGGCGAA GAGGTCATTA CCACCGGCGG TGATTTTGTG CTGGCGAAAG TCTGGGCGAT GGTGATGGCG TCAGTAGCGC CGCCAGCATT GCAACTGCCA TTACCCGATC ATTTCCAACC GACTGGCAGG CCGCTTCCTT TAGGCCTGTT ACGGCGCGAA TACATCATTC TGATTGAGAT CGCACTGTCA GCGTTATCGC TGCTGCTTTG CGGCCTTCAG GCAGAACCGC GTTATATTAT TTTGGTTCCT GTGCTGGCGG CTATCTGGAT TATTGGCAGT CTGACAAGCA AAGCGTACAA AGCAGAAGTT CAGCAACGCC GTGAGGCATT TAATCGCGCG AAAATGGACT ATGACCATTT AGTCAGCCAG AGCCAACAGT TGGGCGGGCT GGAAGGTTTT ATCGCCAAAC GGACGATGCT CGAAAAAATG AAGGACGAAA TTCTCGGGTT ACCGGAAGAA GAAAAACGTG CTCTGGCAGC ACTTCACGAC ACCGCAAGGG AACGGCAGAA GCAGAAGTTT CTGGAGGGAT TTTTTATTGA TGTTGCCTCT ATTCCCGGCG TTGGCCCTGC GCGTAAAGCG GCGTTACGGT CCTTTGGTAT TGAAACAGCA GCGGATGTTA CCCGTCGTGG GGTTAAGCAA GTTAAAGGGT TTGGTGATCA TCTGACCCAG GCGGTCATCG ACTGGAAAGC GAGCTGTGAA CGCCGTTTTG TGTTCAGGCC GAACGAAGCG GTAACGCCTG CAGAAAGACA AGCGGTAATG GCGAAAATGG CCGCCAAACG ACATCGGCTG 
GAATTGGCGT TGACTGTCGG CGCGACAGAG TTGCAGCGAT TCCGCCTTCA TGCTCCAGCA CGGACCATGC CGTTGATGGA ACCGTTACGT CAGGCGGCAG AAAAACTGGC TCAGGCGCAG GCAGATTTAA GTCGCTGCTG A
Protein sequence | MKPTLYTATG ECVTPGRELG KGGEGAVYDI NEFVDSVAKI YHTPPPALKQ DKLAFMAATA DAQLLNYVAW PQATLHGGRG GKVIGFMMPK VSGKEPIHMI YSPAHRRQSY PHCAWDFLLY VARNIASSFA TVHEHGHVVG DVNQNSFMVG RDSKVVLIDS DSFQINANGT LHLCEVGVSH FTPPELQTLP SFVGFERTAN HDNFGLALLI FHVLFGGRHP YSGVPLISDA GNALETDIAH FRYAYASDNQ RRGLKPPPRS IPLSMLPGDV EAMFQQAFTE SGVATGRPTA KAWVAALDSL RQQLKKCTVS AMHVYPAHLT DCPWCTLDNQ GVIYFIDLGE EVITTGGDFV LAKVWAMVMA SVAPPALQLP LPDHFQPTGR PLPLGLLRRE YIILIEIALS ALSLLLCGLQ AEPRYIILVP VLAAIWIIGS LTSKAYKAEV QQRREAFNRA KMDYDHLVSQ SQQLGGLEGF IAKRTMLEKM KDEILGLPEE EKRALAALHD TARERQKQKF LEGFFIDVAS IPGVGPARKA ALRSFGIETA ADVTRRGVKQ VKGFGDHLTQ AVIDWKASCE RRFVFRPNEA VTPAERQAVM AKMAAKRHRL ELALTVGATE LQRFRLHAPA RTMPLMEPLR QAAEKLAQAQ ADLSRC
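The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the pipeline that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built by hand. Translation table 11 assigns the same amino acids to sense codons as the standard code and differs in the start codons it allows. This gene begins with ATG, so that difference does not matter here. The sequences in this record are printed in space-separated blocks of ten, so whitespace is stripped first.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate frame 1 of a DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAACCCA CTTTATATAC TGCTACTGGT"
print(translate(prefix))  # MKPTLYTATG
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.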