Gene SbBS512_E1162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1162 
Symbol 
ID6270752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1059091 
End bp1061031 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content52% 
IMG OID641725295 
Producthypothetical protein 
Protein accessionYP_001879809 
Protein GI187732120 
COG category[R] General function prediction only 
COG ID[COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCCA CTTTATATAC TGCTACTGGT GAGTGCGTTA CGCCAGGCCG TGAACTGGGC 
AAAGGTGGCG AAGGCGCGGT TTATGATATC AATGAGTTTG TCGATAGCGT CGCCAAGATT
TATCACACGC CGCCACCCGC CTTAAAACAG GACAAACTTG CCTTTATGGC TGCGACAGCT
GACGCGCAGT TGTTGAATTA TGTCGCCTGG CCGCAGGCAA CGCTTCACGG TGGGCGAGGC
GGAAAGGTTA TCGGTTTTAT GATGCCAAAA GTTTCTGGTA AAGAACCGAT TCATATGATC
TATAGCCCGG CACATCGTCG CCAGAGTTAC CCTCATTGTG CGTGGGATTT TCTACTCTAT
GTTGCGCGCA ATATTGCTTC ATCTTTTGCT ACGGTTCACG AGCACGGGCA CGTCGTGGGG
GACGTAAACC AGAACAGCTT TATGGTAGGT CGCGACAGCA AAGTGGTGTT GATCGATAGT
GACTCCTTTC AGATTAACGC CAATGGCACA CTGCATTTAT GCGAAGTCGG CGTGTCGCAT
TTTACGCCGC CAGAGCTACA AACCTTGCCA TCATTTGTCG GTTTTGAACG TACCGCGAAT
CACGATAATT TTGGCCTTGC GTTGCTGATT TTTCACGTCT TGTTTGGTGG GCGGCATCCT
TATTCTGGTG TGCCGCTTAT CTCTGATGCG GGTAATGCGC TGGAGACGGA TATTGCCCAT
TTCCGTTATG CCTACGCGTC AGACAACCAG CGACGTGGTT TAAAACCGCC GCCACGATCG
ATTCCGCTGT CGATGTTACC GGGCGATGTT GAAGCCATGT TTCAGCAGGC GTTTACGGAA
AGTGGCGTAG CAACCGGGCG TCCGACGGCT AAAGCGTGGG TAGCAGCACT GGATTCTCTA
CGCCAACAGT TAAAGAAATG TACCGTTTCG GCAATGCATG TTTATCCCGC TCATTTGACC
GACTGCCCGT GGTGTACGCT GGATAATCAA GGCGTTATCT ATTTTATTGA TCTCGGCGAA
GAGGTCATTA CCACCGGCGG TGATTTTGTG CTGGCGAAAG TCTGGGCGAT GGTGATGGCG
TCAGTAGCGC CGCCAGCATT GCAACTGCCA TTACCCGATC ATTTCCAACC GACTGGCAGG
CCGCTTCCTT TAGGCCTGTT ACGGCGCGAA TACATCATTC TGATTGAGAT CGCACTGTCA
GCGTTATCGC TGCTGCTTTG CGGCCTTCAG GCAGAACCGC GTTATATTAT TTTGGTTCCT
GTGCTGGCGG CTATCTGGAT TATTGGCAGT CTGACAAGCA AAGCGTACAA AGCAGAAGTT
CAGCAACGCC GTGAGGCATT TAATCGCGCG AAAATGGACT ATGACCATTT AGTCAGCCAG
AGCCAACAGT TGGGCGGGCT GGAAGGTTTT ATCGCCAAAC GGACGATGCT CGAAAAAATG
AAGGACGAAA TTCTCGGGTT ACCGGAAGAA GAAAAACGTG CTCTGGCAGC ACTTCACGAC
ACCGCAAGGG AACGGCAGAA GCAGAAGTTT CTGGAGGGAT TTTTTATTGA TGTTGCCTCT
ATTCCCGGCG TTGGCCCTGC GCGTAAAGCG GCGTTACGGT CCTTTGGTAT TGAAACAGCA
GCGGATGTTA CCCGTCGTGG GGTTAAGCAA GTTAAAGGGT TTGGTGATCA TCTGACCCAG
GCGGTCATCG ACTGGAAAGC GAGCTGTGAA CGCCGTTTTG TGTTCAGGCC GAACGAAGCG
GTAACGCCTG CAGAAAGACA AGCGGTAATG GCGAAAATGG CCGCCAAACG ACATCGGCTG
GAATTGGCGT TGACTGTCGG CGCGACAGAG TTGCAGCGAT TCCGCCTTCA TGCTCCAGCA
CGGACCATGC CGTTGATGGA ACCGTTACGT CAGGCGGCAG AAAAACTGGC TCAGGCGCAG
GCAGATTTAA GTCGCTGCTG A
 
Protein sequence
MKPTLYTATG ECVTPGRELG KGGEGAVYDI NEFVDSVAKI YHTPPPALKQ DKLAFMAATA 
DAQLLNYVAW PQATLHGGRG GKVIGFMMPK VSGKEPIHMI YSPAHRRQSY PHCAWDFLLY
VARNIASSFA TVHEHGHVVG DVNQNSFMVG RDSKVVLIDS DSFQINANGT LHLCEVGVSH
FTPPELQTLP SFVGFERTAN HDNFGLALLI FHVLFGGRHP YSGVPLISDA GNALETDIAH
FRYAYASDNQ RRGLKPPPRS IPLSMLPGDV EAMFQQAFTE SGVATGRPTA KAWVAALDSL
RQQLKKCTVS AMHVYPAHLT DCPWCTLDNQ GVIYFIDLGE EVITTGGDFV LAKVWAMVMA
SVAPPALQLP LPDHFQPTGR PLPLGLLRRE YIILIEIALS ALSLLLCGLQ AEPRYIILVP
VLAAIWIIGS LTSKAYKAEV QQRREAFNRA KMDYDHLVSQ SQQLGGLEGF IAKRTMLEKM
KDEILGLPEE EKRALAALHD TARERQKQKF LEGFFIDVAS IPGVGPARKA ALRSFGIETA
ADVTRRGVKQ VKGFGDHLTQ AVIDWKASCE RRFVFRPNEA VTPAERQAVM AKMAAKRHRL
ELALTVGATE LQRFRLHAPA RTMPLMEPLR QAAEKLAQAQ ADLSRC