Gene SbBS512_E3786 details


Gene Information

Locus tag: SbBS512_E3786
Symbol:
ID: 6270817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3511719
End bp: 3514040
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 57%
IMG OID: 641727649
Product: putative transcriptional accessory protein
Protein accession: YP_001882084
Protein GI: 187730372
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
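
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon, neither of which is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3511719
end_bp = 3514040

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 2322 bp
print(protein_length, "aa")   # 773 aa
```

Under those assumptions the computed values agree with the Gene Length (2322 bp) and Protein Length (773 aa) fields.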


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG 
GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT
CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG
AGCTATCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAGTCCAT TTCCGAGCAA
GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC
GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA
GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC
GCCGCTGCAC AATATGTTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAT
GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG
CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCTACGG TGGTGAGCGG TAAAGAAGAG
GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT
CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTCCAGCT TTCGCTGAAT
GCCGATCCGC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT
CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC
TGGACCTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG TACCGTGCGC
GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG
GCGGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACTGGGGTA
AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTAGCGA CCGATACCAT TTACCCGCAC
ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCT TGTGTGAAAA ACATAACGTT
GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CTGAACGTTT CTATCTCGAC
GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG
TCGGTTTACT CAGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG
CGTGGCGCGG TGTCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC
GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC
CGCAAACTGG ACGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACT
GCTTCTGTTC CGCTGTTAAC CCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC
GTGGCCTGGC GCGATGAGAA CGGCCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTCAGC
CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCT TGCGCATTAA CCACGGTGAT
AACCCGCTGG ATGCTTCTAC CGTCCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG
GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA
GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT
GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGTGC GGTGACCAAC
GTCACCAACT TTGGCGCGTT TGTCGATATC GGCGTGCATC AGGACGGCCT GGTTCACATC
TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT
GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACTATGCGT
CTGGACGAGC AGCCTGGCGA AACCAACGCC CGTCGCGGCG GCGGTAATGA ACGCCCGCAA
AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAATAGCGCG
ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
 
Protein sequence
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 
SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE
AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV
RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN
ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA
SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS
RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN
VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR
LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
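
The gene and protein sequences are printed in 10-character blocks. As a rough illustration of how the two relate, the sketch below removes the block spacing, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11. The function names are hypothetical and not part of any database tooling, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons ordered
# with bases T, C, A, G at each position ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence, as printed in the record.
fragment = "ATGATGAATG ATTCGTTCTG CCGCATTATT"
print(translate(fragment))  # MMNDSFCRII, the first 10 residues of the protein
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison with the reported GC content and protein sequence.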