Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3786 |
Symbol | |
ID | 6270817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3511719 |
End bp | 3514040 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641727649 |
Product | putative transcriptional accessory protein |
Protein accession | YP_001882084 |
Protein GI | 187730372 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG AGCTATCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAGTCCAT TTCCGAGCAA GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC GCCGCTGCAC AATATGTTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAT GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCTACGG TGGTGAGCGG TAAAGAAGAG GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTCCAGCT TTCGCTGAAT GCCGATCCGC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC TGGACCTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG TACCGTGCGC GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG GCGGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACTGGGGTA AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTAGCGA CCGATACCAT TTACCCGCAC ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCT TGTGTGAAAA ACATAACGTT GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CTGAACGTTT CTATCTCGAC GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG TCGGTTTACT CAGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG CGTGGCGCGG TGTCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC CGCAAACTGG ACGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACT GCTTCTGTTC CGCTGTTAAC CCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC GTGGCCTGGC GCGATGAGAA CGGCCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTCAGC CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCT TGCGCATTAA CCACGGTGAT AACCCGCTGG ATGCTTCTAC CGTCCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGTGC GGTGACCAAC GTCACCAACT TTGGCGCGTT TGTCGATATC GGCGTGCATC AGGACGGCCT GGTTCACATC TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACTATGCGT CTGGACGAGC AGCCTGGCGA AACCAACGCC CGTCGCGGCG GCGGTAATGA ACGCCCGCAA AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAATAGCGCG ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
|
Protein sequence | MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
|
| |