Gene Information |
Locus tag | SbBS512_E2150 |
Symbol | |
ID | 6270191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1956336 |
End bp | 1957385 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726180 |
Product | DNA methylase |
Protein accession | YP_001880669 |
Protein GI | 187730072 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
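The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed gene length of 1050 bp is consistent with that reading) and that the CDS is unspliced and ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1956336
end_bp = 1957385
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS with one codon per residue plus one stop codon.
expected_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp, expected_protein_length)  # 1050 349
```

Both derived values agree with the Gene Length and Protein Length fields in the table.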
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00000209549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT ATCCGGTCGT TACCCGAAAA TTCTGTTGAC CTGATAGTCA CGGACCCGCC GTACTTTAAA GTGAAGCCTG AGGGCTGGGA TAACCAGTGG AAGGGCGACG ATGATTACCT GAAGTGGCTG GACCAGTGTC TGGCGCAGTT CTGGCGGGTG CTGAAACCTG CCGGAAGTCT TTACCTGTTC TGTGGTCATC GCCTGGCATC TGATATCGAA ATCATGATGC GTGAACGCTT CAGTGTGCTG AACCATATTA TCTGGGCGAA GCCGTCCGGA CGCTGGAACG GATGCAACAA GGAAAGCCTG CGGGCGTATT TCCCCGCCAC AGAGCGCATT CTGTTCGCGG AACATTATCA GGGGCCGTAT CGTCCGAAAG ATGCCGGGTA TGCGGCGAAG GGCAGGGTAC TGAAACAGCA TGTGATGGCC CCGCTGATTG CTTACTTTCG TGATGCGCGA GCTGCCCTGG GGATAACGGC AAAACAGATT GCAGATGCCA CAGGAAAGAA AAACATGGTG TCGCACTGGT TCAGTGCCAG TCAGTGGCAG CTACCGAACG AAAGCGATTA TCTGAAATTA CAGTCGCTGT TTGCCCGGGT GGCAGAAGAG AAACATCAGC GGAGAGAACT GGAAAAGTCC CATTACCAAC TGGTCAGCAC ATACAGTGAG CTGAGCCGGC AGTATATGGA ACTGCTGAGT GAATATAAAA ATTTGCGGAG GTATTTCGGT GTGACGGTGC AGGTGCCGTA CACCGATGTG TGGACGTATA AACCGGTGCA GTACTATCCA GGGAAACATC CGTGCGAAAA ACCGGCAGAA ATGTTGCAGC AGATAATCAA CGCGAGCAGT CGTCCGGGAG ACCAGGTTGC AGATTTTTTT ATGGGCTCAG GTTCAACGGT AAAAGCGGCA CTGGCGCTCG GGCGTCGTGC GATTGGCGTT GAACTGGAGA CCGGACGTTT TGAGCAGACA GTCAGGGAAG TTCAGGATTT AATCGTTTGA
|
Protein sequence | MLNTVKISSC ELINADCLEF IRSLPENSVD LIVTDPPYFK VKPEGWDNQW KGDDDYLKWL DQCLAQFWRV LKPAGSLYLF CGHRLASDIE IMMRERFSVL NHIIWAKPSG RWNGCNKESL RAYFPATERI LFAEHYQGPY RPKDAGYAAK GRVLKQHVMA PLIAYFRDAR AALGITAKQI ADATGKKNMV SHWFSASQWQ LPNESDYLKL QSLFARVAEE KHQRRELEKS HYQLVSTYSE LSRQYMELLS EYKNLRRYFG VTVQVPYTDV WTYKPVQYYP GKHPCEKPAE MLQQIINASS RPGDQVADFF MGSGSTVKAA LALGRRAIGV ELETGRFEQT VREVQDLIV
|
| |
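The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative example, not the database's own pipeline: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for non-start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 30 and last 15 nucleotides of the record are embedded here; paste the full sequence to check the whole protein or the GC content.

```python
from itertools import product

# Standard codon assignments in NCBI's T, C, A, G ordering.
# Assumption: table 11 differs from the standard code only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Fragments copied from the Gene sequence field above.
first_30 = "ATGCTTAATA CTGTAAAAAT ATCCAGTTGT"
last_15 = "GATTT AATCGTTTGA"

print(translate(first_30))  # MLNTVKISSC
print(translate(last_15))   # DLIV
```

The two translated fragments match the first ten and last four residues of the protein sequence listed above.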