Gene SbBS512_E2150 details


Gene Information

Locus tag: SbBS512_E2150
Symbol:
ID: 6270191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1956336
End bp: 1957385
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 52%
IMG OID: 641726180
Product: DNA methylase
Protein accession: YP_001880669
Protein GI: 187730072
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
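
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The helper names gene_length_bp and protein_length_aa are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 1956336
END_BP = 1957385


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1050 349"
```

Both results agree with the Gene Length and Protein Length fields of the record.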


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00000209549
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT 
ATCCGGTCGT TACCCGAAAA TTCTGTTGAC CTGATAGTCA CGGACCCGCC GTACTTTAAA
GTGAAGCCTG AGGGCTGGGA TAACCAGTGG AAGGGCGACG ATGATTACCT GAAGTGGCTG
GACCAGTGTC TGGCGCAGTT CTGGCGGGTG CTGAAACCTG CCGGAAGTCT TTACCTGTTC
TGTGGTCATC GCCTGGCATC TGATATCGAA ATCATGATGC GTGAACGCTT CAGTGTGCTG
AACCATATTA TCTGGGCGAA GCCGTCCGGA CGCTGGAACG GATGCAACAA GGAAAGCCTG
CGGGCGTATT TCCCCGCCAC AGAGCGCATT CTGTTCGCGG AACATTATCA GGGGCCGTAT
CGTCCGAAAG ATGCCGGGTA TGCGGCGAAG GGCAGGGTAC TGAAACAGCA TGTGATGGCC
CCGCTGATTG CTTACTTTCG TGATGCGCGA GCTGCCCTGG GGATAACGGC AAAACAGATT
GCAGATGCCA CAGGAAAGAA AAACATGGTG TCGCACTGGT TCAGTGCCAG TCAGTGGCAG
CTACCGAACG AAAGCGATTA TCTGAAATTA CAGTCGCTGT TTGCCCGGGT GGCAGAAGAG
AAACATCAGC GGAGAGAACT GGAAAAGTCC CATTACCAAC TGGTCAGCAC ATACAGTGAG
CTGAGCCGGC AGTATATGGA ACTGCTGAGT GAATATAAAA ATTTGCGGAG GTATTTCGGT
GTGACGGTGC AGGTGCCGTA CACCGATGTG TGGACGTATA AACCGGTGCA GTACTATCCA
GGGAAACATC CGTGCGAAAA ACCGGCAGAA ATGTTGCAGC AGATAATCAA CGCGAGCAGT
CGTCCGGGAG ACCAGGTTGC AGATTTTTTT ATGGGCTCAG GTTCAACGGT AAAAGCGGCA
CTGGCGCTCG GGCGTCGTGC GATTGGCGTT GAACTGGAGA CCGGACGTTT TGAGCAGACA
GTCAGGGAAG TTCAGGATTT AATCGTTTGA
 
Protein sequence
MLNTVKISSC ELINADCLEF IRSLPENSVD LIVTDPPYFK VKPEGWDNQW KGDDDYLKWL 
DQCLAQFWRV LKPAGSLYLF CGHRLASDIE IMMRERFSVL NHIIWAKPSG RWNGCNKESL
RAYFPATERI LFAEHYQGPY RPKDAGYAAK GRVLKQHVMA PLIAYFRDAR AALGITAKQI
ADATGKKNMV SHWFSASQWQ LPNESDYLKL QSLFARVAEE KHQRRELEKS HYQLVSTYSE
LSRQYMELLS EYKNLRRYFG VTVQVPYTDV WTYKPVQYYP GKHPCEKPAE MLQQIINASS
RPGDQVADFF MGSGSTVKAA LALGRRAIGV ELETGRFEQT VREVQDLIV
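
The relationship between the gene and protein sequences above can be illustrated with a short sketch that strips the 10-base block spacing, translates codons, and computes GC content. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code, and it stops at the first stop codon. The function names clean, translate, and gc_percent are hypothetical helpers and do not come from the database.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments of translation
# table 11 are the same as those of the standard code (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGCTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT"
print(translate(first_line))  # prints "MLNTVKISSCELINADCLEF"
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to gc_percent would give a value to compare against the GC content field.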