Gene SbBS512_E1444 details


Gene Information

Locus tag: SbBS512_E1444
Symbol:
ID: 6268756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1318344
End bp: 1319393
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 53%
IMG OID: 641725544
Product: DNA methylase
Protein accession: YP_001880053
Protein GI: 187733653
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
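
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1318344, 1319393)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the result agrees with the listed Gene Length (1050 bp) and Protein Length (349 aa).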


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAATA CTGTAAAAAT ATCCAGTTGT GAGTTAATCA ACGCCGACTG CCTGGAATTT 
ATCCGGTCGT TACCCGAAAA TTCTGTTGAC CTGATAGTCA CGGACCCGCC GTACTTTAAA
GTGAAGCCTG AGGGCTGGGA TAACCAGTGG AAGGGCGACG ATGATTACCT GAAGTGGCTG
GACCAGTGTC TGGCGCAGTT CTGGCGGGTG CTGAAACCTG CCGGAAGTCT TTACCTGTTC
TGTGGTCATC GCCTGGCATC TGATATCGAA ATCATGATGC GTGAACGCTT CAGTGTGCTG
AACCATATTA TCTGGGCGAA GCCGTCCGGA CGCTGGAACG GATGCAACAA GGAAAGCCTG
CGGGCGTATT TCCCCGCCAC AGAGCGCATT CTGTTCGCGG AACATTATCA GGGGCCGTAT
CGTCCGAAAG ATGCCGGGTA TGCGGCGAAG GGCAGTGCAC TGAAACAGCA TGTGATGGCC
CCGCTGATTT CTTACTTTCG TGATGCGCGC GCGGCCCTGG GGATAACGGC AAAACAGATT
GCAGATGCCA CAGGAAAGAA AAACATGGTG TCGCACTGGT TCAGTGCCAG TCAGTGGCAG
CTACCGAATG AAAGCGATTA TCTGAAATTA CAGGCGCTGT TTGCCCGGGT GGCAGAAGAG
AAGCATCGGC GTGGTGAACT GGAAAAGCTC CACCACCAGC TGGTGGATAC GTATACCTCA
CTGAACCGGC AGTATGCGGA GCTGCTGAGT GAATATAAAC ATCTGCGGCG GTATTTTGGC
GTGACGGTGC AGGTGCCGTA TACCGATGTG TGGACGCATA AACCGGTGCA GTTCTATCCC
GGGAAACATC CGTGCGAAAA ACCGGCAGAA ATGCTGCAGC AGATAATCAG CGCAAGTAGC
CGTCCTGGTG ATCTGGTTGC GGATTTTTTC ATGGGGTCGG GTTCAACGGT AAAAGCGGCG
ATGGCACTGG GGCGTCGTGC GATTGGTGTT GAGCTGGAGA CCGGACGTTT TGAGCAGACA
GTCAGGGAAG TTCAGGATTT AATCGTTTGA
 
Protein sequence
MLNTVKISSC ELINADCLEF IRSLPENSVD LIVTDPPYFK VKPEGWDNQW KGDDDYLKWL 
DQCLAQFWRV LKPAGSLYLF CGHRLASDIE IMMRERFSVL NHIIWAKPSG RWNGCNKESL
RAYFPATERI LFAEHYQGPY RPKDAGYAAK GSALKQHVMA PLISYFRDAR AALGITAKQI
ADATGKKNMV SHWFSASQWQ LPNESDYLKL QALFARVAEE KHRRGELEKL HHQLVDTYTS
LNRQYAELLS EYKHLRRYFG VTVQVPYTDV WTHKPVQFYP GKHPCEKPAE MLQQIISASS
RPGDLVADFF MGSGSTVKAA MALGRRAIGV ELETGRFEQT VREVQDLIV
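
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation using only the Python standard library. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration. The sketch assumes the sequence is given in the coding direction and ignores alternative start codons, which table 11 permits but which do not matter for a gene starting with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 11, listed in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
print(translate("ATGCTTAATA CTGTAAAAAT ATCCAGTTGT"))
print(translate("GTCAGGGAAG TTCAGGATTT AATCGTTTGA"))
```

The two printed fragments can be compared against the beginning and end of the protein sequence listed above; passing the full gene sequence to `translate` works the same way.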