Gene SbBS512_E2348 details

Gene Information

Locus tag: SbBS512_E2348
Symbol:
ID: 6269614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2136361
End bp: 2137551
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 52%
IMG OID: 641726352
Product: hypothetical protein
Protein accession: YP_001880834
Protein GI: 187732748
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.287324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG 
TGGGTCTTTT CCGGGACCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC
GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA
ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC
CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC
AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC
GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGGGCAG AATATCAGCG CGCGGCATTA
GTTAGTGCCC TGCAAACGCT GTACCCGGAA TGTGCGATTT ACGATCGCAG CGATGTTGCG
GTACGTAAAA AAGAAGGGAT GGAGCTGACC CTGGGCCTCG TCACCGGCGA GTTGCCGCCT
GCCCTGCTGC CGATTGAAGA ACACGGCATG AAGCTGCTGG TGGATATTCA GCACGGGCAC
AAAACGGGCT ACTACCTGGA CCAGCGAGAC AGCCGCCTGG CTACCCGCCG CTACGTTGAA
AATAAACGTG TGCTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG
GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCGCTGGA TATTGCACGG
CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC
TTTAAATTGC TGCGTACTTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC
CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGTGG CTATAAAGAT
ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT
TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC
GGCCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT
ACCTATCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
 
Protein sequence
MSVRLVLAKG REKSLLRRHP WVFSGTVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ 
IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF
GNFLVLQLLS AGAEYQRAAL VSALQTLYPE CAIYDRSDVA VRKKEGMELT LGLVTGELPP
ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM
GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD
PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA
GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM
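
The length fields in this record can be cross-checked against the coordinates and the sequence text. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields of this record.
    length = gene_length(2136361, 2137551)
    print(length, protein_length(length))
```

Passing the full Gene sequence block to `clean_sequence` and `gc_percent` would allow the Gene Length and GC content fields to be checked the same way.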