Gene SbBS512_E4441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4441 
Symbol 
ID6272000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4149124 
End bp4150857 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content50% 
IMG OID641728236 
Producthypothetical protein 
Protein accessionYP_001882649 
Protein GI187732871 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTCCA CAGAAGTCCA GGCTAAACCC CTTTTTAGCT GGAAAGCCCT GGGTTGGGCA 
CTGCTCTACT TTTGGTTTTT CTCTACTCTG CTACAGGCCA TTATTTACAT CAGTGGTTAT
AGTGGCACCA ACGGCATTCG CGACTCGCTG TTATTCAGCT CGCTGTGGTT GATCCCGGTA
TTCCTCTTTC CGAAGCGGAT TAAAATTATT GCCGCAGTAA TCGGCGTGGT GCTATGGGCG
GCCTCTCTGG CGGCGCTATG CTACTACGTC ATCTACGGGC AGGAGTTCTC GCAGAGCGTT
CTGTTTGTGA TGTTCGAAAC CAACACCAAC GAAGCCAGCG AGTATTTAAG CCAGTATTTC
AGCCTGAAAA TTGTGCTTAT CGCGCTGGCC TATACGGCGG TGGCAGTTCT GCTGTGGACA
CGCCTGCGCC CGGTCTATAT TCCAAAGCCG TGGCGTTATG TTGTCTCTTT TGCCCTGCTT
TATGGCTTGA TTCTGCATCC GATCGCCATG AATACGTTTA TCAAAAACAA GCCGTTTGAG
AAAACGTTGG ATAACCTGGC CTCGCGTATG GAGCCTGCCG CACCGTGGCA ATTCCTGACC
GGCTATTATC AGTATCGTCA GCAACTAAAC TCGCTAACAA AGTTACTGAA TGAAAATAAT
GCCTTGCCGC CACTGGCTAA TTTCAAAGAT GAATCGGGTA ACGAACCGCG CACTTTAGTG
CTGGTGATTG GCGAGTCGAC CCAGCGCGGA CGCATGAGTC TGTACGGTTA TCCGCGTGAA
ACCACGCCGG AGCTGGATGC GCTGCATAAA ACCGATCCGA ATCTGACCGT GTTTAATAAC
GTAGTTACGT CTCGTCCGTA CACCATTGAA ATCCTGCAAC AGGCGCTGAC CTTTGCCAAT
GAAAAGAACC CGGATCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAACAGGCG
GGTTATAAAA CCTTCTGGAT CACCAACCAG CAGACGATGA CCGCCCGCAA TACCATGCTG
ACGGTATTTT CGCGCCAGAC CGACAAGCAG TACTACATGA ACCAGCAACG TACGCAGAGT
GCGCGTGAAT ACGACACCAA CGTGCTGAAG CCGTTCCAGG ATGTGCTGAA TGACCCTGCG
CCGAAGAAAC TGATCATCGT TCATCTGCTG GGTACGCATA TCAAATACAA ATACCGCTAC
CCGGAAAATC AGGGCAAGTT TGATGGCAAT ACCGATCATG TTCCGCCAGG ATTAAGCGCA
GAAGAGCTGG AATCATATAA CGATTATGAC AACGCTAACT TGTATAACGA TCATGTGGTT
GCCAGCCTGA TTAAAGACTT TAAAGCGGCA GACCCGAACG GATTCCTTGT TTACTTCTCT
GACCACGGTG AAGAGGTTTA CGACACGCCG CCGCACAAAA CTCAGGGGCG TAACGAAGAC
AACCCGACGC GCCACATGTA CACCATTCCG TTCCTGCTGT GGACGTCGGA AAAATGGCAA
GCGACTCATC CCCGTGATTT CTCACAGGAT GTCGATCGTA AATACAGCCT GGCGGAACTG
ATCCACACCT GGTCAGATTT GGCGGGCTTA TCTTACGACG GTTACGATCC AACCCGTTCA
GTGGTGAATC CGCAGTTCAA AGAAACTACC CGCTGGATTG GTAACCCGTA CAAGAAAAAC
GCGCTGATCG ATTACGACAC TCTGCCGTAT GGCGACCAGG TAGGTAATCA GTAA
 
Protein sequence
MHSTEVQAKP LFSWKALGWA LLYFWFFSTL LQAIIYISGY SGTNGIRDSL LFSSLWLIPV 
FLFPKRIKII AAVIGVVLWA ASLAALCYYV IYGQEFSQSV LFVMFETNTN EASEYLSQYF
SLKIVLIALA YTAVAVLLWT RLRPVYIPKP WRYVVSFALL YGLILHPIAM NTFIKNKPFE
KTLDNLASRM EPAAPWQFLT GYYQYRQQLN SLTKLLNENN ALPPLANFKD ESGNEPRTLV
LVIGESTQRG RMSLYGYPRE TTPELDALHK TDPNLTVFNN VVTSRPYTIE ILQQALTFAN
EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTMTARNTML TVFSRQTDKQ YYMNQQRTQS
AREYDTNVLK PFQDVLNDPA PKKLIIVHLL GTHIKYKYRY PENQGKFDGN TDHVPPGLSA
EELESYNDYD NANLYNDHVV ASLIKDFKAA DPNGFLVYFS DHGEEVYDTP PHKTQGRNED
NPTRHMYTIP FLLWTSEKWQ ATHPRDFSQD VDRKYSLAEL IHTWSDLAGL SYDGYDPTRS
VVNPQFKETT RWIGNPYKKN ALIDYDTLPY GDQVGNQ