Gene SbBS512_E1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1150 
Symbol 
ID6270088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1042688 
End bp1044049 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content54% 
IMG OID641725280 
Productpeptidase, U32 family 
Protein accessionYP_001879797 
Protein GI187733327 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.565411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAAC CGGAACTCCT TTCCCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC 
GCTTATGGCG CAGATGCTGT TTATGCGGGC CAGCCGCGTT ATTCCCTGCG TGTGCGCAAC
AACGAATTCA ACCACGAAAA TCTTCAGCTC GGCATCAATG AAGCCCACGC GCTGGGGAAA
AAGTTTTATG TCGTGGTCAA CATTGCACCG CACAACGCCA AGCTGAAAAC CTTTATCCGT
GACCTGAAAC CGGTGGTGGA AATGGGGCCG GATGCGCTGA TTATGTCCGA TCCAGGGCTG
ATTATGCTGG TGCGTGAGCA CTTCCCTGAA ATGCCGATCC ACCTTTCGGT GCAGGCTAAC
GCCGTGAACT GGGCGACGGT GAAATTCTGG CAGCAAATGG GCCTGACCCG CGTGATCCTC
TCTCGCGAGC TGTCGCTGGA AGAGATTGAA GAGATCCGCA ATCAGGTGCC GGATATGGAG
ATCGAGATCT TCGTTCACGG CGCGCTGTGC ATGGCCTACT CCGGTCGCTG CCTGCTCTCT
GGCTATATCA ACAAGCGCGA CCCGAACCAG GGCACCTGCA CCAACGCCTG CCGCTGGGAG
TACAACGTCC AGGAAGGGAA AGAAGATGAT GTTGGCAACA TCGTACACAA GTACGAGCCG
ATTCCGGTGC AAAATGTTGA GCCGACGCTG GGTATCGGCG CACCAACCGA CAAAGTGTTT
ATGATCGAAG AGGCCCAGCG TCCGGGCGAG TATATGACCG CGTTTGAAGA TGAGCACGGC
ACTTACATCA TGAACTCGAA AGATCTGCGC GCCATCGCCC ATGTAGAACG CCTGACCAAA
ATGGGCGTGC ATTCGCTGAA AATCGAAGGT CGTACCAAAT CTTTCTACTA TTGTGCACGC
ACCGCACAGG TTTACCGCAA AGCTATCGAT GACGCCGCTG CGGGAAAACC GTTCGATACC
AGCCTGCTGG AAACTCTGGA AGGTCTGGCG CATCGTGGCT ATACCGAAGG TTTCCTGCGT
CGTCATACTC ACGACGATTA TCAGAACTAC GAATACGGTT ATTCAGTTTC TGACCGCCAG
CAGTTTGTTG GTGAGTTTAC CGGTGAGCGC AAGGGGGACC TCGCGGCGGT AGCGGTGAAA
AATAAATTCT CCGTTGGCGA CAGCCTTGAG CTGATGACGC CGCAAGGCAA CATTAATTTT
ACCCTTGAGC ACATGGAAAA CGCCAAAGGC GAAGCTATGC CGATAGCACC AGGCGATGGT
TATACTGTGT GGCTCCCGGT CCCGCAGGAT CTTGAGCTCA ATTACGCGCT GCTGATGCGT
AATTTCTCCG GGGAAACCAC GCGTAATCCC CACGGTAAGT GA
 
Protein sequence
MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK 
KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPE MPIHLSVQAN
AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRNQVPDME IEIFVHGALC MAYSGRCLLS
GYINKRDPNQ GTCTNACRWE YNVQEGKEDD VGNIVHKYEP IPVQNVEPTL GIGAPTDKVF
MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR
TAQVYRKAID DAAAGKPFDT SLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSDRQ
QFVGEFTGER KGDLAAVAVK NKFSVGDSLE LMTPQGNINF TLEHMENAKG EAMPIAPGDG
YTVWLPVPQD LELNYALLMR NFSGETTRNP HGK