Gene SbBS512_E3614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3614 
Symbol 
ID6270808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3367006 
End bp3367986 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content58% 
IMG OID641727483 
Productpeptidase, U32 family 
Protein accessionYP_001881925 
Protein GI187732504 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCTGC TCTGCCCTGC CGGAAATCTC CCGGCGCTTA AGGCGGCCAT CGAAAACGGC 
GCAGATGCTG TTTATATCGG GCTAAAAGAT GATACCAATG CCCGTCACTT CGCCGGCCTT
AACTTTACCG AGAAAAAATT GCAGGAAGCG GTGAGTTTTG TCCATCAACA TCGCCGCAAA
CTTCACATCG CGATTAACAC TTTTGCGCAT CCGGACGGTT ACGCCCGTTG GCAGCGCGCC
GTGGATATGG CGGCGCAGCT GGGTGCCGAC GCGCTGATCC TCGCCGACCT CGCCATGCTG
GAATACGCCG CCGAGCGTTA TCCGAATATT GAGCGCCACG TATCGGTGCA GGCTTCGGCG
ACCAATGAAG AGGCGATTAA CTTTTATCAT CGCCATTTTG ACGTTGCCCG CGTGGTGCTG
CCGCGCGTGT TGTCGATTCA TCAGGTGAAA CAGCTGGCAC GGGTCACACC TGTACCACTG
GAAGTGTTTG CTTTCGGCAG CCTGTGCATT ATGTCGGAAG GCCGTTGCTA TCTGTCGTCG
TATCTGACGG GTGAGTCGCC TAACACCGTG GGCGCGTGTT CTCCGGCCCG TTTCGTGCGC
TGGCAACAAA CGCCGCAGGG GCTGGAATCC CGCCTGAACG AAGTGCTGAT CGACCGTTAT
CAGGACGGCG AAAACGCAGG TTATCCGACG CTGTGTAAAG GGCGTTATCT GGTGGACGGC
GAGCGCTATC ACGCGCTGGA AGAACCAACC AGTCTCAATA CCCTGGAACT GCTGCCGGAG
TTAATGGCGG CGAATATTGC TTCGGTGAAA ATTGAAGGCC GCCAGCGTAG CCCGGCGTAT
GTCAGCCAGG TGGCGAAAGT CTGGCGTCAG GCTATCGACC GTTGTAAGGC CGATCCGCAA
AGCGCGTGGA TGGAGACGCT CGGGTCGATG TCCGAAGGCA CGCAGACCAC CCTTGGCGCG
TATCACCGTA AATGGCAGTG A
 
Protein sequence
MELLCPAGNL PALKAAIENG ADAVYIGLKD DTNARHFAGL NFTEKKLQEA VSFVHQHRRK 
LHIAINTFAH PDGYARWQRA VDMAAQLGAD ALILADLAML EYAAERYPNI ERHVSVQASA
TNEEAINFYH RHFDVARVVL PRVLSIHQVK QLARVTPVPL EVFAFGSLCI MSEGRCYLSS
YLTGESPNTV GACSPARFVR WQQTPQGLES RLNEVLIDRY QDGENAGYPT LCKGRYLVDG
ERYHALEEPT SLNTLELLPE LMAANIASVK IEGRQRSPAY VSQVAKVWRQ AIDRCKADPQ
SAWMETLGSM SEGTQTTLGA YHRKWQ