Gene SbBS512_E3498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3498 
Symbolgcp 
ID6273172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3250012 
End bp3251025 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content57% 
IMG OID641727379 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001881826 
Protein GI187731352 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000164259 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT 
GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC
GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CACGTGCGCA AAACCGTACC GTTGATCCAG
GCGGCGCTAA AGGAGTCTGG CTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA
GGCCCTGGAT TAGTCGGCGC ACTGCTGGTT GGCGCAACCG TGGGGCGTTC TCTGGCGTTT
GCCTGGAACG TTCCGGCAAT CCCGGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG
CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTCGCGCTGC TGGTGTCCGG CGGTCATACG
CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT
GCCGCCGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGA
CCGTTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTTTT CCCGCGTCCG
ATGACCGACC GTCCGGGGCT GGATTTCAGC TTTTCTGGTC TGAAAACCTT TGCGGCGAAC
ACGATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA
GATGCGGTGG TCGATACGTT GATGATTAAG TGTAAGCGAG CGTTGGATCA GACGGGCTTT
AAGCGACTGG TCATGGCGGG CGGCGTGAGT GCTAACCGCA CGTTACGGGC GAAGCTGGCG
GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAGTT TTGTACTGAT
AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT
CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCTGC GTAA
 
Protein sequence
MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM
LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE
DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD
NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA