Gene Sbal223_3165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3165 
Symbol 
ID7085778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3744683 
End bp3745699 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content51% 
IMG OID643462049 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_002359073 
Protein GI217974322 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000255417 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.184542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTTC TAGGTATTGA AACATCTTGT GACGAGACAG GTATCGCCGT CTATGACGAT 
GAACTGGGCT TATTATCGCA CACTTTATAC AGTCAAGTTA AGCTGCATGC TGACTATGGT
GGTGTGGTGC CTGAGTTGGC GTCGCGTGAC CATGTACGCA AAATTGTGCC GCTTATTCGT
CAAGCGCTGA AAGATGCGAA TACTGAAATG GCTGATCTCG ACGGGATTGC CTACACCAAG
GGCCCAGGTT TGATTGGTGC TTTGCTTGTT GGTGCCTGTG TGGGGCGTTC ACTGGCTTTT
GCTTGGGACA AGCCTGCAAT CGGTGTGCAC CATATGGAAG GGCACCTGCT GGCGCCTATG
CTCGAAGATG ACGCGCCTGA GTTCCCCTTT GTGGCCTTAT TAGTTTCCGG TGGCCATTCA
ATGTTGGTTA AAGTTGATGG GATTGGCCGT TATGAAGTAT TAGGTGAGTC GGTTGACGAT
GCCGCGGGTG AAGCATTCGA TAAAACGGCC AAGTTGATGG GGCTGGATTA TCCCGGCGGT
CCGCGCTTAG CGAAACTGGC AGCTAAAGGC TTACCCGCAG GCTATAAGTT CCCGCGTCCT
ATGACCGATA GACCAGGACT CGACTTTAGT TTTTCGGGTT TAAAAACCTT TACCGCCAAT
ACCATTGCCG CTGAACCTGA TGATGAGCAA ACGCGCGCCA ATATTGCCCG TGCCTTTGAA
GAAGCCGTGG TTGATACGCT GGCGATTAAA TGTCGTCGTG CGCTGAAGCA AACTGGCTAT
AACCGCTTAG TGATTGCCGG TGGCGTGAGT GCGAATACGC GCTTAAGGGA AACCTTGGCC
GAAATGATGA ACTCGCTAGG TGGACAAGTG TTTTATCCCC GCGGTGAGTT TTGTACCGAT
AACGGCGCCA TGATTGCCTT TGCGGGATTG CAGCGTTTAA AGGCGGGACA ACACGAAGAT
TTAGCGGTAA AAGGTCAACC TCGATGGCCA TTAGATACCT TGCCACCTGT GGCATAA
 
Protein sequence
MRVLGIETSC DETGIAVYDD ELGLLSHTLY SQVKLHADYG GVVPELASRD HVRKIVPLIR 
QALKDANTEM ADLDGIAYTK GPGLIGALLV GACVGRSLAF AWDKPAIGVH HMEGHLLAPM
LEDDAPEFPF VALLVSGGHS MLVKVDGIGR YEVLGESVDD AAGEAFDKTA KLMGLDYPGG
PRLAKLAAKG LPAGYKFPRP MTDRPGLDFS FSGLKTFTAN TIAAEPDDEQ TRANIARAFE
EAVVDTLAIK CRRALKQTGY NRLVIAGGVS ANTRLRETLA EMMNSLGGQV FYPRGEFCTD
NGAMIAFAGL QRLKAGQHED LAVKGQPRWP LDTLPPVA