Gene Shewmr4_2904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2904 
Symbol 
ID4253475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3465690 
End bp3466706 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID638119539 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_735032 
Protein GI113971239 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000983109 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0445354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTTC TAGGTATTGA GACATCTTGT GACGAGACAG GTATTGCCGT CTATGACGAT 
AAGCTGGGGT TACTCTCCCA TGCTTTATAT AGTCAGGTTA AGTTGCATGC GGATTATGGT
GGGGTAGTGC CTGAACTGGC TTCCCGCGAC CATGTGCGCA AAATTGTCCC GCTGATCCGC
CAAGCGTTGA AAAACGCCAA TACCGAGATT GCCGATTTAG ATGGCATTGC CTACACCAAA
GGCCCTGGCC TTATCGGTGC TTTATTAGTG GGCGCTTGTG TTGGCCGCTC ACTTGCCTTT
GCTTGGAACA AGCCCGCCAT CGGTGTGCAT CATATGGAAG GGCATCTGCT GGCGCCAATG
CTGGAAGACG ATGCGCCTGA GTTTCCCTTT GTTGCACTAT TAGTATCAGG TGGCCACTCT
ATGTTAGTAA AGGTTGATGG CATCGGCCTT TATGAAGTTT TAGGTGAGTC GGTGGATGAT
GCCGCCGGTG AAGCCTTCGA CAAAACCGCT AAGCTAATGG GGCTGGATTA CCCCGGCGGT
CCACGGCTTG CGAAACTGGC CGCCAAAGGT GAGCCTGCAG GTTATCAATT TCCGCGGCCA
ATGACCGATA GACCCGGGCT CGATTTTAGC TTCTCAGGCC TGAAAACCTT TACCGCCAAT
ACCATTGCCG CCGAGCCAGA TGATGAGCAA ACCCGCGCCA ACATCGCCCG AGCCTTTGAA
GAAGCGGTTG TGGATACCCT CGCGATAAAA TGTCGCCGCG CGTTAAAACA AACGGGCTAT
AACCGTTTAG TGATCGCGGG TGGCGTGAGC GCCAACACAC GCTTACGGGA AACCTTGGCC
GAGATGATGA CCTCTATCGG TGGCCGAGTT TATTATCCTC GCGGCGAGTT TTGTACTGAC
AACGGCGCCA TGATTGCTTT CGCTGGCCTG CAGCGTTTAA AGGCGGGGCA GCAGGAAGAC
TTAGCGGTCA AAGGTCAACC GAGATGGCCG CTCGATACCT TACCGCCAGT TGCGTGA
 
Protein sequence
MRVLGIETSC DETGIAVYDD KLGLLSHALY SQVKLHADYG GVVPELASRD HVRKIVPLIR 
QALKNANTEI ADLDGIAYTK GPGLIGALLV GACVGRSLAF AWNKPAIGVH HMEGHLLAPM
LEDDAPEFPF VALLVSGGHS MLVKVDGIGL YEVLGESVDD AAGEAFDKTA KLMGLDYPGG
PRLAKLAAKG EPAGYQFPRP MTDRPGLDFS FSGLKTFTAN TIAAEPDDEQ TRANIARAFE
EAVVDTLAIK CRRALKQTGY NRLVIAGGVS ANTRLRETLA EMMTSIGGRV YYPRGEFCTD
NGAMIAFAGL QRLKAGQQED LAVKGQPRWP LDTLPPVA