Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2904 |
Symbol | |
ID | 4253475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 3465690 |
End bp | 3466706 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638119539 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_735032 |
Protein GI | 113971239 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000983109 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0445354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTTC TAGGTATTGA GACATCTTGT GACGAGACAG GTATTGCCGT CTATGACGAT AAGCTGGGGT TACTCTCCCA TGCTTTATAT AGTCAGGTTA AGTTGCATGC GGATTATGGT GGGGTAGTGC CTGAACTGGC TTCCCGCGAC CATGTGCGCA AAATTGTCCC GCTGATCCGC CAAGCGTTGA AAAACGCCAA TACCGAGATT GCCGATTTAG ATGGCATTGC CTACACCAAA GGCCCTGGCC TTATCGGTGC TTTATTAGTG GGCGCTTGTG TTGGCCGCTC ACTTGCCTTT GCTTGGAACA AGCCCGCCAT CGGTGTGCAT CATATGGAAG GGCATCTGCT GGCGCCAATG CTGGAAGACG ATGCGCCTGA GTTTCCCTTT GTTGCACTAT TAGTATCAGG TGGCCACTCT ATGTTAGTAA AGGTTGATGG CATCGGCCTT TATGAAGTTT TAGGTGAGTC GGTGGATGAT GCCGCCGGTG AAGCCTTCGA CAAAACCGCT AAGCTAATGG GGCTGGATTA CCCCGGCGGT CCACGGCTTG CGAAACTGGC CGCCAAAGGT GAGCCTGCAG GTTATCAATT TCCGCGGCCA ATGACCGATA GACCCGGGCT CGATTTTAGC TTCTCAGGCC TGAAAACCTT TACCGCCAAT ACCATTGCCG CCGAGCCAGA TGATGAGCAA ACCCGCGCCA ACATCGCCCG AGCCTTTGAA GAAGCGGTTG TGGATACCCT CGCGATAAAA TGTCGCCGCG CGTTAAAACA AACGGGCTAT AACCGTTTAG TGATCGCGGG TGGCGTGAGC GCCAACACAC GCTTACGGGA AACCTTGGCC GAGATGATGA CCTCTATCGG TGGCCGAGTT TATTATCCTC GCGGCGAGTT TTGTACTGAC AACGGCGCCA TGATTGCTTT CGCTGGCCTG CAGCGTTTAA AGGCGGGGCA GCAGGAAGAC TTAGCGGTCA AAGGTCAACC GAGATGGCCG CTCGATACCT TACCGCCAGT TGCGTGA
|
Protein sequence | MRVLGIETSC DETGIAVYDD KLGLLSHALY SQVKLHADYG GVVPELASRD HVRKIVPLIR QALKNANTEI ADLDGIAYTK GPGLIGALLV GACVGRSLAF AWNKPAIGVH HMEGHLLAPM LEDDAPEFPF VALLVSGGHS MLVKVDGIGL YEVLGESVDD AAGEAFDKTA KLMGLDYPGG PRLAKLAAKG EPAGYQFPRP MTDRPGLDFS FSGLKTFTAN TIAAEPDDEQ TRANIARAFE EAVVDTLAIK CRRALKQTGY NRLVIAGGVS ANTRLRETLA EMMTSIGGRV YYPRGEFCTD NGAMIAFAGL QRLKAGQQED LAVKGQPRWP LDTLPPVA
|
| |