Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_04701 |
Symbol | qri7 |
ID | 5730411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 441304 |
End bp | 442392 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641284827 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001550355 |
Protein GI | 159903011 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.148459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.840172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATACG ACCAAATGAT GCAAACTGTA CTTGCCCTCG AAACAAGTTG TGACGAGACT GCCGTTGCAT TAGTCCAATT CGAAGGTGGA AAATTCCGTG TAATTGCTAA TTGCATTGCC TCTCAGGCTG ATGAGCATTC AAAATGGGGG GGAGTAGTTC CCGAAATAGC CTCTAGACGA CATCTGGAAT TAATGCCTTT TTTAATTAAA GAAGCATTGA TTGAGGCGAA AACAGGCTTT GAAAGTATTG ATTTAATAGG AGCGACTGTG GCTCCAGGGC TTACAGGGGC TTTGTTGATT GGATCCTTAA CGGCAAGAAG TCTTGCTGCT CTTCACGGGA TTCCTTTCTT TGGTGTGCAT CATTTGGAAG GACACCTTGC TTCTGTTTTG CTTTCTGATG AAGTCCCAAC CCCTCCTTTT TTGGTTCTAT TGGTTAGTGG TGGTCATACA GAACTTATAA GAGTAAACAA AAATTTTGAT TATCAACGTC TTGGGCGAAG TCATGATGAT GCAGCGGGGG AGGCTTTTGA CAAGGTGGCA AGATTATTGG GCCTGAGTTA TCCAGGTGGA CCTTCAATTG AAAAAATTGC TAAAGGAGGT GACCCTAGAA GATTTTCTTT CCCTAAGGGA AGGGTCTCCA ACCCAGGAGG AGGCTATTAC CCATATGATT TCTCTTTTAG TGGTCTTAAA ACAGCTGTAT TGCGTCAGGT CGAGAAGCTT AAAAAATCGG ATATCGATCT TCCCTTGAGT GATCTTGCTG CAAGTTTTGA GCAGGTAGTG GCAGAAGTGC TTGTTGAAAG AAGTCTGAAG TGTGCCGAAG AGCAAGGTAT CGACTCTTTA GTAATGGTGG GAGGTGTGGC TGCAAATCAT CGATTGAGAG AGCTAATGAT GACCAGTTCG AAGGATATTT CTCTGAAAGT TTATTTGGCA TCAAAATCTT TTTGCACAGA CAATGCAGCA ATGATTGGTA CTGCAGCATT GCTTCGCTTG ATTTCTGGCG GTAGTCCTAG TTCTATGGAA TTGGGTGTTT GTGCTCGTTT AGGCCTTGAA GAGGCTTCTC GTTTGTATGA CGACCAGCCT CCTTTTTGA
|
Protein sequence | MRYDQMMQTV LALETSCDET AVALVQFEGG KFRVIANCIA SQADEHSKWG GVVPEIASRR HLELMPFLIK EALIEAKTGF ESIDLIGATV APGLTGALLI GSLTARSLAA LHGIPFFGVH HLEGHLASVL LSDEVPTPPF LVLLVSGGHT ELIRVNKNFD YQRLGRSHDD AAGEAFDKVA RLLGLSYPGG PSIEKIAKGG DPRRFSFPKG RVSNPGGGYY PYDFSFSGLK TAVLRQVEKL KKSDIDLPLS DLAASFEQVV AEVLVERSLK CAEEQGIDSL VMVGGVAANH RLRELMMTSS KDISLKVYLA SKSFCTDNAA MIGTAALLRL ISGGSPSSME LGVCARLGLE EASRLYDDQP PF
|
| |