Gene P9211_04701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04701 
Symbolqri7 
ID5730411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp441304 
End bp442392 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content44% 
IMG OID641284827 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001550355 
Protein GI159903011 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.148459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.840172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATACG ACCAAATGAT GCAAACTGTA CTTGCCCTCG AAACAAGTTG TGACGAGACT 
GCCGTTGCAT TAGTCCAATT CGAAGGTGGA AAATTCCGTG TAATTGCTAA TTGCATTGCC
TCTCAGGCTG ATGAGCATTC AAAATGGGGG GGAGTAGTTC CCGAAATAGC CTCTAGACGA
CATCTGGAAT TAATGCCTTT TTTAATTAAA GAAGCATTGA TTGAGGCGAA AACAGGCTTT
GAAAGTATTG ATTTAATAGG AGCGACTGTG GCTCCAGGGC TTACAGGGGC TTTGTTGATT
GGATCCTTAA CGGCAAGAAG TCTTGCTGCT CTTCACGGGA TTCCTTTCTT TGGTGTGCAT
CATTTGGAAG GACACCTTGC TTCTGTTTTG CTTTCTGATG AAGTCCCAAC CCCTCCTTTT
TTGGTTCTAT TGGTTAGTGG TGGTCATACA GAACTTATAA GAGTAAACAA AAATTTTGAT
TATCAACGTC TTGGGCGAAG TCATGATGAT GCAGCGGGGG AGGCTTTTGA CAAGGTGGCA
AGATTATTGG GCCTGAGTTA TCCAGGTGGA CCTTCAATTG AAAAAATTGC TAAAGGAGGT
GACCCTAGAA GATTTTCTTT CCCTAAGGGA AGGGTCTCCA ACCCAGGAGG AGGCTATTAC
CCATATGATT TCTCTTTTAG TGGTCTTAAA ACAGCTGTAT TGCGTCAGGT CGAGAAGCTT
AAAAAATCGG ATATCGATCT TCCCTTGAGT GATCTTGCTG CAAGTTTTGA GCAGGTAGTG
GCAGAAGTGC TTGTTGAAAG AAGTCTGAAG TGTGCCGAAG AGCAAGGTAT CGACTCTTTA
GTAATGGTGG GAGGTGTGGC TGCAAATCAT CGATTGAGAG AGCTAATGAT GACCAGTTCG
AAGGATATTT CTCTGAAAGT TTATTTGGCA TCAAAATCTT TTTGCACAGA CAATGCAGCA
ATGATTGGTA CTGCAGCATT GCTTCGCTTG ATTTCTGGCG GTAGTCCTAG TTCTATGGAA
TTGGGTGTTT GTGCTCGTTT AGGCCTTGAA GAGGCTTCTC GTTTGTATGA CGACCAGCCT
CCTTTTTGA
 
Protein sequence
MRYDQMMQTV LALETSCDET AVALVQFEGG KFRVIANCIA SQADEHSKWG GVVPEIASRR 
HLELMPFLIK EALIEAKTGF ESIDLIGATV APGLTGALLI GSLTARSLAA LHGIPFFGVH
HLEGHLASVL LSDEVPTPPF LVLLVSGGHT ELIRVNKNFD YQRLGRSHDD AAGEAFDKVA
RLLGLSYPGG PSIEKIAKGG DPRRFSFPKG RVSNPGGGYY PYDFSFSGLK TAVLRQVEKL
KKSDIDLPLS DLAASFEQVV AEVLVERSLK CAEEQGIDSL VMVGGVAANH RLRELMMTSS
KDISLKVYLA SKSFCTDNAA MIGTAALLRL ISGGSPSSME LGVCARLGLE EASRLYDDQP
PF