Gene P9211_05401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_05401 
Symbol 
ID5730785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp506599 
End bp507840 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content39% 
IMG OID641284899 
ProductZn-dependent protease 
Protein accessionYP_001550425 
Protein GI159903081 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATAC GTGGAATACC TATTAGGGTC CATCCCAGTT GGTTTCTAAT CCTTTTTGTT 
TTTACATGTG CCTCTCAAGG TCAGATATCG AACCTGTTTG ATTCTGAATT GCCTGTCCTA
TTGAGTTGGG GGATCGGTTT TCTAACTTCT TTGCTTGTTT TTGCATCAGT TGTTTTGCAT
GAACTTGGTC ACTCTTTTAT GGCAATGCAC GAGGGCATAA AAGTGAGAAG CATTACTTTG
TTTTTGTTGG GTGGGGTGGC GCGAATTGAT AAAGAATGTG TGACAGCAAT GTCTTGTTTA
AGAGTTGCTA TAGCGGGCCC TTTAGTAAGT CTTTCTTTAG CAGGTTTGCT TTTGGCCTTT
GTTCAAGTTG CATCTAACAC AAGTTTAATT GCCTCGAATT TATTCTCTCA GCTTGGAACT
ATTAATTTGT TATTGGCTTT CTTTAACTTA TTACCAGGTT TACCTTTAGA TGGAGGTGTA
ATACTAAAGT CAATTGTTTG GCACTTTTCT GGGAGTCAGC GTAAAGGATT AAAGGTTGCA
AACTATTCAG GACGACTATT ATCGGTTTTT GCGGTTTTTC TTGGGACTTT CATTTGGTTA
AGAGGTGGAG GTTTCGGAGG TATATGGTTG ATAATTCTTG GCTGGTTTGG CCTTGCTTCT
TCGCGATCTC AGAATCAGAT TTTCTCATTA CAAGAAATTT TATGCACATT GAATGTTAGC
CAAGCCTCGA GAAGAAATTT TAGAGTCCTT GAAGTGGATC AGTCTCTAAA AAGTATTAGT
GAGTTGAACT TGGGCTCGGC TGAAAATCAA CGGATATCTG AATGGGTGCT TTTGTGCAAT
GCAGGAAGAT GGGTTGGCTA TTTAACAGAT AAAGTTTTGA AGGATGTTCC AGTTCAGGAT
TGGGATAAAT ATTTGGTATC GGAATATAGT CAACCACTAA GTGAGTTGCC TTCGATAAGC
GACAAAGAAC CTCTTTGGCA CGCGGTACTA ACCTTAGAAA AGCTTAAGGC ATCAAGACTG
TTGGTCTTTA ATTCGGCAGG CTTGCCTTCT GGAACTTTAG ATAAAGTAGA TATTGGGAAT
GCAGTCCTAA GTAGACTTGG ACTTAAACTA CCAAAGTCAT TTCTCGAAAC GGCAAGGCAG
AATAACATTT ATCCATTGGG CATCTCATTG GTTCAAGTAG TCGAAGAAAT GATTATCACT
GGATTAGTTC AAGAACCTAA TAGCAATGAG TCGATGAAGT AA
 
Protein sequence
MRIRGIPIRV HPSWFLILFV FTCASQGQIS NLFDSELPVL LSWGIGFLTS LLVFASVVLH 
ELGHSFMAMH EGIKVRSITL FLLGGVARID KECVTAMSCL RVAIAGPLVS LSLAGLLLAF
VQVASNTSLI ASNLFSQLGT INLLLAFFNL LPGLPLDGGV ILKSIVWHFS GSQRKGLKVA
NYSGRLLSVF AVFLGTFIWL RGGGFGGIWL IILGWFGLAS SRSQNQIFSL QEILCTLNVS
QASRRNFRVL EVDQSLKSIS ELNLGSAENQ RISEWVLLCN AGRWVGYLTD KVLKDVPVQD
WDKYLVSEYS QPLSELPSIS DKEPLWHAVL TLEKLKASRL LVFNSAGLPS GTLDKVDIGN
AVLSRLGLKL PKSFLETARQ NNIYPLGISL VQVVEEMIIT GLVQEPNSNE SMK