Gene P9211_08451 details


Gene Information

Locus tag: P9211_08451
Symbol:
ID: 5731334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 738116
End bp: 740029
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 40%
IMG OID: 641285209
Product: FtsH ATP-dependent protease-like protein
Protein accession: YP_001550730
Protein GI: 159903386
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
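
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(738116, 740029)
print(length, protein_length_aa(length))  # 1914 637
```

Both results agree with the Gene Length and Protein Length fields reported for this gene.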


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0508625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00322552
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATCAGA GAGTAAAGCT AATACTTCTA TGGTTGCTGC CTATTGGGAT GGTTGTTCTT 
ATAAGCTGGC AAATACTAGG TAATGGAGAC ACAACTGCTC TAAATCAAAG CAGTAGCTCA
CTTGCTACTA GAAACTCAGC TGTTTCAAAA ATGAGTTACG GGCGTTTTAT TGATTACATC
AATGCAGGAA GAGTGACATC CGTTGATATT TATGAGGGTG GTCGTAATGC TGTAGTTGAA
GCAATAGATC CAGAACTAGA CAACAGAGTT CAAAGAATAA GGGTTGATCT ACCAGGGCTA
GCACCTGAAC TTATAAATAA ATTAAAAAGC GAAGGTATAA GCTTTGATGT TCATCCGCCA
AGAACGGCCC CACCTGCTCT AGGAATAATA GGTAATCTTA TCTTCCCAAT ATTATTAATA
GTAGGTCTAG TCTTTCTTGC TAGAAGATCC AATTCTATGC CTGGTGGACC AGGGCAGGCA
ATGCAATTCG GCAAAACAAA GGCAAGATTT GCTATGGAAG CTGAAACTGG AGTCAAGTTT
GACGATGTGG CCGGCGTTAA TGAAGCCAAG CAAGATTTAG AAGAGGTGGT GACCTTCTTG
AAACAACCTG AACGTTTTAC TTCTGTAGGT GCTCAAATTC CTAAAGGTGT TCTTTTAGTG
GGCCCTCCTG GAACAGGTAA AACTCTTCTA GCAAAGGCAA TAGCAGGAGA AGCAGGTGTA
CCTTTCTTTT CTCTTTCAGG CTCAGAATTT GTTGAGATGT TTGTAGGAGT AGGAGCAAGT
CGGGTAAGAG ACTTGTTTAA ACGTGCAAAA GAGAATAGTC CTTGCCTAAT ATTTATTGAT
GAAATTGATG CTGTAGGGAG ACAAAGAGGA GCTGGAATCG GAGGTGGAAA TGACGAAAGG
GAGCAAACTC TTAATCAATT ACTTACAGAA ATGGATGGAT TTGAAGGTAA CAGTGGAATC
ATTATTATTG CAGCAACAAA TAGACCAGAC GTACTAGATT CAGCACTTAT GAGACCAGGG
AGATTTGACA GGCAAGTATC TGTTGATGCT CCAGATATTA AAGGAAGACT TTCTATCTTA
AAGGTACATT CTAGGAACAA GAAATTAGAC AAGGTACTTT CACTTGAAAA TATAGCTCGA
AGGACACCAG GTTTTACAGG GGCAGATCTA GCGAACCTAC TAAATGAAGC GGCAATATTA
ACTGCAAGAA GAAGAAAAGA TTTTATAGGT ATTACGGAAA TAGATGATGC CGTAGATAGA
ATAATTGCTG GAATGGAAGG GCAGCCTCTC ACCGATGGAA GAAGCAAACG ACTGATTGCT
TATCACGAAG TTGGCCATGC GCTTATTGGT ACTCTTGTGA AAGATCATGA CCCCGTGCAG
AAGGTAACTC TTATACCAAG AGGTCAAGCA AAAGGACTGA CTTGGTTCTC TCCAGATGAT
GACCAAATGT TAGTAAGTAA AGCACAACTA AAAGCTAGAA TCATGGGTGC TTTAGGAGGA
AGAGCTGCAG AAGATGTGAT TTTCGGAAAT GCAGAAGTTA CAACTGGTGC AGGTGGGGAT
ATTCAACAAG TTGCTTCAAT GGCCAGGCAA ATGGTAACCA AGTTTGGGAT GAGCGACTTA
GGACCAATAT CATTGGAGAA TAGCTCTCAA GAAGTTTTTA TTGGCAGAGA CCTAATGACA
AGAAGTGATA ATTCAGATGC TATTGCCAAG CAAATTGATG ATCAAGTTAG AGAGATAGTT
AAAAAGTGTT ATAGAGAGAC ACTAGATATA GTAAATAATA ACAAAGCAGC AATGGATGGA
TTAGTAGAGG TATTGGTTGA GAAAGAAACT ATAGATGGAG ATGAATTTAG GGAAATATTA
TCAAATTATT GTGAGATACC AGACAAGAAA AATGTTGAGA ATATAGTCAT ATAG
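
The gene sequence above is laid out in space-separated blocks of ten bases. A GC content figure like the one in the record can be recomputed from such a block; this is a rough sketch, assuming GC content means the percentage of G and C bases over the whole sequence. The helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Paste the full gene sequence block here to recompute the record's value;
# only the first line is shown to keep the example short.
first_line = "ATGAATCAGA GAGTAAAGCT AATACTTCTA TGGTTGCTGC CTATTGGGAT GGTTGTTCTT"
print(round(gc_percent(clean_sequence(first_line)), 1))
```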
 
Protein sequence
MNQRVKLILL WLLPIGMVVL ISWQILGNGD TTALNQSSSS LATRNSAVSK MSYGRFIDYI 
NAGRVTSVDI YEGGRNAVVE AIDPELDNRV QRIRVDLPGL APELINKLKS EGISFDVHPP
RTAPPALGII GNLIFPILLI VGLVFLARRS NSMPGGPGQA MQFGKTKARF AMEAETGVKF
DDVAGVNEAK QDLEEVVTFL KQPERFTSVG AQIPKGVLLV GPPGTGKTLL AKAIAGEAGV
PFFSLSGSEF VEMFVGVGAS RVRDLFKRAK ENSPCLIFID EIDAVGRQRG AGIGGGNDER
EQTLNQLLTE MDGFEGNSGI IIIAATNRPD VLDSALMRPG RFDRQVSVDA PDIKGRLSIL
KVHSRNKKLD KVLSLENIAR RTPGFTGADL ANLLNEAAIL TARRRKDFIG ITEIDDAVDR
IIAGMEGQPL TDGRSKRLIA YHEVGHALIG TLVKDHDPVQ KVTLIPRGQA KGLTWFSPDD
DQMLVSKAQL KARIMGALGG RAAEDVIFGN AEVTTGAGGD IQQVASMARQ MVTKFGMSDL
GPISLENSSQ EVFIGRDLMT RSDNSDAIAK QIDDQVREIV KKCYRETLDI VNNNKAAMDG
LVEVLVEKET IDGDEFREIL SNYCEIPDKK NVENIVI
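
The protein sequence can be derived from the gene sequence with translation table 11. Below is a minimal sketch, assuming the sequence is given in reading frame and relying on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in which start codons are permitted). Alternative start codons are not handled here, and the names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate an in-frame DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATCAGA GAGTAAAGCT AATACTTCTA"))  # MNQRVKLILL
```

The output matches the first ten residues of the protein sequence listed above.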