Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_08041 |
Symbol | |
ID | 4717510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 691399 |
End bp | 693312 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640078518 |
Product | FtsH ATP-dependent protease-like protein |
Protein accession | YP_001009197 |
Protein GI | 123968339 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0315531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAA AATTTAAAAC ATTAATTTTA TGGGCTTTAC CTATACTTTT AGTAATAGCA CTTTCCTACC AATTTTTATC TTCAAGCAAC GTTGATTCAC TTAAATCTAA CGGGACTACT GTTGCGCCAA GAAATTCAGC GGTAGCAAGA GTTAGTTACG GCAGATTTTT AGATTACATT AATTCGGGAA GGGTTACATC TGTCGATATT TTTGAGGGAG GCAGAAATGC AGTTATAGAG ACCATAGATT CGGATTTAGA TAATAAAGTT CAAAGGTTGC GTGTAGATCT TCCCGGCTTA ACACCAGAAC TTATAAATAT TTTAAAAAAA GAGGGAATTA GTTTTGATGT TCATCCAATA AAAACAGCTC CCCCTGCATT AGGAATTTTA GGTAATTTAC TCTTCCCGGC TATCTTAATT GGAGGTTTAA TTTTGTTAGC GAGGAGGTCA AATGGTATGC CTGGGGGGCC TGGACAAGCA ATGCAGTTTG GGAAAACTAA AGCAAGATTT GCAATGGAAG CTGAAACAGG AGTTGTATTC GATGATGTTG CAGGCGTTAA TGAAGCTAAA CAGGATTTGC AAGAAGTTGT TACTTTTCTG AAAAAACCAG AAAAATTTAC TTCTGTAGGA GCAAGAATTC CCAAAGGGGT TCTATTAGTA GGACCTCCTG GTACTGGTAA AACCCTTTTA GCTAAAGCAA TTGCAGGTGA AGCTGGTGTT CCATTCTTCT CATTATCAGG TTCTGAATTT GTTGAAATGT TTGTTGGTGT TGGAGCTAGT AGAGTAAGAG ACCTTTTCAA AAGAGCTAAA GAGAATAGTC CCTGTTTAAT TTTTATTGAT GAAATCGACG CAGTCGGAAG GCAAAGAGGT GCAGGTATTG GTGGAGGTAA TGATGAGAGA GAACAAACTC TCAATCAATT ACTTACTGAA ATGGATGGTT TCGAAGGTAA TAGTGGCATA ATAATAATTG CAGCCACAAA CAGGCCCGAT GTTCTAGACT CAGCGCTAAT GAGACCAGGC AGATTTGATA GGCAGGTAAC TGTAGATGCA CCTGATATCA AGGGCAGACT ATCAATATTG GAAGTTCATG CAAGGAATAA GAAACTTCAA GAGGATTTAA CGCTTGAAAG CATTGCGAGA AGAACACCAG GTTTTACTGG AGCAGATTTA GCAAATTTAT TAAATGAGGC TGCTATATTA ACTGCCAGGA GGAGAAAAGA CTCTATAAGT ATCTCAGAAA TTGATGACTC TGTAGATAGG ATTGTCGCAG GAATGGAAGG TTCCCCATTA ACAGATGGTA GAAGTAAGAG ATTAATTGCT TATCATGAAG TTGGCCATGC TCTCATAGGT TCACTTGTCA AAGCCCATGA TCCTGTTCAA AAAGTGACCG TCATTCCAAG AGGTCAAGCT AAAGGATTAA CTTGGTTTAC CCCAGATGAC GAACAAACCC TTGTTAGCAG GGCGCAATTA AAAGCTAGGA TAATGGGTGC TTTAGGTGGA AGAGCTGCTG AAGATGTTGT TTTTGGAGAA GGTGAGATTA CAACAGGAGC TGGAGGTGAT TTTCAACAAG TTGCTTCAAT GGCTCGCCAA ATGGTTACTA GATTTGGAAT GAGTAATTTA GGTCCGATAG CTTTAGAAAG TGGTAATCAA GAAGTATTTG TTGGTAGAGA TTTAATGACT AGAAGTGAAG TTTCTGATTC AATTTCTAAA CAAATAGATG AAAGTGTAAG AATAATGGTA AAAGAATGTT ATAAAGAGAC CTACGATATA GTAAGCAAAA ATAGAGAAGC TATGGATAAG ATAGTTGACC TATTAATCGA AAAAGAGACA TTAGATGGTG ATGAATTTGT AAGTATTCTC TCCAAATTCA CCAAAATTCC TGAGAAAGAC AGAACACCTC AATTATTAAG CTAA
|
Protein sequence | MNQKFKTLIL WALPILLVIA LSYQFLSSSN VDSLKSNGTT VAPRNSAVAR VSYGRFLDYI NSGRVTSVDI FEGGRNAVIE TIDSDLDNKV QRLRVDLPGL TPELINILKK EGISFDVHPI KTAPPALGIL GNLLFPAILI GGLILLARRS NGMPGGPGQA MQFGKTKARF AMEAETGVVF DDVAGVNEAK QDLQEVVTFL KKPEKFTSVG ARIPKGVLLV GPPGTGKTLL AKAIAGEAGV PFFSLSGSEF VEMFVGVGAS RVRDLFKRAK ENSPCLIFID EIDAVGRQRG AGIGGGNDER EQTLNQLLTE MDGFEGNSGI IIIAATNRPD VLDSALMRPG RFDRQVTVDA PDIKGRLSIL EVHARNKKLQ EDLTLESIAR RTPGFTGADL ANLLNEAAIL TARRRKDSIS ISEIDDSVDR IVAGMEGSPL TDGRSKRLIA YHEVGHALIG SLVKAHDPVQ KVTVIPRGQA KGLTWFTPDD EQTLVSRAQL KARIMGALGG RAAEDVVFGE GEITTGAGGD FQQVASMARQ MVTRFGMSNL GPIALESGNQ EVFVGRDLMT RSEVSDSISK QIDESVRIMV KECYKETYDI VSKNREAMDK IVDLLIEKET LDGDEFVSIL SKFTKIPEKD RTPQLLS
|
| |