Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_08031 |
Symbol | |
ID | 4911658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 691076 |
End bp | 692989 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640160385 |
Product | FtsH ATP-dependent protease-like protein |
Protein accession | YP_001091027 |
Protein GI | 126696141 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAGA AATTTAAAAC ATTAATTTTA TGGGCTTTAC CTATACTTTT AGTAATTGCA CTTTCCTACC AATTTTTATC TTCAAGCAAT GTTGATGCAC TTAAATCTAA TGGAACTACC GTTGCACCCA GGAATTCTGC AGTAGCAAGA GTTAGTTACG GCAGATTTTT AGATTACATT AATTCAGGAA GAGTTACGTC TGTTGATATT TTTGAGGGAG GCAGAAATGC AGTTATAGAG ACAATAGATT CTGATTTAGA TAATAAAGTT CAAAGGTTGC GTGTAGATCT TCCAGGCTTA ACACCAGAAC TTATAAACAT TTTAAAAAAT GAGGGAATCA GTTTTGATGT GCATCCAGTT AAAACAGCCC CTCCTGCATT AGGAATTTTA GGTAATTTAC TTTTCCCAGC TATTTTAATT GGAGGTTTAA TTTTGTTAGC TAGGAGGTCG AATGGTATGC CTGGAGGTCC AGGGCAAGCG ATGCAATTTG GAAAGACAAA GGCGAGATTT GCGATGGAAG CTGAAACAGG TGTTGTTTTC GATGATGTTG CAGGTGTTAA TGAAGCTAAA CAGGATTTGC AAGAAGTTGT CACTTTTCTG AAAAAACCAG AAAAATTTAC TTCTGTAGGT GCAAGGATAC CGAAAGGAGT TCTATTGGTA GGACCTCCTG GTACTGGTAA AACCCTCTTA GCAAAAGCAA TAGCAGGCGA AGCAGGTGTA CCATTTTTCT CATTATCAGG TTCTGAATTT GTTGAAATGT TTGTTGGTGT TGGTGCTAGT AGAGTTAGAG ATCTTTTCAA AAAAGCTAAA GAAAATAGTC CTTGTTTAAT TTTTATTGAT GAAATCGATG CTGTTGGAAG GCAAAGAGGT GCTGGTATTG GTGGTGGTAA TGATGAGAGA GAACAAACTC TCAATCAATT ACTTACTGAA ATGGATGGCT TCGAAGGTAA TAGTGGAATA ATAATAATTG CAGCCACAAA CAGGCCCGAC GTTTTAGACT CAGCTCTAAT GAGGCCTGGC AGATTTGACA GACAGGTAAC TGTAGACGCC CCTGATATAA AAGGCAGACT ATCAATATTG GAAGTTCATG CTAGGAATAA GAAACTTGAT GGAGATTTAA CACTTGAAAG CATTGCTAGA AGGACGCCAG GATTTACTGG AGCAGATTTA GCAAATTTAT TAAATGAGGC TGCTATATTA ACCGCAAGAA GGAGAAAAGA CTCTATCAGT ATTTCAGAGA TTGACGACTC TGTAGATAGA ATTGTTGCAG GAATGGAAGG TTCTCCATTA ACAGATGGTA GAAGCAAGAG ATTAATTGCT TATCATGAGG TGGGTCATGC TCTCATAGGT TCACTTGTAA AAGCTCACGA TCCTGTTCAA AAAGTTACAG TTATTCCAAG AGGTCAGGCT AAAGGATTAA CTTGGTTTAC CCCAGACGAT GAGCAGACTC TTGTTAGCCG GGCTCAACTA AAGGCGAGGA TAATGGGTGC TTTAGGAGGA AGAGCTGCAG AAGATGTGGT TTTTGGAAAA GGTGAAATTA CAACAGGAGC GGGAGGTGAT TTCCAACAAG TTGCTTCAAT GGCCCGCCAG ATGGTTACCA GATTTGGAAT GAGTAATTTA GGTCCGATAG CTTTAGAAAG TGGTAATCAA GAAGTATTTG TTGGTAGAGA TTTAATGACT AGAAGTGAAG TATCTGATTC AATCTCTAAA CAAATTGACG AAAGTGTTAG AGTAATGGTC AAGGAATGTT ATAAAGAAAC CTACGACATA GTTAACAAAA ATAGAGAAGC TATGGATAAG ATAGTTGACC TATTAATCGA AAAAGAAACA TTAGATGGTG AAGAATTTGT AAACATTCTT TCCAAGTTCA CTAAAATCCC AAAGAAAGAG AGAACACCTC AATTATTAAC TTAG
|
Protein sequence | MNQKFKTLIL WALPILLVIA LSYQFLSSSN VDALKSNGTT VAPRNSAVAR VSYGRFLDYI NSGRVTSVDI FEGGRNAVIE TIDSDLDNKV QRLRVDLPGL TPELINILKN EGISFDVHPV KTAPPALGIL GNLLFPAILI GGLILLARRS NGMPGGPGQA MQFGKTKARF AMEAETGVVF DDVAGVNEAK QDLQEVVTFL KKPEKFTSVG ARIPKGVLLV GPPGTGKTLL AKAIAGEAGV PFFSLSGSEF VEMFVGVGAS RVRDLFKKAK ENSPCLIFID EIDAVGRQRG AGIGGGNDER EQTLNQLLTE MDGFEGNSGI IIIAATNRPD VLDSALMRPG RFDRQVTVDA PDIKGRLSIL EVHARNKKLD GDLTLESIAR RTPGFTGADL ANLLNEAAIL TARRRKDSIS ISEIDDSVDR IVAGMEGSPL TDGRSKRLIA YHEVGHALIG SLVKAHDPVQ KVTVIPRGQA KGLTWFTPDD EQTLVSRAQL KARIMGALGG RAAEDVVFGK GEITTGAGGD FQQVASMARQ MVTRFGMSNL GPIALESGNQ EVFVGRDLMT RSEVSDSISK QIDESVRVMV KECYKETYDI VNKNREAMDK IVDLLIEKET LDGEEFVNIL SKFTKIPKKE RTPQLLT
|
| |