Gene Information |
Locus tag | P9303_15451 |
Symbol | |
ID | 4776663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1347684 |
End bp | 1349600 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640087054 |
Product | FtsH ATP-dependent protease-like protein |
Protein accession | YP_001017554 |
Protein GI | 124023247 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
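
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the annotated CDS includes its stop codon. The function names `feature_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(1347684, 1349600)
print(gene_len)                  # 1917
print(protein_length(gene_len))  # 638
```

Under these assumptions the computed values agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields of the record.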
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.493249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCAAC GCTTGCGTCT CATTGCTCTC TGGCTGCTGC CCATCGGTGT TGCCTTACTG CTCGCCTGGC AAATTTTAGG AAATGGCAAG CTCACAGGAG AGCAGCCGAG TAACACCACT CTGGCTCCGC GCAATGCTGC GGTTACGCGG ATGAGCTACG GCCGCTTCCT CGACTATGTG GAAGCAGGAA GGGTCACCGC CGTGGACATT TACGACGGTG GACGGAATGC AGTAGTAGAA GCCGTAGATC CCGAACTCGA CAATCGCGTA CAACGACTAA GAGTTGATCT CCCCGGTCTT GCCCCAGAGT TAATCAATAC ACTCAAAAGC GAGGGGATCA GTTTCGATAT TCATCCTGCT CGTACCACTC CGCCGGCGCT TGGCTTATTA GGGAATCTCC TCTTCCCGCT ACTCCTCATT GGCGGCCTAA TTCTGCTGGC CCGTCGATCC AGCTCAATGC CTGGCGGCCC CGGGCAGGCA ATGCAGTTCG GCAAGACAAA AGCCCGCTTC GCAATGGAAG CTGAAACTGG AGTCAAGTTT GATGATGTTG CAGGTGTAAG TGAAGCAAAA CAGGATCTTG AGGAAGTGGT GACATTCCTG AAGAAACCCG AACGCTTCAC CTCGGTAGGT GCTCAGATTC CAAGAGGTGT TCTGCTGGTA GGGCCTCCAG GTACTGGCAA AACCCTTCTT GCTAAAGCAA TTGCAGGAGA AGCAGGTGTT CCCTTCTTCT CACTCTCTGG GTCGGAATTC GTAGAAATGT TCGTTGGTGT CGGCGCGAGT CGAGTCCGAG ATTTATTCAA GCGGGCAAAA GAAAACACAC CCTGCCTAAT CTTTATTGAT GAAATTGATG CCGTAGGCCG CCAACGTGGT GCCGGAATAG GTGGCGGGAA TGATGAGCGA GAACAAACTC TCAACCAACT GTTAACAGAA ATGGACGGAT TTGAAGGTAA TAGCGGAATC ATCATTATTG CAGCAACAAA CCGTCCTGAT GTACTGGATT CTGCGCTGAT GCGACCAGGG CGTTTTGACA GGCAAGTCAG TGTGGATTCA CCAGATATCA AAGGAAGACT TGCAATTCTT GAAGTGCACG CACGTGACAA GAAACTTGAA GAAGATCTTT CCTTAAAGAA TGTAGCCCGA CGCACACCTG GATTCACCGG GGCAGACCTT GCCAACCTAC TTAATGAGGC GGCAATCCTC ACAGCTCGAC GTCGTAAAAA GGCTATCAGC CTTGATGAAA TTGACGATGC CGTTGACAGA ATCATCGCGG GCATGGAAGG GCGCCCACTC ACTGATGGAC GTAGCAAGAG ATTGATCGCA TACCACGAAG TTGGGCATGC ACTAATAGGA ACATTAGTTA AAGATCATGA CCCTGTCCAA AAAGTGACTC TGATACCACG AGGTCAGGCT CAGGGGCTGA CATGGTTTGC TCCGGATGAA GAGCAAATGC TGGTGACTCG TGCCCAACTC AAGGCTCGAA TCATGGGAGC ACTAGGAGGC AGGGCAGCAG AAGATGTTGT TTTTGGCGAT GCAGAGATCA CAACAGGTGC AGGTGGTGAC ATCCAGCAAG TTGCATCCAT GGCACGCCAA ATGGTGACAC GTTTTGGCAT GAGCGATTTA GGACCAGTCG CTCTGGAGAG CGGAAATCAA GAAGTATTTA TTGGACGTGA CCTGATGACT CGAAGCGAGA TCTCAGATGC AATATCCCGC CAAATAGACG AGGCAGTTCG GGAAATGGTT AAGCTTTGCT ATAGCGAAAC TGTAAAAATC GTCAAGCAAC ATCGGGAAGC TATGGATAGG 
CTCGTTGAAA TCCTCATCGA AAAGGAAACC ATAGACGGAG AAGAATTCAC TTCAGTGGTA GCAGAATTTA CATCTGTTCC AGAGAAGGAA AGAAGCATAC CTATCCTTCA GAGCTAA
Protein sequence | MNQRLRLIAL WLLPIGVALL LAWQILGNGK LTGEQPSNTT LAPRNAAVTR MSYGRFLDYV EAGRVTAVDI YDGGRNAVVE AVDPELDNRV QRLRVDLPGL APELINTLKS EGISFDIHPA RTTPPALGLL GNLLFPLLLI GGLILLARRS SSMPGGPGQA MQFGKTKARF AMEAETGVKF DDVAGVSEAK QDLEEVVTFL KKPERFTSVG AQIPRGVLLV GPPGTGKTLL AKAIAGEAGV PFFSLSGSEF VEMFVGVGAS RVRDLFKRAK ENTPCLIFID EIDAVGRQRG AGIGGGNDER EQTLNQLLTE MDGFEGNSGI IIIAATNRPD VLDSALMRPG RFDRQVSVDS PDIKGRLAIL EVHARDKKLE EDLSLKNVAR RTPGFTGADL ANLLNEAAIL TARRRKKAIS LDEIDDAVDR IIAGMEGRPL TDGRSKRLIA YHEVGHALIG TLVKDHDPVQ KVTLIPRGQA QGLTWFAPDE EQMLVTRAQL KARIMGALGG RAAEDVVFGD AEITTGAGGD IQQVASMARQ MVTRFGMSDL GPVALESGNQ EVFIGRDLMT RSEISDAISR QIDEAVREMV KLCYSETVKI VKQHREAMDR LVEILIEKET IDGEEFTSVV AEFTSVPEKE RSIPILQS
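
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation codons (table 11 differs in which start codons it permits, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical and introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGAATCAAC GCTTGCGTCT CATTGCTCTC"))  # MNQRLRLIAL
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CCTATCCTTCA GAGCTAA"))  # PILQS
```

The two fragments reproduce the first ten residues (MNQRLRLIAL) and the last five residues (PILQS) of the listed protein sequence.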