Gene Information |
Locus tag | P9303_14591 |
Symbol | |
ID | 4777616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1253618 |
End bp | 1255477 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640086968 |
Product | cell division protein FtsH4 |
Protein accession | YP_001017470 |
Protein GI | 124023163 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.584936 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTCAG GTCCATCGGC ACTCCAGGGG TCGCCATCTC TGAAGCGTTC AACTCTGGAA GCAGATCAGT CTCAGCATTC ATCTGAAGCT GTATCCTTTG CTCCCTTCAA GCAGAAACCC TCAATCAGCT ATAGCCAGCT GCTCAATCAG ATCAAAGAGA AAAAGGTTAA GTCGTTGGAG TTGATACCAG CACGGCGAGA AGTCGTGGTG GTTTTCAAGG ATGGACACCA GGAGAAGGTG GCGATCTTCG ATAATTACCA ACAAATCCTA CGAGTTGCCG AGGCTGCTGA TACCTCTCTG ACGGTGAAGG ACATTCGTCA GGAACAGGCA TTTGCAGGAA TGGCCGGCAA TCTTGTATTG ATCTTGATGG TTGTTGTTGG CTTGACGTTC CTTTTTCGTC GTTCTGCTCA AGTCGCAAAC AGAGCAATGG GATTTGGACG CAGTCAACCT CGGTTGAAAC CGCAAGAGGA TCTACCTATT CGCTTTGATG ATGTAGCGGG GATTACAGAA GCGAAAGAGG AGCTGCAAGA GGTGGTTACT TTCTTGCGGC AACCTGAGAG CTTCATCAAG CTTGGCGCCC TTATCCCAAG GGGTGTTCTT CTAGTGGGTG CTCCTGGAAC AGGGAAAACC CTCTTGGCCA AAGCAATTGC TGGTGAGGCA GACGTTCCTT TTTTCTCTAT GGCCGCCTCT GAATTTGTGG AATTATTTGT TGGCGTGGGC GCTAGTCGTG TAAGGGACCT ATTCCGCAAA GCTAAAGAGA AAGCCCCGTG CATCGTGTTT ATAGATGAAA TTGATGCTGT GGGGCGACAG CGAGGGGCTG GGATCGGTGG TGGAAATGAC GAACGAGAAC AGACTCTTAA TCAGCTTCTG ACAGAAATGG ATGGGTTTGC AGATAATTCT GGCGTCATTC TTCTGGCAGC TACCAATCGG CCAGATGTGC TTGATGCTGC ACTGATGCGG CCTGGACGCT TTGATCGTCG TATTGAAGTT TCCCTTCCTG ATCGTCGGGG CCGTAAGGAA ATCCTGGCTG TTCATGCTCG AACTCGACCC CTTGCTGAGG AGGTGTGTTT GCAGGATTGG GCACGTCGAA CACCTGGTTT CTCAGGGGCT GATCTTGCCA ACTTATTGAA CGAAGCTGCC ATCCTCACAG CAAGGCAGGA GAAAGCATCG ATTGGCACTG AACAGCTTGA AGCGGCGCTT GAAAGGATCA CGATGGGACT CTCTGCAGCA CCTCTACAAG ACAGTGCAAA AAAGAGGTTG ATTGCCTATC ACGAGATTGG ACATGCCCTG GTTGCAGCTT TAACCCCTCA TGCCGACCGT ATTGATAAGG TCACACTTTT GCCCCGTAGC GGAGGCGTTG GCGGATTTAC TCGTTTCTGG CCAGATGAGG AAATCCTTGA TTCAGGTTTG GTCACAAAGG GATATCTCTT CGCAAGGCTT GTGGTTGCCC TTGGCGGCCG GGCAGCAGAG TTGGTGGTTT TTGGACTCGA TGAAATCACC CAGGGTGCAA GTGGCGACTT GCAGAGCGTT GCGCATCTGG CTCGAGAGAT GGTGACGCGG TTCGGATTTT CGAGCCTTGG GCCTATCGCT TTGGAGACAG AGGGTTCAGA GGTTTTCCTA GGACGTGATT TGATCCATAC ACGCCCGAGC TACGCGGAAT CGACTGGCAA AGTCATAGAC GAGCAGATAC GAGCTTTAGC GGTTGAAGCC TTAGAGCAGG CAATTAACTT GCTTTCGCCA AGAAGGGAAG TGATGGATCT GTTGGTTGAT GCTCTGATTC AGGAGGAAAC TCTCCATACT 
GATCGGTTTC TGCAATTGGC TGGATTGACT GTTGATCAGT CACAGCCAGT GGGCGTGTAA
|
Protein sequence | MSSGPSALQG SPSLKRSTLE ADQSQHSSEA VSFAPFKQKP SISYSQLLNQ IKEKKVKSLE LIPARREVVV VFKDGHQEKV AIFDNYQQIL RVAEAADTSL TVKDIRQEQA FAGMAGNLVL ILMVVVGLTF LFRRSAQVAN RAMGFGRSQP RLKPQEDLPI RFDDVAGITE AKEELQEVVT FLRQPESFIK LGALIPRGVL LVGAPGTGKT LLAKAIAGEA DVPFFSMAAS EFVELFVGVG ASRVRDLFRK AKEKAPCIVF IDEIDAVGRQ RGAGIGGGND EREQTLNQLL TEMDGFADNS GVILLAATNR PDVLDAALMR PGRFDRRIEV SLPDRRGRKE ILAVHARTRP LAEEVCLQDW ARRTPGFSGA DLANLLNEAA ILTARQEKAS IGTEQLEAAL ERITMGLSAA PLQDSAKKRL IAYHEIGHAL VAALTPHADR IDKVTLLPRS GGVGGFTRFW PDEEILDSGL VTKGYLFARL VVALGGRAAE LVVFGLDEIT QGASGDLQSV AHLAREMVTR FGFSSLGPIA LETEGSEVFL GRDLIHTRPS YAESTGKVID EQIRALAVEA LEQAINLLSP RREVMDLLVD ALIQEETLHT DRFLQLAGLT VDQSQPVGV
|
| |
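The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the database record: it assumes 1-based inclusive coordinates, an unspliced CDS whose final codon is a stop, and that the GTG start codon is read as Met at the initiation position (as permitted under translation table 11). The helper names and the partial codon table are hypothetical and cover only the codons used here.

```python
# Values copied from the record above.
START_BP = 1253618
END_BP = 1255477
GENE_LENGTH_BP = 1860
PROTEIN_LENGTH_AA = 619


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed coordinate convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus the terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Partial codon table: only the codons that occur in the two excerpts below.
CODONS = {
    "AGT": "S", "TCA": "S", "GGT": "G", "GGC": "G",
    "CCA": "P", "GTG": "V", "TAA": "*",
}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame excerpt; the first codon becomes M if is_start."""
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODONS[c] for c in codons]
    if is_start:
        residues[0] = "M"  # alternative start codon read as Met at initiation
    return "".join(residues)


# First and last 15 bases of the gene sequence, spaces removed.
first_residues = translate("GTGAGTTCAGGTCCA")
last_residues = translate("CCAGTGGGCGTGTAA", is_start=False)

print(span_length(START_BP, END_BP), expected_protein_length(GENE_LENGTH_BP))
print(first_residues, last_residues)
```

Under these assumptions the computed span equals the listed gene length, the codon count minus the stop equals the listed protein length, and the translated excerpts agree with the first and last residues of the listed protein sequence.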