Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_23831 |
Symbol | |
ID | 4777736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2099579 |
End bp | 2101426 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640087903 |
Product | cell division protein FtsH2 |
Protein accession | YP_001018381 |
Protein GI | 124024074 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0657792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATAAAC GCTGGCGCAA CGTAGGTCTT TACGTTCTTC TGGTGGTCGT GGTTGTCGTG GTCGGCACGG CCTTCCTCGG CAAACCTGGT ACGACTGAGC GGGAGACTCT CCGCTACAGC GAATTTGTTG AAGCAGTTCA GGACAACCAA GTGAGTCGTG TGCTGATCTC CCCAGACCAG GCCACCGCTC AGGTTGTTGA AAGCGATGGT CGACGCGCGG ATGTCAACCT TGCCCCTGAC AAGGACCTTT TGAAGTTGTT AACGGATCAC AACGTTGATA TTGCTGTTCA ACCAACCCGC CAGGCAGGTG CTTGGCAACA GGCCGCCGGA AGTTTGATTT TCCCCTTGTT GTTATTGGGT GGACTTTTCT TTCTCTTCAG ACGTTCTCAG AGTGGTGGCG GCGGTGGTAA CCCTGCCATG AATTTTGGCA AGAGCAAGGC CAGAGTTCAG ATGGAGCCGT CCACTCAGGT CACCTTTAGT GATGTTGCTG GGATCGAAGG GGCAAAGCTT GAACTTACCG AAGTCGTTGA CTTTCTCAAA AACCCTGATC GATTTACTGC TGTTGGCGCA AAAATCCCCA AAGGTGTTCT GCTTGTAGGC CCTCCTGGAA CTGGCAAAAC ATTGCTAGCC AAAGCAGTGG CTGGTGAAGC GGCGGTGCCG TTCTTCTCGA TTTCAGGCTC GGAGTTTGTC GAGATGTTTG TTGGGGTTGG CGCCAGTCGA GTCCGTGACC TATTTGAGCA GGCCAAGAAG AACGCTCCTT GCATTGTTTT CATTGATGAA ATTGATGCGG TGGGTCGGCA GCGGGGCGCT GGCCTTGGCG GTGGCAACGA TGAGCGTGAG CAGACTCTTA ACCAGCTGCT GACTGAAATG GATGGTTTCG AGGGCAATAC CGGCATCATC ATCGTGGCGG CCACCAATCG GCCTGATGTG CTCGATTCGG CGTTGATGCG CCCTGGTCGC TTTGACCGAC AGGTTGTTGT GGAGCGCCCT GATTACAGCG GTCGCCTGCA GATCCTCAAT GTGCATGCCC GCGATAAGAC CCTGTCCAAG GATGTTGATC TCGACAAAGT GGCGCGGCGC ACACCAGGCT TCACAGGGGC TGATCTTGCC AATTTGCTCA ATGAAGCGGC AATCCTCGCA GCTCGTCGAG AGCTCACAGA AGTGAGCAAT GATGAGATTA GCGACGCCAT TGAGCGGGTC ATGGCTGGTC CTGAGAAGAA GGATCGTGTC ATGAGTGAAC GGCGTAAGCA ACTGGTTGCT TATCACGAGT CTGGTCATGC CCTGGTAGGA GCCCTTATGC CTGATTACGA CTCAGTGCAG AAGATTTCCA TCATTCCTCG TGGTCAGGCT GGTGGTCTCA CATTCTTCAC CCCGAGTGAG GAGCGGATGG AGTCTGGTCT CTATTCCAGG GCTTATTTGC AGAACCAGAT GGCTGTTGCT CTGGGTGGTC GAGTTGCAGA AGAAATCGTC TACGGCGAAG ACGAGGTGAC CACTGGTGCA TCCAATGACC TTCAACAGGT CGCTCAGGTC GCCAGGCAGA TGGTGACGAG GTTCGGGATG AGCGACAAGC TTGGTCCAGT CGCTTTGGGG CGGTCTCAGG GAGGGATGTT CCTTGGTCGT GACATCGCCT CTGAACGCGA TTTCTCTGAA GACACTGCAG CGATTATCGA TGCAGAGGTC TCTGATCTGG TTGATGTGGC TTACAAGCGT GCCACCAAAG TCTTGATCGA GAATCGTTCT GTTCTTGATG AGTTAGCGGA TTTGCTTGTC GAGAAGGAAA CTGTTGATGC TCAGGATTTG CAGGACTTGC TGATCCGACG CGATGTCAGG GTTGCTGAAT ACGTCTGA
|
Protein sequence | MDKRWRNVGL YVLLVVVVVV VGTAFLGKPG TTERETLRYS EFVEAVQDNQ VSRVLISPDQ ATAQVVESDG RRADVNLAPD KDLLKLLTDH NVDIAVQPTR QAGAWQQAAG SLIFPLLLLG GLFFLFRRSQ SGGGGGNPAM NFGKSKARVQ MEPSTQVTFS DVAGIEGAKL ELTEVVDFLK NPDRFTAVGA KIPKGVLLVG PPGTGKTLLA KAVAGEAAVP FFSISGSEFV EMFVGVGASR VRDLFEQAKK NAPCIVFIDE IDAVGRQRGA GLGGGNDERE QTLNQLLTEM DGFEGNTGII IVAATNRPDV LDSALMRPGR FDRQVVVERP DYSGRLQILN VHARDKTLSK DVDLDKVARR TPGFTGADLA NLLNEAAILA ARRELTEVSN DEISDAIERV MAGPEKKDRV MSERRKQLVA YHESGHALVG ALMPDYDSVQ KISIIPRGQA GGLTFFTPSE ERMESGLYSR AYLQNQMAVA LGGRVAEEIV YGEDEVTTGA SNDLQQVAQV ARQMVTRFGM SDKLGPVALG RSQGGMFLGR DIASERDFSE DTAAIIDAEV SDLVDVAYKR ATKVLIENRS VLDELADLLV EKETVDAQDL QDLLIRRDVR VAEYV
|
| |