Gene P9301_08031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_08031 
Symbol 
ID4911658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp691076 
End bp692989 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content38% 
IMG OID640160385 
ProductFtsH ATP-dependent protease-like protein 
Protein accessionYP_001091027 
Protein GI126696141 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAGA AATTTAAAAC ATTAATTTTA TGGGCTTTAC CTATACTTTT AGTAATTGCA 
CTTTCCTACC AATTTTTATC TTCAAGCAAT GTTGATGCAC TTAAATCTAA TGGAACTACC
GTTGCACCCA GGAATTCTGC AGTAGCAAGA GTTAGTTACG GCAGATTTTT AGATTACATT
AATTCAGGAA GAGTTACGTC TGTTGATATT TTTGAGGGAG GCAGAAATGC AGTTATAGAG
ACAATAGATT CTGATTTAGA TAATAAAGTT CAAAGGTTGC GTGTAGATCT TCCAGGCTTA
ACACCAGAAC TTATAAACAT TTTAAAAAAT GAGGGAATCA GTTTTGATGT GCATCCAGTT
AAAACAGCCC CTCCTGCATT AGGAATTTTA GGTAATTTAC TTTTCCCAGC TATTTTAATT
GGAGGTTTAA TTTTGTTAGC TAGGAGGTCG AATGGTATGC CTGGAGGTCC AGGGCAAGCG
ATGCAATTTG GAAAGACAAA GGCGAGATTT GCGATGGAAG CTGAAACAGG TGTTGTTTTC
GATGATGTTG CAGGTGTTAA TGAAGCTAAA CAGGATTTGC AAGAAGTTGT CACTTTTCTG
AAAAAACCAG AAAAATTTAC TTCTGTAGGT GCAAGGATAC CGAAAGGAGT TCTATTGGTA
GGACCTCCTG GTACTGGTAA AACCCTCTTA GCAAAAGCAA TAGCAGGCGA AGCAGGTGTA
CCATTTTTCT CATTATCAGG TTCTGAATTT GTTGAAATGT TTGTTGGTGT TGGTGCTAGT
AGAGTTAGAG ATCTTTTCAA AAAAGCTAAA GAAAATAGTC CTTGTTTAAT TTTTATTGAT
GAAATCGATG CTGTTGGAAG GCAAAGAGGT GCTGGTATTG GTGGTGGTAA TGATGAGAGA
GAACAAACTC TCAATCAATT ACTTACTGAA ATGGATGGCT TCGAAGGTAA TAGTGGAATA
ATAATAATTG CAGCCACAAA CAGGCCCGAC GTTTTAGACT CAGCTCTAAT GAGGCCTGGC
AGATTTGACA GACAGGTAAC TGTAGACGCC CCTGATATAA AAGGCAGACT ATCAATATTG
GAAGTTCATG CTAGGAATAA GAAACTTGAT GGAGATTTAA CACTTGAAAG CATTGCTAGA
AGGACGCCAG GATTTACTGG AGCAGATTTA GCAAATTTAT TAAATGAGGC TGCTATATTA
ACCGCAAGAA GGAGAAAAGA CTCTATCAGT ATTTCAGAGA TTGACGACTC TGTAGATAGA
ATTGTTGCAG GAATGGAAGG TTCTCCATTA ACAGATGGTA GAAGCAAGAG ATTAATTGCT
TATCATGAGG TGGGTCATGC TCTCATAGGT TCACTTGTAA AAGCTCACGA TCCTGTTCAA
AAAGTTACAG TTATTCCAAG AGGTCAGGCT AAAGGATTAA CTTGGTTTAC CCCAGACGAT
GAGCAGACTC TTGTTAGCCG GGCTCAACTA AAGGCGAGGA TAATGGGTGC TTTAGGAGGA
AGAGCTGCAG AAGATGTGGT TTTTGGAAAA GGTGAAATTA CAACAGGAGC GGGAGGTGAT
TTCCAACAAG TTGCTTCAAT GGCCCGCCAG ATGGTTACCA GATTTGGAAT GAGTAATTTA
GGTCCGATAG CTTTAGAAAG TGGTAATCAA GAAGTATTTG TTGGTAGAGA TTTAATGACT
AGAAGTGAAG TATCTGATTC AATCTCTAAA CAAATTGACG AAAGTGTTAG AGTAATGGTC
AAGGAATGTT ATAAAGAAAC CTACGACATA GTTAACAAAA ATAGAGAAGC TATGGATAAG
ATAGTTGACC TATTAATCGA AAAAGAAACA TTAGATGGTG AAGAATTTGT AAACATTCTT
TCCAAGTTCA CTAAAATCCC AAAGAAAGAG AGAACACCTC AATTATTAAC TTAG
 
Protein sequence
MNQKFKTLIL WALPILLVIA LSYQFLSSSN VDALKSNGTT VAPRNSAVAR VSYGRFLDYI 
NSGRVTSVDI FEGGRNAVIE TIDSDLDNKV QRLRVDLPGL TPELINILKN EGISFDVHPV
KTAPPALGIL GNLLFPAILI GGLILLARRS NGMPGGPGQA MQFGKTKARF AMEAETGVVF
DDVAGVNEAK QDLQEVVTFL KKPEKFTSVG ARIPKGVLLV GPPGTGKTLL AKAIAGEAGV
PFFSLSGSEF VEMFVGVGAS RVRDLFKKAK ENSPCLIFID EIDAVGRQRG AGIGGGNDER
EQTLNQLLTE MDGFEGNSGI IIIAATNRPD VLDSALMRPG RFDRQVTVDA PDIKGRLSIL
EVHARNKKLD GDLTLESIAR RTPGFTGADL ANLLNEAAIL TARRRKDSIS ISEIDDSVDR
IVAGMEGSPL TDGRSKRLIA YHEVGHALIG SLVKAHDPVQ KVTVIPRGQA KGLTWFTPDD
EQTLVSRAQL KARIMGALGG RAAEDVVFGK GEITTGAGGD FQQVASMARQ MVTRFGMSNL
GPIALESGNQ EVFVGRDLMT RSEVSDSISK QIDESVRVMV KECYKETYDI VNKNREAMDK
IVDLLIEKET LDGEEFVNIL SKFTKIPKKE RTPQLLT