Gene A9601_08041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_08041 
Symbol 
ID4717510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp691399 
End bp693312 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content38% 
IMG OID640078518 
ProductFtsH ATP-dependent protease-like protein 
Protein accessionYP_001009197 
Protein GI123968339 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0315531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAA AATTTAAAAC ATTAATTTTA TGGGCTTTAC CTATACTTTT AGTAATAGCA 
CTTTCCTACC AATTTTTATC TTCAAGCAAC GTTGATTCAC TTAAATCTAA CGGGACTACT
GTTGCGCCAA GAAATTCAGC GGTAGCAAGA GTTAGTTACG GCAGATTTTT AGATTACATT
AATTCGGGAA GGGTTACATC TGTCGATATT TTTGAGGGAG GCAGAAATGC AGTTATAGAG
ACCATAGATT CGGATTTAGA TAATAAAGTT CAAAGGTTGC GTGTAGATCT TCCCGGCTTA
ACACCAGAAC TTATAAATAT TTTAAAAAAA GAGGGAATTA GTTTTGATGT TCATCCAATA
AAAACAGCTC CCCCTGCATT AGGAATTTTA GGTAATTTAC TCTTCCCGGC TATCTTAATT
GGAGGTTTAA TTTTGTTAGC GAGGAGGTCA AATGGTATGC CTGGGGGGCC TGGACAAGCA
ATGCAGTTTG GGAAAACTAA AGCAAGATTT GCAATGGAAG CTGAAACAGG AGTTGTATTC
GATGATGTTG CAGGCGTTAA TGAAGCTAAA CAGGATTTGC AAGAAGTTGT TACTTTTCTG
AAAAAACCAG AAAAATTTAC TTCTGTAGGA GCAAGAATTC CCAAAGGGGT TCTATTAGTA
GGACCTCCTG GTACTGGTAA AACCCTTTTA GCTAAAGCAA TTGCAGGTGA AGCTGGTGTT
CCATTCTTCT CATTATCAGG TTCTGAATTT GTTGAAATGT TTGTTGGTGT TGGAGCTAGT
AGAGTAAGAG ACCTTTTCAA AAGAGCTAAA GAGAATAGTC CCTGTTTAAT TTTTATTGAT
GAAATCGACG CAGTCGGAAG GCAAAGAGGT GCAGGTATTG GTGGAGGTAA TGATGAGAGA
GAACAAACTC TCAATCAATT ACTTACTGAA ATGGATGGTT TCGAAGGTAA TAGTGGCATA
ATAATAATTG CAGCCACAAA CAGGCCCGAT GTTCTAGACT CAGCGCTAAT GAGACCAGGC
AGATTTGATA GGCAGGTAAC TGTAGATGCA CCTGATATCA AGGGCAGACT ATCAATATTG
GAAGTTCATG CAAGGAATAA GAAACTTCAA GAGGATTTAA CGCTTGAAAG CATTGCGAGA
AGAACACCAG GTTTTACTGG AGCAGATTTA GCAAATTTAT TAAATGAGGC TGCTATATTA
ACTGCCAGGA GGAGAAAAGA CTCTATAAGT ATCTCAGAAA TTGATGACTC TGTAGATAGG
ATTGTCGCAG GAATGGAAGG TTCCCCATTA ACAGATGGTA GAAGTAAGAG ATTAATTGCT
TATCATGAAG TTGGCCATGC TCTCATAGGT TCACTTGTCA AAGCCCATGA TCCTGTTCAA
AAAGTGACCG TCATTCCAAG AGGTCAAGCT AAAGGATTAA CTTGGTTTAC CCCAGATGAC
GAACAAACCC TTGTTAGCAG GGCGCAATTA AAAGCTAGGA TAATGGGTGC TTTAGGTGGA
AGAGCTGCTG AAGATGTTGT TTTTGGAGAA GGTGAGATTA CAACAGGAGC TGGAGGTGAT
TTTCAACAAG TTGCTTCAAT GGCTCGCCAA ATGGTTACTA GATTTGGAAT GAGTAATTTA
GGTCCGATAG CTTTAGAAAG TGGTAATCAA GAAGTATTTG TTGGTAGAGA TTTAATGACT
AGAAGTGAAG TTTCTGATTC AATTTCTAAA CAAATAGATG AAAGTGTAAG AATAATGGTA
AAAGAATGTT ATAAAGAGAC CTACGATATA GTAAGCAAAA ATAGAGAAGC TATGGATAAG
ATAGTTGACC TATTAATCGA AAAAGAGACA TTAGATGGTG ATGAATTTGT AAGTATTCTC
TCCAAATTCA CCAAAATTCC TGAGAAAGAC AGAACACCTC AATTATTAAG CTAA
 
Protein sequence
MNQKFKTLIL WALPILLVIA LSYQFLSSSN VDSLKSNGTT VAPRNSAVAR VSYGRFLDYI 
NSGRVTSVDI FEGGRNAVIE TIDSDLDNKV QRLRVDLPGL TPELINILKK EGISFDVHPI
KTAPPALGIL GNLLFPAILI GGLILLARRS NGMPGGPGQA MQFGKTKARF AMEAETGVVF
DDVAGVNEAK QDLQEVVTFL KKPEKFTSVG ARIPKGVLLV GPPGTGKTLL AKAIAGEAGV
PFFSLSGSEF VEMFVGVGAS RVRDLFKRAK ENSPCLIFID EIDAVGRQRG AGIGGGNDER
EQTLNQLLTE MDGFEGNSGI IIIAATNRPD VLDSALMRPG RFDRQVTVDA PDIKGRLSIL
EVHARNKKLQ EDLTLESIAR RTPGFTGADL ANLLNEAAIL TARRRKDSIS ISEIDDSVDR
IVAGMEGSPL TDGRSKRLIA YHEVGHALIG SLVKAHDPVQ KVTVIPRGQA KGLTWFTPDD
EQTLVSRAQL KARIMGALGG RAAEDVVFGE GEITTGAGGD FQQVASMARQ MVTRFGMSNL
GPIALESGNQ EVFVGRDLMT RSEVSDSISK QIDESVRIMV KECYKETYDI VSKNREAMDK
IVDLLIEKET LDGDEFVSIL SKFTKIPEKD RTPQLLS