Gene PCC8801_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1842 
Symbol 
ID7101773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1936063 
End bp1937985 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content44% 
IMG OID643474908 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_002372041 
Protein GI218246670 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACA AGATCTCCCA CAGTCAATCT ATTCATCAAG CAAATCGCCG TCAACCCACT 
TCACGAAAAT GGGGACATCT GGTGACTAGT TGGATGCTCA TACAATCGTT ACTTGTGGCT
ACTCCCAGTT GGGGACAAAC CCTAATTCCT TCTGGGAAAG AATCTAAACC CGAAGGTATC
AGCTATAGTC AGTTATTGAA GCAAATTGAA TCAGGAAAAG TCCGTAAAGT AGAAATTGAC
CCAAAATTAC AAAAAGCCAA AGTTACCCTA AAAAATCAAT CTGAACAAGA CCCCCCACAA
GAAGTCCCCC TCTTTAAAAG CAACCTCAAC AACGAATTAA TTGCTAAGCT GCGAGATAAC
AATGTCCCTG TGGATATTCA ACCTTCCGTA GATAATTCCG CCGCGATTAG CCTAGTTGTT
AATTTAATCG TCCTTTTTCT GCTGTTTAGT ATTTTTATTG CCATTATTAG ACGTTCGGCC
AATGCTTCCG GTCAAGCCAT GAATTTTGGT AAATCTCGCG CTAGGTTTCA GATGGAAGCC
AAAACAGGGA TCAGCTTTGA AGATGTCGCT GGTATTGATG AAGCTAAAGA AGAACTGCAA
GAAGTCGTTA CTTTTCTGAA ACAACCTGAA AAATTCACCG CTATTGGCGC AAAAATCCCC
AAAGGCGTAT TATTAGTCGG TCCCCCTGGA ACGGGTAAAA CTCTACTCGC TAAAGCCATT
GCAGGAGAAG CGGGGGTTCC TTTCTTTAGT ATTTCCGGTT CCGAATTTGT GGAAATGTTC
GTTGGGGTTG GGGCTTCGCG GGTGAGAGAT TTGTTCAAAA AAGCCAAAGA AAACGCCCCT
TGTTTGATTT TTATCGATGA AATTGATGCC GTTGGTCGTC AACGGGGAGT CGGTTATGGG
GGAGGCAATG ATGAACGGGA GCAGACCTTA AACCAATTAT TGACGGAAAT GGATGGGTTT
GAAGGAAATC GCGGAATTAT TGTTATTGCT GCCACTAACC GTCCTGATGT CCTTGATAAA
GCCTTATTGC GCCCTGGACG CTTTGATCGG CAGGTAGTGG TCGATTATCC CGATCTTAAG
GGTCGTCAGG GCATTTTAGA AGTTCACGCC CGCAATAAAA AAGTTGATCA AGAAGTCTCT
TTAGAAGCGA TCGCTCGTCG GACACCAGGC TTTACGGGGG CAGATTTAGC CAATGTCCTC
AATGAAGCAG CCATTTTTAC CGCCAGACGG CGCAAAGAAG CCATTACCAT GACCGAGATT
AACGATGCGA TTGATCGCGT TGTGGCCGGG ATGGAAGGAA CGCCCCTTGT GGACAGCAAG
AGTAAACGGT TAATTGCCTA TCATGAAATT GGCCACGCAG TGGTGGGGAG TTTGCATGAG
GGCCACGATG CCGTCGAGAA AGTGACCCTG ATTCCTCGCG GACAAGCAAA GGGGTTAACC
TGGTTTATGC CCGATGAAGA ATATGGGTTA GTGACGCGAA ATCAATTATT AGCGAGAATT
GCCGGATTAT TAGGTGGAAG GGCAGCCGAA GAGGTGATTT TTGGCGAAGA TGAAGTCACA
ACGGGGGCAG GGAATGATAT CGAAAAAGTG ACCTATTTAG CGAGGCAGAT GGTAACGCGC
TTTGGGATGT CAGAATTGGG GTTAGTTGCC CTAGAGAGTG ATAATGATGA TAGTTATGTG
GGGCTTGATG GTAGTCGGCG ATCGGATTAT TCAGACGAGA TTGCCACTAA AATTGATCAT
CAGGTGCGTT CTATTGTTGA TGATTGTCAC AATTACGCTC AAAAAATTAT CCAAGAAAAT
CGCATTGCTA TTGATCGCTT AGTGGATATT TTAATTGAAC AAGAAACCAT TGAAGGAGAA
CAATTTCGTC AACTGCTAGA AGAATTTCGC CTAAAGGTTG ATAAAACCTT ATTAAAGGTT
TAG
 
Protein sequence
MSNKISHSQS IHQANRRQPT SRKWGHLVTS WMLIQSLLVA TPSWGQTLIP SGKESKPEGI 
SYSQLLKQIE SGKVRKVEID PKLQKAKVTL KNQSEQDPPQ EVPLFKSNLN NELIAKLRDN
NVPVDIQPSV DNSAAISLVV NLIVLFLLFS IFIAIIRRSA NASGQAMNFG KSRARFQMEA
KTGISFEDVA GIDEAKEELQ EVVTFLKQPE KFTAIGAKIP KGVLLVGPPG TGKTLLAKAI
AGEAGVPFFS ISGSEFVEMF VGVGASRVRD LFKKAKENAP CLIFIDEIDA VGRQRGVGYG
GGNDEREQTL NQLLTEMDGF EGNRGIIVIA ATNRPDVLDK ALLRPGRFDR QVVVDYPDLK
GRQGILEVHA RNKKVDQEVS LEAIARRTPG FTGADLANVL NEAAIFTARR RKEAITMTEI
NDAIDRVVAG MEGTPLVDSK SKRLIAYHEI GHAVVGSLHE GHDAVEKVTL IPRGQAKGLT
WFMPDEEYGL VTRNQLLARI AGLLGGRAAE EVIFGEDEVT TGAGNDIEKV TYLARQMVTR
FGMSELGLVA LESDNDDSYV GLDGSRRSDY SDEIATKIDH QVRSIVDDCH NYAQKIIQEN
RIAIDRLVDI LIEQETIEGE QFRQLLEEFR LKVDKTLLKV