Gene PCC8801_1921 details


Gene Information

Locus tag: PCC8801_1921
Symbol:
ID: 7105725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1997147
End bp: 1999087
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 46%
IMG OID: 643474982
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_002372115
Protein GI: 218246744
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
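
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1997147
end_bp = 1999087
gene_length_bp = 1941
protein_length_aa = 646

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three reported values (coordinates, gene length, protein length) agree with one another.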


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCTCCAG ACATTAATCA TTCTAATCAA ACCTCTGAAC CCGGCGGTTT AGGTCGAACC 
GTCATTATCT TGCTGTTGTG GTTATTGTTC CTGAACTTGC TCGTGATGCG CAGTGATCAC
GAGGGGATCG TCTCCTATAG CCAATTTATC GACCAAATCG AAGCCGGGAA GGTAGCTAAG
GTCAATATTG GCACAGAACG CATTGAATAT ACCTTAAAAC CCGAAATTAA TAGCAAAGAT
AAAACCCAAA CCTTAATTAC CCTTCCCATT GCCCAAGATA CGACCCTAAC CCAACGGTTA
GAAGCCCATG ATGTGGAGTT TTCAGCTATC CCACCGAGTC AAACGGGATG GATCTCCAAT
TTACTCGGTT GGATTATTCC GCCCTTAATT TTCTTTGGCA TCTGGATGTG GTTGTTAAAC
CGTTCCCAAA TGAATGGACC GGGTATGCTA ACCGTAGGAA AAAGTAACGC GCGTATTTAT
TCCCAAGGAG ATACAGGAGT CACCTTTGAA GACGTAGCGG GAGTGGATGA AGCCAAGACA
GAATTACAAG AAATCGTTGA TTTTTTGAAA AGCGCAGAAA AATATACCCG TTTAGGGGCA
AAAATTCCCA AAGGAGTGTT ATTAATCGGT CCCCCAGGAA CCGGGAAAAC CCTGTTAGCC
AAAGCGATCG CCGGAGAAGC CGGAGTTCCT TTCTTTAGTA TCTCAGGCTC GGAATTTATC
GAATTATTCG TCGGTATTGG GGCTTCACGG GTACGAGATC TCTTCGATCA AGCCAAAACT
CAGGCTCCCT GTATCGTCTT TATTGATGAA TTGGATGCCT TGGGTAAATC TAGAGCCAAT
ATGGGGGGCA TGATCGGGGG CAATGACGAA CGGGAGCAAA CCTTGAACCA ATTATTAGCC
GAAATGGATG GATTTGACCC CAATACGGGG GTAATTTTAC TCGCTGCTAC CAACCGTCCT
GAAGTGCTTG ATCCCGCCTT ATTACGTCCT GGCCGTTTTG ATCGTCAAAT CGTGGTTGAT
CGCCCCGATA AGAGTGGACG GGAAGCCATT TTGCGGGTAC ACGCCCATGA TGTCAGGTTA
GCCCCTGATG TAGATTTAGA CAAGTTAGCG GCCAGAACGC CAGGGTTTGC CGGGGCAGAT
TTAGCCAATT TAATCAACGA AGCTGCTTTA TTAGCTGCGC GCAATAACCG AGAAGCCGTG
ATGATGCAGG ATTTTAACGA GGCAATCGAG CGAGTTTTGA CAGGTTTAGA GAAAAAATCA
CGGGTATTGA ATGAATTAGA GAAGAAAACC GTCGCTTACC ATGAAGTGGG TCACGCCTTG
ATTGGGGCAA TTATGCCAGG AACCAGTAAA ATTGAGAAAA TTTCCATTGT GCCGCGTGGG
GTAGGAGCAT TGGGTTATAC CTTGCAATTG CCCGAAGAAG ACCGCTTTTT GATGTTAGAA
GATGAAATTC GCGGACGCAT TGCCACATTA TTGGGGGGAA GGGCAGCCGA AGAGTTGATG
TTTGGTCGGG TGTCTACGGG AGCCAGTGAT GATATTCAAA AAGCGACGGA TCTTGCCGAA
CGGTTTGTGA CGTTGTATGG CATGAGTGAT AAATTGGGTC CGATCGCCTT TGAAAAGGGG
CAACAGCAAT TTTTAGAGGG TTTTACCAAT CCTCGTCGTC CTGTTAGTCC TAAAGTGGCT
GAAGCGATTG ACAATGAAGT GAAAGAATTG GTCGAAGGAG CCCATCAAAT CGCGTTGAAG
ATTTTAGCAG AAAATCGGGA CTTATTAGAA ATAACGGCTC AAACCCTGTT AGAAGCCGAA
ATTTTAGAAG GGGAAGCTCT AAAAACCCAA CTTAAACAGG TTCGCCAACC ATCAATGATG
GACAATTGGT TATTAACCGG TGAGGTAACT CAGGCTTCAA CCTTCGATTC TATTAATTCT
AATGGAAGAA TTTGTCTTTA A
 
Protein sequence
MSPDINHSNQ TSEPGGLGRT VIILLLWLLF LNLLVMRSDH EGIVSYSQFI DQIEAGKVAK 
VNIGTERIEY TLKPEINSKD KTQTLITLPI AQDTTLTQRL EAHDVEFSAI PPSQTGWISN
LLGWIIPPLI FFGIWMWLLN RSQMNGPGML TVGKSNARIY SQGDTGVTFE DVAGVDEAKT
ELQEIVDFLK SAEKYTRLGA KIPKGVLLIG PPGTGKTLLA KAIAGEAGVP FFSISGSEFI
ELFVGIGASR VRDLFDQAKT QAPCIVFIDE LDALGKSRAN MGGMIGGNDE REQTLNQLLA
EMDGFDPNTG VILLAATNRP EVLDPALLRP GRFDRQIVVD RPDKSGREAI LRVHAHDVRL
APDVDLDKLA ARTPGFAGAD LANLINEAAL LAARNNREAV MMQDFNEAIE RVLTGLEKKS
RVLNELEKKT VAYHEVGHAL IGAIMPGTSK IEKISIVPRG VGALGYTLQL PEEDRFLMLE
DEIRGRIATL LGGRAAEELM FGRVSTGASD DIQKATDLAE RFVTLYGMSD KLGPIAFEKG
QQQFLEGFTN PRRPVSPKVA EAIDNEVKEL VEGAHQIALK ILAENRDLLE ITAQTLLEAE
ILEGEALKTQ LKQVRQPSMM DNWLLTGEVT QASTFDSINS NGRICL
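
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below (`clean`, `gc_fraction`, and `translate` are hypothetical names, not part of any database tool) strip that layout, compute a GC fraction, and translate DNA codon by codon. The codon table is the standard genetic code, whose amino-acid assignments are shared by translation table 11; alternative start codons are not handled here. Only the first row of the gene sequence is used as input.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence, copied from the listing above.
first_row = "ATGTCTCCAG ACATTAATCA TTCTAATCAA ACCTCTGAAC CCGGCGGTTT AGGTCGAACC"
dna = clean(first_row)

print(len(dna))        # 60
print(translate(dna))  # MSPDINHSNQTSEPGGLGRT
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.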