Gene Cyan8802_1947 details

Gene Information

Locus tag: Cyan8802_1947
Symbol:
ID: 8391262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 1967274
End bp: 1969214
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 45%
IMG OID: 644979927
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_003137673
Protein GI: 257059785
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
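
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the stated gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the record; function names are illustrative only.
START_BP = 1967274
END_BP = 1969214


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1941
print(protein_length(gene_length(START_BP, END_BP)))  # 646
```

Both results match the Gene Length (1941 bp) and Protein Length (646 aa) fields.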


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCAG ACATTAATCA TTCTAATCAA ACCTCTGAAC CCGGCGGTTT AGGTCGAACC 
GTCATTATCT TGCTGTTGTG GTTATTGTTC CTGAACTTGC TCGTGATGCG CAGTGATCAC
GAGGGGATCG TCTCCTATAG CCAATTTATC GACCAAATCG AAGCCGGGAA GGTAGCTAAG
GTCAATATTG GCACAGAACG CATTGAATAT ACCTTAAAAC CCGAAATTAA TAGCAAAGAT
AAAACCCAAA CCTTAATTAC CCTTCCCATT GCCCAAGATA CGACCCTAAC CCAACGGTTA
GAAGCCCATG ATGTGGAGTT TTCAGCTATC CCACCGAGTC AAACGGGATG GATCTCCAAT
TTACTCGGTT GGATTATTCC GCCCTTAATT TTCTTTGGCA TCTGGATGTG GTTGTTAAAC
CGTTCCCAAA TGAATGGACC GGGTATGCTA ACCGTAGGAA AAAGTAACGC GCGTATTTAT
TCCCAAGGAG ATACAGGAGT CACCTTTGAA GACGTAGCGG GAGTGGATGA AGCCAAGACA
GAATTACAAG AAATCGTTGA TTTTTTGAAA AGCGCAGAAA AATATACCCG TTTAGGGGCA
AAAATTCCCA AAGGAGTGTT ATTAATCGGT CCCCCAGGAA CCGGGAAAAC CCTGTTAGCC
AAAGCGATCG CCGGAGAAGC CGGAGTTCCT TTCTTTAGTA TCTCAGGCTC GGAATTTATC
GAATTATTCG TCGGTATTGG GGCTTCACGG GTACGAGATC TCTTCGATCA AGCCAAAACT
CAGGCTCCCT GTATCGTCTT TATTGATGAA TTGGATGCCT TGGGTAAATC TAGAGCCAAT
ATGGGGGGCA TGATTGGAGG CAATGACGAA CGGGAGCAAA CCTTGAACCA ATTATTAGCC
GAAATGGATG GATTTGACCC CAATACGGGG GTAATTTTGC TGGCTGCTAC CAACCGTCCT
GAAGTGCTTG ATCCCGCCTT ATTACGTCCT GGCCGTTTTG ATCGTCAAAT CGTAGTTGAT
CGCCCTGATA AGAGTGGACG GGAAGCCATT TTGCGGGTAC ACGCCCATGA TGTCAGGTTA
GCCCCTGATG TAGATTTAGA CAAGTTAGCG GCCAGAACGC CAGGGTTTGC CGGGGCAGAT
TTAGCCAATT TAATCAACGA AGCTGCTTTA TTAGCTGCGC GCAATAACCG AGAAGCCGTG
ATGATGCAGG ATTTTAACGA GGCAATCGAG CGAGTTTTGA CAGGTTTAGA GAAAAAATCA
CGGGTATTGA ATGAATTAGA GAAGAAAACC GTCGCTTACC ATGAAGTGGG TCACGCCTTG
ATTGGGGCAA TTATGCCAGG AACCAGTAAA ATTGAGAAAA TTTCCATTGT GCCGCGTGGG
GTAGGAGCAT TGGGTTATAC CTTGCAATTG CCCGAAGAAG ACCGCTTTTT GATGTTAGAA
GATGAAATTC GCGGACGCAT TGCCACATTA TTGGGGGGAA GGGCAGCCGA AGAGTTGATG
TTTGGTCGGG TGTCTACGGG AGCCAGTGAT GATATTCAAA AAGCGACGGA TCTTGCCGAA
CGGTTTGTGA CGTTGTATGG CATGAGTGAT AAATTGGGAC CGATCGCCTT TGAAAAGGGA
CAACAGCAAT TTTTAGAGGG TTTTACCAAT CCTCGTCGTC CCGTTAGTCC TAAAGTGGCT
GAAGCGATCG ACAATGAAGT GAAAGAATTA GTCGAAGGAG CCCATCAAAT CGCGTTGAAG
ATTTTAGCAG AAAATCGGGA CTTATTAGAA ATAACGGCTC AAACGCTCTT AGAAGCCGAA
ATTTTAGAAG GAGAAGCTCT AAAAACCCAA CTTAAACAGG TTCGCCAACC ATCAATGATG
GACAATTGGT TATTAACCGG TGAGGTAACT CAGGCTTCAA CCTCCCATTC TATTAGTTCT
AATGGAAGAA TTTGTCTTTA A
 
Protein sequence
MSPDINHSNQ TSEPGGLGRT VIILLLWLLF LNLLVMRSDH EGIVSYSQFI DQIEAGKVAK 
VNIGTERIEY TLKPEINSKD KTQTLITLPI AQDTTLTQRL EAHDVEFSAI PPSQTGWISN
LLGWIIPPLI FFGIWMWLLN RSQMNGPGML TVGKSNARIY SQGDTGVTFE DVAGVDEAKT
ELQEIVDFLK SAEKYTRLGA KIPKGVLLIG PPGTGKTLLA KAIAGEAGVP FFSISGSEFI
ELFVGIGASR VRDLFDQAKT QAPCIVFIDE LDALGKSRAN MGGMIGGNDE REQTLNQLLA
EMDGFDPNTG VILLAATNRP EVLDPALLRP GRFDRQIVVD RPDKSGREAI LRVHAHDVRL
APDVDLDKLA ARTPGFAGAD LANLINEAAL LAARNNREAV MMQDFNEAIE RVLTGLEKKS
RVLNELEKKT VAYHEVGHAL IGAIMPGTSK IEKISIVPRG VGALGYTLQL PEEDRFLMLE
DEIRGRIATL LGGRAAEELM FGRVSTGASD DIQKATDLAE RFVTLYGMSD KLGPIAFEKG
QQQFLEGFTN PRRPVSPKVA EAIDNEVKEL VEGAHQIALK ILAENRDLLE ITAQTLLEAE
ILEGEALKTQ LKQVRQPSMM DNWLLTGEVT QASTSHSISS NGRICL
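
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene sequence. The following is a minimal sketch: it strips the spacing used in the 10-base groups and translates codon by codon. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard code, differing only in permitted start codons, which does not matter here because the gene starts with ATG. The names `clean`, `translate`, and `CODON_TABLE` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; amino-acid assignments
# for elongation are the same in the standard code and in table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# First line of the gene sequence, copied from the record.
FIRST_LINE = (
    "ATGTCTCCAG ACATTAATCA TTCTAATCAA ACCTCTGAAC CCGGCGGTTT AGGTCGAACC"
)


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group bases."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate(clean(FIRST_LINE)))  # MSPDINHSNQTSEPGGLGRT
```

The output matches the first 20 residues of the protein sequence (MSPDINHSNQ TSEPGGLGRT).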