Gene PCC8801_4451 details

Gene Information

Locus tag: PCC8801_4451
Symbol:
ID: 7095828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011721
Strand:
Start bp: 2889
End bp: 5276
Gene Length: 2388 bp
Protein Length: 795 aa
Translation table: 11
GC content: 48%
IMG OID: 643467408
Product: peptidase C14 caspase catalytic subunit p20
Protein accession: YP_002364704
Protein GI: 218203849
COG category: [R] General function prediction only
COG ID: [COG4249] Uncharacterized protein containing caspase domain
TIGRFAM ID:
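
The coordinate and length fields above can be checked against each other with a little arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2889
end_bp = 5276

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2388
print(protein_length)  # 795
```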


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCCCA TTAGCCTTGG TACCAGTCAC TCCAAATTGG CCTTAGAAAC AGGTCAAGCT 
AAGTTGTGGA TTCTCCTAAT TGGGGTTAAT GACTACCAGG ATTTGAATCT ACCGAGACTG
AGTTATCCGG CCTTAGACTG TCAAGGGTTA AGTGACTCCT TACTGGAAGC AACCCAAGCG
TTTCCGCAAA AAAATATTAT CATTCATCAT GATTTTGCCC CACAAAAACC CCTTCTCGAA
ACTGTCAAAA CCAGTTTAGA GACGATTGTT ACTTCAGCCA AAACCCAAGA TACCATTCTC
TTTTATTTTT CCGGCCACGG GTTACTAGAA ACCCAAAGTC AGGAAACCGT TCTTTGTGTA
GCGGATACCC AAACCGATAA TCTTCTGAAG ACAGGACTCA TCCTGCAAGA ATTACTGCAA
AGATTGAGTC AATGTTCAGC CCATTGTCAA GTGATTTGGC TAGATGCTTG TCACTGTGGC
AACATGACCC TACGGGGAGC CAAAGGGTGT ACGGAAGAAC CGTCATGGGA TGATCCCACC
AGTCAATTAT TAGCAACTCT ACGCCAGAAA GCCGCCAAAA GTAAAGGCTT TTATGCCCTA
TTATCCTGTG ATCAGGGACA AAAATCTTGG GAGTTTCCCG ACTTAGGCCA TGGAGTCTTT
AGTTACTTCC TAATGCGAGG GTTGCGCGGG GAAGCGGCCG ATGGCAAAGG GGTCATTAAA
GCTGATGAAC TCTATCAATA TGTTTATAAT CAAACCCTCA ATTATATTGA CAAAATTAAT
CAACAACTGC GGTTAATCAA CCAACAAAGA CGATGTCGAG GAGAATCGCA CCTTTATTCC
GAATATTCCT TACAAACCCC CAAACGCATC GTTGAAGGGG TGGGAGAACT CATTTTAGGA
TTAAAATCCC CTAGTCCATC GCCGCGCGAT CGCCGTTATG CGCTGATTGT GCAAGGATGT
CCCGAAAATC AGACGCTCAA GGCCTTAGGG GAAGTATTAA CCACAGAAGG CCAGTTTTCC
GTCACAGTCC TCTGTCCCGT GGGAAAAGAA GCCTTAGCCG TACGCCAAGA GATTCAAGCG
TTTTTGCAAC GGCGATCGCC CTTAGACCCC AATCATGACC CCCTGTCCCC CTCCGATACC
ATCTTGCTCT ATCTGCGAGG ACAGTTGGAA GTATCAGCGA CAGGAGACTC CGATTGGGTT
CTCGAAAACG GCATTAAACT GAGTCGTTCT TGGTTGAGAC AGGAATTGCG TCGCTGCCAT
TTAGCCCAAC AAATCGTCAT TTTAGACGGA TATGGAACGG GTTTACTCGA AGAGTGGATC
GAAGAACTCC AACTGGGCTT AGATCAGGGA CAATGTTTAT TAGCTGCCCT CAACCCCCAA
GGGGAACCCG ATTTGTTTGC TCAAATTCTC CTCGACTCCC TGGTGGCCGC CAATCCCCAA
GTCGGGTTAT CGGGAGCCCA ATGGCTTAGT CTATTGCAAA AAAACTGCGA ACAACTCTGT
CTACCTTTTT CCGCCTGGCT TTCGGGAATA CAAGGAGTTA TTGATATTTT ACCAGGGAAA
CATCAAGAAA CTTTATTCCC CCTTCATCCT CCTTCCCGAG TTCTTAACCC TTCTTCTGCT
ACGCTACAGC AAGAACAAAC TGCGGTCACC CCCTCACCCT CTCACCCCCT CACCCTCTCC
CCAGAACAAT ACGCCAACCT ATCAGCGTTT TTGACCCAGT TAATTGGTCC GGTGGCTGTC
CCCTTGTTGC AGTCCGCTTT AGAAGAAACA ACCAGTATAA AAGAGCTATG GCAAACCTTA
GCACAATATC TGACTCCCAC TCAAAAACCC CAATTTGATC AATGGGCGAT CGCTATGTTC
AAAGAAGAAA AGAAGATTGA CTCTCCCCCC GTCCCCCCAT CCCCCCGTCC CCCCCTCTCC
CCGTCCCCCC CTCTCCCCCT TTCCCCCGAA CAGTACAGTC AATTAGCATC GGTTCTCAAA
GCCATCATCG GACCGATCGC CCCCACCTTG TTAGACCAGA TGGCCGAACC GGATCAACCC
CCCGAAGGAT TAATCGCTCG TTTAAAAGAG TATCTAACTC CCCTTGAATG GACTGAGTTT
GAACAACAGT TAGTCTCAGT GTTCGGAAAC AACGAAACCC CTAAGCTGAC TGTTATTTCC
GAGCCTGTAC CCCCATCAGA ATTAACATCA GGGGGGTTAG ATGACCCCTT CATCCAACAG
TGTGAACAAG CGTTAACTCA ACTAATCGGG CCCGTCGCTC ACTTTATTAT GGAAACCACC
TTAGAGGATC GCCCTGGAAT AACCCGAACC GAATTGATTG AAGCGATCGC CTCTAGGATT
TCCGACCCCC AGGAATCGGC TAACTTTCGT CAACACTTTT TTCCGTAA
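
The gene sequence above is laid out in space-separated blocks of 10 bases, so the blocks need to be joined before any computation. The following is a minimal sketch in Python of that cleanup together with a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGCCTCCCA TTAGCCTTGG TACCAGTCAC TCCAAATTGG CCTTAGAAAC AGGTCAAGCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full sequence should give a length and a GC percentage that can be compared with the Gene Length and GC content fields.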
 
Protein sequence
MPPISLGTSH SKLALETGQA KLWILLIGVN DYQDLNLPRL SYPALDCQGL SDSLLEATQA 
FPQKNIIIHH DFAPQKPLLE TVKTSLETIV TSAKTQDTIL FYFSGHGLLE TQSQETVLCV
ADTQTDNLLK TGLILQELLQ RLSQCSAHCQ VIWLDACHCG NMTLRGAKGC TEEPSWDDPT
SQLLATLRQK AAKSKGFYAL LSCDQGQKSW EFPDLGHGVF SYFLMRGLRG EAADGKGVIK
ADELYQYVYN QTLNYIDKIN QQLRLINQQR RCRGESHLYS EYSLQTPKRI VEGVGELILG
LKSPSPSPRD RRYALIVQGC PENQTLKALG EVLTTEGQFS VTVLCPVGKE ALAVRQEIQA
FLQRRSPLDP NHDPLSPSDT ILLYLRGQLE VSATGDSDWV LENGIKLSRS WLRQELRRCH
LAQQIVILDG YGTGLLEEWI EELQLGLDQG QCLLAALNPQ GEPDLFAQIL LDSLVAANPQ
VGLSGAQWLS LLQKNCEQLC LPFSAWLSGI QGVIDILPGK HQETLFPLHP PSRVLNPSSA
TLQQEQTAVT PSPSHPLTLS PEQYANLSAF LTQLIGPVAV PLLQSALEET TSIKELWQTL
AQYLTPTQKP QFDQWAIAMF KEEKKIDSPP VPPSPRPPLS PSPPLPLSPE QYSQLASVLK
AIIGPIAPTL LDQMAEPDQP PEGLIARLKE YLTPLEWTEF EQQLVSVFGN NETPKLTVIS
EPVPPSELTS GGLDDPFIQQ CEQALTQLIG PVAHFIMETT LEDRPGITRT ELIEAIASRI
SDPQESANFR QHFFP
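
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch in Python, assuming the NCBI convention in which table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts; `translate` is a hypothetical helper, and the sample input is the first 30 bases of the gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with unexpected characters are reported as "X".
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCTCCCA TTAGCCTTGG TACCAGTCAC"))  # MPPISLGTSH
```

The output matches the first ten residues of the protein sequence listed above.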