Gene PCC8801_3403 details


Gene Information

Locus tag: PCC8801_3403
Symbol:
ID: 7103103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3548238
End bp: 3549284
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 45%
IMG OID: 643476418
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_002373527
Protein GI: 218248156
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
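
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Gene Information fields of this record.
    length = gene_length(3548238, 3549284)
    print(length)                  # 1047
    print(protein_length(length))  # 348
```

Both results agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields listed here.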


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACAG TATTGGCAAT TGAAACAAGT TGTGACGAAA CCGCCGTCGC AATTGTAAAT 
AATCGTAACG TTTGTAGTAG TGTAGTGGCT TCCCAAATCG CCCTCCATAA GACCTATGGC
GGTGTGGTTC CTGAGATGGC TTCCCGTGAA CATTTAATCA CCATTAATGC TTGTTTGGAA
GAAGCCTTAG CTCAATCTAA TCTCAGTTGG TCGGATATTG ATGGGGTTGC CGCTACCATG
GCCCCTGGTT TAGTAGGTGC TTTAATGGTG GGGGCAACCA CCGCTAAAAC CCTGGCCATT
GTTCATCAAA AGCCCTTTGT TGGGGTGCAT CACCTCGAAG GTCATATCTA TGCCACTTAT
TTGAGCGATC CTACCTGGGA ACCCCCGTTT TTATGTCTTT TGGTGTCAGG GGGTCATACT
AGCCTAATTT GGGTCAAAGA TTGCGGGTTC TATGAACAAT TGGGGGCTAC TCGTGATGAT
GCGGCCGGGG AGGCCTTCGA TAAGGTAGCA CGGTTACTCA ATTTGGGCTA TCCAGGGGGA
CCAGTGATCG ATCGCTTGGC TAAAACAGGC AACCCGCAAG CCTTTGCTTT ACCAGAGGGA
CGAGTTTCTT TACCCGAAGG GGGTTATCAT CCCTATGATT CCAGTTTTAG TGGCTTAAAA
ACCGCCGTAT TACGGTTAGT TCAAACCCTA GAAAAAGACG ATAAAAATAG TTTGCCTGTG
GCAGATTTGG CGGCCAGTTT TCAATCAACT GTAGCGCGAT CACTGACTAA AAAAAGTATC
GCTTGTGCTT TGGATTATGG CATTAATTCT ATTGCTGTTG GTGGTGGAGT TGCCGCCAAT
AGTGAACTGA GAAAACAATT ACAAGAAGCG GGAATTAACC ACAATATCAA AGTGCATTTT
CCCCCTTTAA AATGGTGTAC TGATAATGCA GCAATGATCG GTTGTGCTGC TGCGGATCAT
CTCAATAGAG GTCATACTTC TTCTTTGAGT TTGAATGTTA ATTCTCGATT ATCTATTACC
GATGTGATGC AGCTTTATGA ATTTTAA
 
Protein sequence
MATVLAIETS CDETAVAIVN NRNVCSSVVA SQIALHKTYG GVVPEMASRE HLITINACLE 
EALAQSNLSW SDIDGVAATM APGLVGALMV GATTAKTLAI VHQKPFVGVH HLEGHIYATY
LSDPTWEPPF LCLLVSGGHT SLIWVKDCGF YEQLGATRDD AAGEAFDKVA RLLNLGYPGG
PVIDRLAKTG NPQAFALPEG RVSLPEGGYH PYDSSFSGLK TAVLRLVQTL EKDDKNSLPV
ADLAASFQST VARSLTKKSI ACALDYGINS IAVGGGVAAN SELRKQLQEA GINHNIKVHF
PPLKWCTDNA AMIGCAAADH LNRGHTSSLS LNVNSRLSIT DVMQLYEF
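
The sequences above are laid out in space-separated blocks of ten residues, so they need to be cleaned before any downstream use. The following is a minimal sketch, with hypothetical helper names, that strips the layout whitespace, computes a GC fraction, and applies a rough CDS sanity check. The check accepts only ATG as a start codon, which is a simplification: translation table 11 also allows alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (simplified), standard stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First row of the gene sequence above, pasted with its original spacing.
    first_row = (
        "ATGGCAACAG TATTGGCAAT TGAAACAAGT "
        "TGTGACGAAA CCGCCGTCGC AATTGTAAAT"
    )
    seq = clean_sequence(first_row)
    print(len(seq))
    print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_cds` gives a quick way to compare against the GC content and Gene Length fields of the record.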