Gene PCC8801_4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4197 
Symbol 
ID7104593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4401686 
End bp4402951 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content40% 
IMG OID643477183 
Productpeptidase M16 domain protein 
Protein accessionYP_002374282 
Protein GI218248911 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGAT ATCAGATTCA AGGAATTGGG GAAAAACAAG CGGTTCATCG TCTCAGGTTA 
GACAATGGTG TTGTGCTTAT TGTTGGGGAA AATCCCACAG CCGATCTAAT TGCAGGGCGA
ATTTTTTTGA AAAATGCGGG ATCTTTGTGG GAAAGCAAGG AAAAAGCTGG ACTTTCTAAT
CTTTTAGCCA CGGTTATTAC CAAAGGAACG GAACGGTTAT CCTCTGGAGA AATTGCAGAA
GCAGTGGAGT CCATTGGGGC CAGTTTGGGG GCGAATGCAG CCTCAGATTA TTTTATGATG
GGAATTAAAA CGGTATCATC AGACTTTGCA TTCATATTAG CTTTAATGGG GGAAATTTTG
CGATCGCCGA CGTTTCCTGA AGCAGAAGTC GCCTTAGAAA AACACCTGAT TATCCAAAGT
ATTCGATCTC AACAGGAACA ACCTTTTAAT GTTGCTTTTA ATCAATTACG AGGGCTCATG
TATCAAGAGC ATCCCTACGG ATTTTCTATT TTAGGAACGG AAGAAACCGT TAGTCAATTG
TGTCGAGATG ATTTGATTAA TTACCATAAT TATCATTTTA GACCTGATAA TTTAATTATT
AGTTTATCTG GCCGAATTAC CCTACATGAC GCTATTAGCC TAGTAGAAAA AACCTTAGGA
GATTGGGAAG TTCCGAGTCA TACGCTGACC CCTCTATCTT TACCACCTTT GATCTCTTCC
CCTGTGGAAA AAATCACCTT TCAAGAGACA CAACAATCGA TTGTTATGTT GGGTTATTTA
ACGGGAGGGG TTAAAAGTCC TGAGTATCCG GTTCTTAAAT TATTAAGTAC CTATTTAGGC
AACGGGTTAT CGAGTCGTTT ATTTGTTGAA TTACGGGAAA AACGTGGGTT AGCCTATGAT
GTTTCTGCCC TTTATCCCAC CCGTTTAGAA CCTTCTCAAT TTGTCGTTTA TATGGGAACC
GCACCTGATA ATACCAGCAT TGCCATCGAA GGGTTACAAC AGGAATGTGA GCGGTTATGT
TATCAAGAAT TAACCCCAGA AGAGTTGCAA GGGGCAAAAA ATAAGTTGTT AGGGCAATAC
GCTTTAGGAA AGCAAACAAA TAGCGAAATA GCTCAATTAT ATGGCTGGTA TGAAACCTTA
GGATTAGGGG TAGAATTTGA TCAGGAATTT CAGGCAATGA TCACTGAAGT AACATCTCAA
ATAGCACAAA GTGTGGCTAA AAATTATTTA CTTTCTCCCT ATCTTTCCGT TGTGGGACCC
AATTAA
 
Protein sequence
MNRYQIQGIG EKQAVHRLRL DNGVVLIVGE NPTADLIAGR IFLKNAGSLW ESKEKAGLSN 
LLATVITKGT ERLSSGEIAE AVESIGASLG ANAASDYFMM GIKTVSSDFA FILALMGEIL
RSPTFPEAEV ALEKHLIIQS IRSQQEQPFN VAFNQLRGLM YQEHPYGFSI LGTEETVSQL
CRDDLINYHN YHFRPDNLII SLSGRITLHD AISLVEKTLG DWEVPSHTLT PLSLPPLISS
PVEKITFQET QQSIVMLGYL TGGVKSPEYP VLKLLSTYLG NGLSSRLFVE LREKRGLAYD
VSALYPTRLE PSQFVVYMGT APDNTSIAIE GLQQECERLC YQELTPEELQ GAKNKLLGQY
ALGKQTNSEI AQLYGWYETL GLGVEFDQEF QAMITEVTSQ IAQSVAKNYL LSPYLSVVGP
N