Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4197 |
Symbol | |
ID | 7104593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 4401686 |
End bp | 4402951 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643477183 |
Product | peptidase M16 domain protein |
Protein accession | YP_002374282 |
Protein GI | 218248911 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAT ATCAGATTCA AGGAATTGGG GAAAAACAAG CGGTTCATCG TCTCAGGTTA GACAATGGTG TTGTGCTTAT TGTTGGGGAA AATCCCACAG CCGATCTAAT TGCAGGGCGA ATTTTTTTGA AAAATGCGGG ATCTTTGTGG GAAAGCAAGG AAAAAGCTGG ACTTTCTAAT CTTTTAGCCA CGGTTATTAC CAAAGGAACG GAACGGTTAT CCTCTGGAGA AATTGCAGAA GCAGTGGAGT CCATTGGGGC CAGTTTGGGG GCGAATGCAG CCTCAGATTA TTTTATGATG GGAATTAAAA CGGTATCATC AGACTTTGCA TTCATATTAG CTTTAATGGG GGAAATTTTG CGATCGCCGA CGTTTCCTGA AGCAGAAGTC GCCTTAGAAA AACACCTGAT TATCCAAAGT ATTCGATCTC AACAGGAACA ACCTTTTAAT GTTGCTTTTA ATCAATTACG AGGGCTCATG TATCAAGAGC ATCCCTACGG ATTTTCTATT TTAGGAACGG AAGAAACCGT TAGTCAATTG TGTCGAGATG ATTTGATTAA TTACCATAAT TATCATTTTA GACCTGATAA TTTAATTATT AGTTTATCTG GCCGAATTAC CCTACATGAC GCTATTAGCC TAGTAGAAAA AACCTTAGGA GATTGGGAAG TTCCGAGTCA TACGCTGACC CCTCTATCTT TACCACCTTT GATCTCTTCC CCTGTGGAAA AAATCACCTT TCAAGAGACA CAACAATCGA TTGTTATGTT GGGTTATTTA ACGGGAGGGG TTAAAAGTCC TGAGTATCCG GTTCTTAAAT TATTAAGTAC CTATTTAGGC AACGGGTTAT CGAGTCGTTT ATTTGTTGAA TTACGGGAAA AACGTGGGTT AGCCTATGAT GTTTCTGCCC TTTATCCCAC CCGTTTAGAA CCTTCTCAAT TTGTCGTTTA TATGGGAACC GCACCTGATA ATACCAGCAT TGCCATCGAA GGGTTACAAC AGGAATGTGA GCGGTTATGT TATCAAGAAT TAACCCCAGA AGAGTTGCAA GGGGCAAAAA ATAAGTTGTT AGGGCAATAC GCTTTAGGAA AGCAAACAAA TAGCGAAATA GCTCAATTAT ATGGCTGGTA TGAAACCTTA GGATTAGGGG TAGAATTTGA TCAGGAATTT CAGGCAATGA TCACTGAAGT AACATCTCAA ATAGCACAAA GTGTGGCTAA AAATTATTTA CTTTCTCCCT ATCTTTCCGT TGTGGGACCC AATTAA
|
Protein sequence | MNRYQIQGIG EKQAVHRLRL DNGVVLIVGE NPTADLIAGR IFLKNAGSLW ESKEKAGLSN LLATVITKGT ERLSSGEIAE AVESIGASLG ANAASDYFMM GIKTVSSDFA FILALMGEIL RSPTFPEAEV ALEKHLIIQS IRSQQEQPFN VAFNQLRGLM YQEHPYGFSI LGTEETVSQL CRDDLINYHN YHFRPDNLII SLSGRITLHD AISLVEKTLG DWEVPSHTLT PLSLPPLISS PVEKITFQET QQSIVMLGYL TGGVKSPEYP VLKLLSTYLG NGLSSRLFVE LREKRGLAYD VSALYPTRLE PSQFVVYMGT APDNTSIAIE GLQQECERLC YQELTPEELQ GAKNKLLGQY ALGKQTNSEI AQLYGWYETL GLGVEFDQEF QAMITEVTSQ IAQSVAKNYL LSPYLSVVGP N
|
| |