Gene Information |
Locus tag | PCC8801_4107 |
Symbol | |
ID | 7101898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 4303687 |
End bp | 4305078 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643477096 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002374195 |
Protein GI | 218248824 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
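The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are inclusive 1-based coordinates and that the CDS ends in a single stop codon not counted in the protein length. The function names are illustrative only and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4303687, 4305078)
print(length)                  # 1392
print(protein_length(length))  # 463
```

Both results match the Gene Length (1392 bp) and Protein Length (463 aa) fields of this record.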
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATAGTT TGACTGTTAG TGAAGTATTG CTACGCCTTG TGTCAGTTTT CTTGCTGATT CTCATTAATG CTTTTTTTGT GACAGCAGAA TTTGCTATCG TATCGGTACG GCGATCGCGC ATTAGTCAAT TAGTCGTTGC CGGAGATATC CAGGCACAAA GCGTTCAATC GTTACAAAGG AGTATTGATC GCCTCCTCTC AACCACCCAA TTAGGTATTA CTCTATCGAG TTTAGCCTTG GGTTGGATAG GAGAGGGAAC CATGGCAGTT TTAGTCCGCT ATCTCCTCAA ACACCTTCCT TTGTCTGATC AATGGACAAA CACCCTATCC CATAGTTTTG CTATTCCTAT CGCTTTTTTT GCCCTAGCTT ATCTACAAAT TGTGCTAGGG GAACTGTGTC CCAAGTCCGT CGCATTAATT TATTCGGAAA AGTTAGCCCG ATTTTTAGGG ACACCCATTG GAGTCATTGC TCGAATTTTC CATCCTTTTA TTTGGATTTT AAACCAATCG ACCCGTTATT TACTCCTGAG CATTGGCATT GAGTACACGG GAGACAAACG GTATAATCAA GTCACATCCG AGGAACTGCA GTTAATTATT GCCACAGAAG GAGAATCAAC AGGATTAGAA GCCAAAGAAC GAGCCTTGCT CAAAAATATT TTTGAATTTG GGAATGTAGC TGCCGTTGAG GTGATGGTTC CCCGTACTCA ATTAGTAGCC ATTTCTGAGG AAGCAACCTT CAGTGACTTA TTAGAAGAAG TGACCAAAAC TGGACATTCT CGCTACCCCG TTACAGGAGA ATCCCTAGAC GATATTTTAG GCTTTATTGA TTTTAAAGAT TTGGCTTTCC CCTTAGCGCG AGGGGAATTA ACCCCAGAGG CTTCTTTTCG TCGTTGGCTC AAACCGGTAA AATTTGTCTC TGAATCGATG CCTTTAGATG AATTACTCTC GTTAATGCAG CGATCGCAGT TAAAAATGGT GATTGTGGTT GATGAATTTG GAGGAACGTC CGGATTAATT ACAATTCAAG ATTTAATTGC TGAAATTATC GATAGTGATT TAGAAGATAA TATAACAGAA AATATTGCCC TACAAATGCT AGATGAGCAC ACCTTTTTAG TAGAAGCTCA GATCAATCTG GAGGATCTTA ATACGGTTTT AGATTTAGAT TTACCCCTAA CGGATGAATA CCAAACCCTA GGAGGGTTTT TACTCTATCA ATGGCAAAAA ATTCCTCACG TCGGAGAAAC CCTTGCCTAT AATAATTTAG AGTTCACAGT CGTCGCGGCG GATGAACCTC GTTTATTACA AATTCGTCTC CATCGTCAAA ATCCTCCTAA TCAAAATCCT TTAGATAACA TGGTTAATAC AGAATCAATA GAAGACACAT AA
|
Protein sequence | MDSLTVSEVL LRLVSVFLLI LINAFFVTAE FAIVSVRRSR ISQLVVAGDI QAQSVQSLQR SIDRLLSTTQ LGITLSSLAL GWIGEGTMAV LVRYLLKHLP LSDQWTNTLS HSFAIPIAFF ALAYLQIVLG ELCPKSVALI YSEKLARFLG TPIGVIARIF HPFIWILNQS TRYLLLSIGI EYTGDKRYNQ VTSEELQLII ATEGESTGLE AKERALLKNI FEFGNVAAVE VMVPRTQLVA ISEEATFSDL LEEVTKTGHS RYPVTGESLD DILGFIDFKD LAFPLARGEL TPEASFRRWL KPVKFVSESM PLDELLSLMQ RSQLKMVIVV DEFGGTSGLI TIQDLIAEII DSDLEDNITE NIALQMLDEH TFLVEAQINL EDLNTVLDLD LPLTDEYQTL GGFLLYQWQK IPHVGETLAY NNLEFTVVAA DEPRLLQIRL HRQNPPNQNP LDNMVNTESI EDT
|
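The sequences above are printed in space-separated blocks of ten characters, so whitespace has to be stripped before any computation. The following is a minimal Python sketch of that cleanup, plus a GC-fraction helper and a translation using the codon assignments of translation table 11 (the table listed in this record). All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers. The sketch assumes the input contains only A, C, G and T, and it does not handle alternative start codons, which is not an issue here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):  # trailing partial codon is ignored
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record:
print(translate("ATGGATAGTT TGACTGTTAG"))  # MDSLTV
```

The printed residues match the first six residues of the protein sequence above (MDSLTV). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.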