Gene Information |
Locus tag | Cyan8802_4147 |
Symbol | |
ID | 8393498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4274133 |
End bp | 4275524 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644982062 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003139774 |
Protein GI | 257061886 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
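The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a protein length that excludes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Cyan8802_4147.
length_bp = gene_length(4274133, 4275524)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the two results agree with the listed Gene Length and Protein Length fields.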
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.263369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGTT TGACTGTTAG TGAAGTATTG CTACGCCTTG TGTCAGTTTT CTTGCTGATT CTCATTAATG CTTTTTTTGT GACAGCAGAA TTTGCTATCG TATCGGTACG GCGATCGCGC ATTAGTCAAT TAGTCGTTGC CGGAGATATC CAGGCACAAA GCGTTCAATC GTTACAAAGG AGTATTGATC GCCTCCTCTC AACCACCCAA TTAGGTATTA CTCTATCGAG TTTAGCCTTG GGTTGGATAG GAGAGGGAAC CATGGCAGTT TTAGTCCGCT ATCTCCTCAA ACACCTTCCT TTGTCTGATC AATGGACAAA CACCCTATCC CATAGTTTCG CTATTCCTAT CGCTTTTTTC GCCCTAGCTT ATCTACAAAT TGTGCTAGGG GAACTGTGTC CCAAGTCCGT CGCATTAATT TATTCGGAAA AGTTAGCCCG ATTTTTAGGG ACACCCATTG GAGTCATTGC TCGAATTTTC CATCCTTTTA TTTGGATTTT AAACCAATCG ACCCGTTATT TACTCCTGAG CATTGGCATT GAGTACACGG GAGACAAACG GTATAATCAA GTCACGTCTG AAGAACTTCA GTTAATTATT GCCACAGAAG GAGAATCAAC GGGATTAGAA GCCAAAGAAC GAGCATTACT CAAAAATATT TTTGAATTTG GCAATGTAGC CGCCGTTGAG GTGATGGTTC CCCGTACTCA ATTAGTAGCC ATTTCTGAGG AAGCAACCTT CAGTGACTTA TTAGAAGAAG TGACGAAAAC TGGACATTCT CGCTACCCCG TTACAGGAGA ATCCCTAGAC GATATTTTAG GCTTTATTGA TTTTAAAGAT TTGGCTTTCC CCTTAGCGCG AGGGGAATTA ACCCCAGAGG CTTCTTTTCG TCGTTGGCTC AAACCGGTTA AATTTGTCTC TGAATCGATG CCTTTAGATG AATTGCTCTC GTTAATGCAG CGATCGCAGT TAAAAATGGT CATTGTGGTT GATGAATTTG GGGGAACGTC CGGATTAATT ACAATTCAAG ATTTAATTGC TGAAATTATC GATAGTGATT TAGAAGATAA TATAACAGAA AATATTGCCC TACAAATGCT AGATGAGCAC ACCTTTTTAG TAGAAGCTCA GATCAATCTG GAGGATCTTA ATACGGTTTT AGATTTAGAT TTACCCCTAA CGGATGAATA CCAAACCCTA GGAGGGTTTT TACTCTATCA ATGGCAAAAA ATCCCTCATG TTGGAGAAAC CCTTGCCTAT AATAATTTAG AGTTTACAGT CGTCGCTGCT GATGAACCAC GCTTATTACA AATTCGTCTC CATCGTCAAA ATCCTCCTAA TCAAAATCCT TTAGATGACA TGGTTAATAC AGAATCAATA GAAGACACAT AA
|
Protein sequence | MDSLTVSEVL LRLVSVFLLI LINAFFVTAE FAIVSVRRSR ISQLVVAGDI QAQSVQSLQR SIDRLLSTTQ LGITLSSLAL GWIGEGTMAV LVRYLLKHLP LSDQWTNTLS HSFAIPIAFF ALAYLQIVLG ELCPKSVALI YSEKLARFLG TPIGVIARIF HPFIWILNQS TRYLLLSIGI EYTGDKRYNQ VTSEELQLII ATEGESTGLE AKERALLKNI FEFGNVAAVE VMVPRTQLVA ISEEATFSDL LEEVTKTGHS RYPVTGESLD DILGFIDFKD LAFPLARGEL TPEASFRRWL KPVKFVSESM PLDELLSLMQ RSQLKMVIVV DEFGGTSGLI TIQDLIAEII DSDLEDNITE NIALQMLDEH TFLVEAQINL EDLNTVLDLD LPLTDEYQTL GGFLLYQWQK IPHVGETLAY NNLEFTVVAA DEPRLLQIRL HRQNPPNQNP LDDMVNTESI EDT
|
| |
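The protein sequence is the translation of the gene sequence. The sketch below illustrates this on the first 30 bases of the record. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons are permitted). The `translate` helper is a hypothetical illustration, not the tool the database uses.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGATAGTT TGACTGTTAG TGAAGTATTG"))
```

The output corresponds to the first ten residues of the listed protein sequence.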