Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3491 |
Symbol | |
ID | 8392828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3559286 |
End bp | 3560425 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644981424 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003139150 |
Protein GI | 257061262 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.532364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.141748 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCTTG ATCCCCCTTT TTACCAGATT ATCTTGCTAA CTCTAGATTA TCATTTCTTA GCGAGTACCG AACCTGCTCC TTTTTTAGGG CAGGTTTGGC TCGATTTAGC GGCAATCGTC TTAATGCTGC TGATGTCTGC TTTTTTTTCC GCCTCAGAAA CCGCTATTAC TGCCTTTGAT AATTTTAAAC TCAGGGGACT CATTGAGCAT CAAGGAGATC CTTCAGGAAT TTACCGCTTA GTTCTCGAAA ATCGACGGCG TTTTATTACA AGTCTTTTAG TCGGGAATAA TCTGGTTAAT AATTTTTCGG CTGTTCTTAC GAGTAATTTA TTTGCCATTT GGTTAGGTAA TGCAGGATTA GGCATAGCAA CGGCTATTAT TACGGTTTTT ATTTTGATTT TTGGAGAAAT AACCCCAAAA TCCCTAGCTA TTCTCCATAA TCGTGCTTTT TTTCGCCTTT CCGTTCGACC TGTTTTCTGG TTGTCTCAAA TACTAACGGC GATCGCCATT GTTCCCATTT TTGAAACCAT TACCCAAAAG ACCATTCAAA TTTTTCAAGG AAAATCCGAT AAAAATGCCC ATTCTGGAGA ATCTTTGCGG GATTTACACC TAATGATCAA GATTTTGGGA GGCAAAGGGA CATTAGATTT GTACCGACAC CAGTTACTGA ACAAAGCGTT AATGCTCGAT CAGTTAATAG CGAAGGATGT GGTCAAACCC CGTATCGATA TGACTACGAT TTCCCATGAA TCTAGTTTAC AGCAATTCAT CGATTTATCT CTCGAAACAG GCTATTCTCG CATTCCCGTC CAAGGAGAAT CGAAGGATCA GATAGTTGGG ATAGTCAATC TTAAACAGGC ACTCCAGAAG CTGCAATCTG TTCCAAAACA AAGACTTTCG GAGATAGCCG TCATTGAAGC GATGGATGCA CCGATTTATA TTCCTGAAAC TAAGCGGGTC ACAAATTTGC TCAAGGAAAT GCTCCAACAA CGGTTTCATA TTGTCATTGT CGTCGATGAA TATGGCGGAA CCGTTGGTTT AGTGACCTTA GAAGACATTT TAGAAGAATT AGTCGGCGAA ATCTATGATG AAAGCGATTA TCCCTCGGTT CAGGAGTCCT TAGTTCAGCG TGATCCCTAA
|
Protein sequence | MSLDPPFYQI ILLTLDYHFL ASTEPAPFLG QVWLDLAAIV LMLLMSAFFS ASETAITAFD NFKLRGLIEH QGDPSGIYRL VLENRRRFIT SLLVGNNLVN NFSAVLTSNL FAIWLGNAGL GIATAIITVF ILIFGEITPK SLAILHNRAF FRLSVRPVFW LSQILTAIAI VPIFETITQK TIQIFQGKSD KNAHSGESLR DLHLMIKILG GKGTLDLYRH QLLNKALMLD QLIAKDVVKP RIDMTTISHE SSLQQFIDLS LETGYSRIPV QGESKDQIVG IVNLKQALQK LQSVPKQRLS EIAVIEAMDA PIYIPETKRV TNLLKEMLQQ RFHIVIVVDE YGGTVGLVTL EDILEELVGE IYDESDYPSV QESLVQRDP
|
| |