Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_2697 |
Symbol | |
ID | 8392023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 2725458 |
End bp | 2726468 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644980656 |
Product | pentapeptide repeat protein |
Protein accession | YP_003138392 |
Protein GI | 257060504 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.836174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTCAG AAGTCCCTAT GAAAGCCAAT GAACTGATTG AGCGTTATTC GTCTGGAGAA ACTCACTTTA ATGGGTTAAA ATTACCTGGA ATTAACCTAG TTGGTGCGGA TTTAATTGGC ATTATTTTTA ATGAAGCCGA TCTTCATGGA GCGAATTTTT TATTAGCTTA CCTCAACCGA GCCAGCTTTA CCCAAGCGAA TTTAGTAGAA ACCAATTTAA GCGGAGCTAA TCTCAGTCAA GCCGATCTTA GTGGAGCCGA TCTTCGCAGT GCTATCTTAC ATGGAGCCAT TTTACAAGGA GCTAATCTTA GGGATACGGA TATTACCCTA GCCATCCTTT TAGACGCTAA TCTAGTCGCA GCAGATTTAC GCGGAGCCGA TTTGAGTGGG GCAACCCTGA CAGGGGCTTG TCTGCGGGGG GCAAATATGC GCCAGGAGAA AAAAAGTTAC TATACCAATC TCCAGGCAGT TAATTTGACC AAAGCCGACC TTCAAGGAGC AAATATGAAG GGGGTTGATC TTAGTCGTGC CAATCTGACG GGAGCCAATC TCAAAGAAGC TAACCTGCGA GACTCCGATC TTCGCAAAGC CGATCTCACC GATGCTAATC TTAAAGGAGC GTTACTCACA GATACCAATT TTACCGGGGC TAAACTCACA GGAGCCAATC TAACGAATGC TAATTTAGTC CGAGCCCAGA TGTCCCATAC TGATATGGTA GGTGTAATGG CCAAGGGTTC TGTGATGACC CATGCTGATT TGAGTCGTGC CAATCTCAGT CAAGCGAATT TAGACCTAAG TCGCATGAAT CATGCTGATT TGAGCCGCTC TAATTTATCA GGAGCCAGTT TTAAGGATGC TGAGTTAGTC GAGGTTTTCT TAGCTAAAGC TAATCTGATG GGAGCTAATT TAACCCAAGC CAATTTAACT CGGGCTGAGT TGATGAGTGC TAATTTAACG GGTGCGATTT TGCGCGGGGC AACCATGCCA GATGGTCGCG TTCGGGATTA A
|
Protein sequence | MRSEVPMKAN ELIERYSSGE THFNGLKLPG INLVGADLIG IIFNEADLHG ANFLLAYLNR ASFTQANLVE TNLSGANLSQ ADLSGADLRS AILHGAILQG ANLRDTDITL AILLDANLVA ADLRGADLSG ATLTGACLRG ANMRQEKKSY YTNLQAVNLT KADLQGANMK GVDLSRANLT GANLKEANLR DSDLRKADLT DANLKGALLT DTNFTGAKLT GANLTNANLV RAQMSHTDMV GVMAKGSVMT HADLSRANLS QANLDLSRMN HADLSRSNLS GASFKDAELV EVFLAKANLM GANLTQANLT RAELMSANLT GAILRGATMP DGRVRD
|
| |