Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1953 |
Symbol | |
ID | 7105173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 2030889 |
End bp | 2032232 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643475015 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_002372147 |
Protein GI | 218246776 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACAG AACAAATAAA TTTAACACCA AAAATAGAGC CTAATATTAG CGGATTTGAC GATAAAAATC CTCCTCCTAA AGAACTAATA GATGCTTGTG TACACTGTGG ATTTTGTCTA TCAACTTGTC CGAGTTATCG AGTTATTGGA AAAGAAATGG ACTCACCACG GGGAAGAATT TACCTGATGA ATGCCATTGA ACAAGGGGAA GCAGCAATTG ATGAAACAAC CTCCCAACAT TTTGATAGTT GCTTGGGGTG TTTAGCGTGT GTTAGCACTT GTCCGTCTGG GGTACAGTAT GATCAATTAA TCGCAAAAAC TCGGCCACAA GTTGAACGAA ATCAACCGAG AACGTTAAAA GACAAAATTA TTAGATCTCT AATTTTTAAT ATCTTTCCCT ATCCTCAACG GTTACAGATT TTTTTACCAT TATTGTGGCT ATATCAAAAA TTAGGGATTC AAAAATTAAT CCGTGCGACA GGGTTATTTA AAAAATTCTT TCCCCGTTTA GCAGCGATGG ATTCTATTTT GCCAGAAATT ACCCTAAAGA AATCTCAAAC CTTCCCTGAT ATTATTCCGT GTCAAGGGAC AAAACGCTAT CGCGTGGGGA TGATTTTAGG CTGTGTCCAA CGGTTATTTT TTTCCCCTGT CAATGAAGCA ACTGCCAGGG TTTTGACGGT AAATGGGTGC GAAGTTGTCA TCCCGAAAAC CCAAGGATGT TGCGCTGCTT TACCGGAACA CCAGGGACAA GAAGAACAAG CACAAACCTT AGCAAAACAA ATGATTGATA GTTTTGCAGA AACTGACGTT GATTATATTA TTATTAATGC CGCCGGGTGC GGCCATACCT TAAAAGAATA TGGTCATATT TTAGCCGATG ATCCCGACTA TAGAGAGAAG GCTAAACAGT TTTCTGAAAA AGTAAAAGAT GTTCAAGAAT TTTTAGCAGA AGTGGGGTTA ACTGCACCGT TAAACCCCTT AACTGAGGGA AAATTGATGA TGGTGTATCA GGATGCTTGC CATCTATTAC ACGGTCAAAA AATTAGCTTA CAACCGCGAC AATTATTACA AAAAATTCCA GGGATAATCT TAAAAGAACC CGTTGATGCT GCCTTATGTT GTGGTAGTGC AGGGGTTTAT AATATGTTGC AACCCGAAGT TGCTGAAGAA TTAGGACAGC AAAAAGTCAA TAATTTATTA AATACGGGAG CAACCTTAAT TGCCTCTGCT AACCCTGGAT GTTCTTTGCA GATTAAAAAA CATTTACGGT TACAGGGGAA AGAAATAACA TTAATGCACC CGATAGAGTT GTTAGATTAT TCCATTAGAG GAATACAATT ATAA
|
Protein sequence | MQTEQINLTP KIEPNISGFD DKNPPPKELI DACVHCGFCL STCPSYRVIG KEMDSPRGRI YLMNAIEQGE AAIDETTSQH FDSCLGCLAC VSTCPSGVQY DQLIAKTRPQ VERNQPRTLK DKIIRSLIFN IFPYPQRLQI FLPLLWLYQK LGIQKLIRAT GLFKKFFPRL AAMDSILPEI TLKKSQTFPD IIPCQGTKRY RVGMILGCVQ RLFFSPVNEA TARVLTVNGC EVVIPKTQGC CAALPEHQGQ EEQAQTLAKQ MIDSFAETDV DYIIINAAGC GHTLKEYGHI LADDPDYREK AKQFSEKVKD VQEFLAEVGL TAPLNPLTEG KLMMVYQDAC HLLHGQKISL QPRQLLQKIP GIILKEPVDA ALCCGSAGVY NMLQPEVAEE LGQQKVNNLL NTGATLIASA NPGCSLQIKK HLRLQGKEIT LMHPIELLDY SIRGIQL
|
| |