Gene Information |
Locus tag | PCC8801_1527 |
Symbol | |
ID | 7104167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 1599419 |
End bp | 1600684 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643474600 |
Product | transposase, IS605 OrfB family |
Protein accession | YP_002371737 |
Protein GI | 218246366 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAT CCCACGCGGT TCTACCCGTG GGAGTGTCAA TCTATAAACT AGCTGACCTG ATCGAAGCCA ATAAAGAGGA ATTAGCTCGC TTAGAAACCC TCGATAATGG AAAACCGCTC ACAGACTCCC TCAATGCCGA TTTATCCCTG GTTATCGCTT GCTATCGCTA TTATGCGGGT TGGGCGGATA AAGTGCAAGG AAAAACCATT CCCATCAACG GCCCCTATTT TTGCTACACT CGCCATGAAC CGGTGGGAGT CGTCGGTCAA ATTATCCCTT GGAATTTCCC CCTGTTGATG CAAGCATGGA AATTAGCCCC CGCTTTAGCC ATGGGGAACA CGGTGGTCAT GAAAACCGCC GAACAAACTC CCCTATCCGC ATTACGGGTC GGAGAATTGA TCCTAGAAGC AGGGTTCCCC CCTGGAGTCG TCAACTTACT TTCAGGATAT GGTCCCACGG CAGGACAAGC GATCGCCCGT CACAGGGATA TTGATAAAGT TGCCTTTACG GGGTCTACAG AAGTGGGACA CCTGATCATG GAAGCAGCCG CCCAAAGTAA CCTCAAGCGC GTTACCTTGG AATTAGGTGG GAAAAGCCCT AATATTGTCT TTGCTGACGC GAATTTTGAG GAAGCCATCG AAGGATCTCA TCAGGGGCTG TTTTTTAACC AAGGACAGTG TTGTTGTGCG GGATCTCGGC TATTTGTCGA AGAGTCCTGT TATGACGAAT TTGTGACCAA AAGTGTCGAA CGAGCCCGCA GTCGTCGCGT CGGTGATCCC TTCGATAGCA ACACCGAACA GGGGCCCCAA GTCGATCAAG AACAGTTTAA CAAGGTGATG GGCTATATCG AGTCAGGACA GCGCGACGGG GCTCAGATGC TGTGTGGTGG GGGTCGTTTG GGCGATCGCG GTTATTTTAT CGAGCCCACA GTGTTTGCGG GGGTTCGTGA TGATATGAAA ATTGCCCAGG AGGAGGTTTT TGGACCGGTG ATGAGTATTA TCAAGTTTAA AGACGTTGAG GAGGTCATTC AACGGGCAAA TAATACGATC TATGGCTTAG CTGCTGCGGT TTGGACTAAA GATATTACCA AAGCTCATGC GATCGCTAAT GGAGTCCGTG CGGGTACAGT TTGGGTCAAT TGTTACGATG TCTTCGATGC GGCTGCACCT TTTGGTGGGT TCAAACAGTC TGGTATGGGT CGAGAATTGG GTGAATACGG ACTGCAACAG TACACCGAGG TTAAGACGGT GACGATTAAA TTGTAA
Protein sequence | MEESHAVLPV GVSIYKLADL IEANKEELAR LETLDNGKPL TDSLNADLSL VIACYRYYAG WADKVQGKTI PINGPYFCYT RHEPVGVVGQ IIPWNFPLLM QAWKLAPALA MGNTVVMKTA EQTPLSALRV GELILEAGFP PGVVNLLSGY GPTAGQAIAR HRDIDKVAFT GSTEVGHLIM EAAAQSNLKR VTLELGGKSP NIVFADANFE EAIEGSHQGL FFNQGQCCCA GSRLFVEESC YDEFVTKSVE RARSRRVGDP FDSNTEQGPQ VDQEQFNKVM GYIESGQRDG AQMLCGGGRL GDRGYFIEPT VFAGVRDDMK IAQEEVFGPV MSIIKFKDVE EVIQRANNTI YGLAAAVWTK DITKAHAIAN GVRAGTVWVN CYDVFDAAAP FGGFKQSGMG RELGEYGLQQ YTEVKTVTIK L
| |
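The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: the helper names (`clean_sequence`, `span_length`, `expected_protein_length`, `gc_percent`) are hypothetical and not part of any database API, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace that splits the displayed sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS with a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the record above.
gene_length = span_length(1599419, 1600684)
print(gene_length)                           # 1266, matches "Gene Length"
print(expected_protein_length(gene_length))  # 421, matches "Protein Length"
```

Passing the full "Gene sequence" string to `gc_percent` gives a value to compare with the reported GC content; the block spacing in the displayed sequence is removed by `clean_sequence` first.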