Gene PCC8801_1527 details

Gene Information

Locus tag: PCC8801_1527
Symbol:
ID: 7104167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1599419
End bp: 1600684
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 50%
IMG OID: 643474600
Product: transposase, IS605 OrfB family
Protein accession: YP_002371737
Protein GI: 218246366
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
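
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a CDS that ends in a stop codon, and the helper names `gene_length` and `protein_length` are hypothetical, not part of any database tool.

```python
# Values copied from the Gene Information record.
start_bp = 1599419
end_bp = 1600684


def gene_length(start, end):
    """Inclusive span length in bp (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1266
print(protein_length(length_bp))  # 421
```

Both results agree with the Gene Length and Protein Length fields of the record.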


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGAAGAAT CCCACGCGGT TCTACCCGTG GGAGTGTCAA TCTATAAACT AGCTGACCTG 
ATCGAAGCCA ATAAAGAGGA ATTAGCTCGC TTAGAAACCC TCGATAATGG AAAACCGCTC
ACAGACTCCC TCAATGCCGA TTTATCCCTG GTTATCGCTT GCTATCGCTA TTATGCGGGT
TGGGCGGATA AAGTGCAAGG AAAAACCATT CCCATCAACG GCCCCTATTT TTGCTACACT
CGCCATGAAC CGGTGGGAGT CGTCGGTCAA ATTATCCCTT GGAATTTCCC CCTGTTGATG
CAAGCATGGA AATTAGCCCC CGCTTTAGCC ATGGGGAACA CGGTGGTCAT GAAAACCGCC
GAACAAACTC CCCTATCCGC ATTACGGGTC GGAGAATTGA TCCTAGAAGC AGGGTTCCCC
CCTGGAGTCG TCAACTTACT TTCAGGATAT GGTCCCACGG CAGGACAAGC GATCGCCCGT
CACAGGGATA TTGATAAAGT TGCCTTTACG GGGTCTACAG AAGTGGGACA CCTGATCATG
GAAGCAGCCG CCCAAAGTAA CCTCAAGCGC GTTACCTTGG AATTAGGTGG GAAAAGCCCT
AATATTGTCT TTGCTGACGC GAATTTTGAG GAAGCCATCG AAGGATCTCA TCAGGGGCTG
TTTTTTAACC AAGGACAGTG TTGTTGTGCG GGATCTCGGC TATTTGTCGA AGAGTCCTGT
TATGACGAAT TTGTGACCAA AAGTGTCGAA CGAGCCCGCA GTCGTCGCGT CGGTGATCCC
TTCGATAGCA ACACCGAACA GGGGCCCCAA GTCGATCAAG AACAGTTTAA CAAGGTGATG
GGCTATATCG AGTCAGGACA GCGCGACGGG GCTCAGATGC TGTGTGGTGG GGGTCGTTTG
GGCGATCGCG GTTATTTTAT CGAGCCCACA GTGTTTGCGG GGGTTCGTGA TGATATGAAA
ATTGCCCAGG AGGAGGTTTT TGGACCGGTG ATGAGTATTA TCAAGTTTAA AGACGTTGAG
GAGGTCATTC AACGGGCAAA TAATACGATC TATGGCTTAG CTGCTGCGGT TTGGACTAAA
GATATTACCA AAGCTCATGC GATCGCTAAT GGAGTCCGTG CGGGTACAGT TTGGGTCAAT
TGTTACGATG TCTTCGATGC GGCTGCACCT TTTGGTGGGT TCAAACAGTC TGGTATGGGT
CGAGAATTGG GTGAATACGG ACTGCAACAG TACACCGAGG TTAAGACGGT GACGATTAAA
TTGTAA
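
The sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any analysis. As a minimal sketch, the code below strips that layout and computes a GC fraction. It uses only the first row of the listing as an excerpt, and the helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Excerpt only: the first row of the gene sequence listing.
first_row = "ATGGAAGAAT CCCACGCGGT TCTACCCGTG GGAGTGTCAA TCTATAAACT AGCTGACCTG"

seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Passing the complete listing to `clean_sequence` instead of the excerpt would give the figure to compare against the record's GC content field.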
 
Protein sequence
MEESHAVLPV GVSIYKLADL IEANKEELAR LETLDNGKPL TDSLNADLSL VIACYRYYAG 
WADKVQGKTI PINGPYFCYT RHEPVGVVGQ IIPWNFPLLM QAWKLAPALA MGNTVVMKTA
EQTPLSALRV GELILEAGFP PGVVNLLSGY GPTAGQAIAR HRDIDKVAFT GSTEVGHLIM
EAAAQSNLKR VTLELGGKSP NIVFADANFE EAIEGSHQGL FFNQGQCCCA GSRLFVEESC
YDEFVTKSVE RARSRRVGDP FDSNTEQGPQ VDQEQFNKVM GYIESGQRDG AQMLCGGGRL
GDRGYFIEPT VFAGVRDDMK IAQEEVFGPV MSIIKFKDVE EVIQRANNTI YGLAAAVWTK
DITKAHAIAN GVRAGTVWVN CYDVFDAAAP FGGFKQSGMG RELGEYGLQQ YTEVKTVTIK
L