Gene PCC8801_4169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4169 
Symbol 
ID7105992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4371164 
End bp4372531 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content47% 
IMG OID643477156 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002374255 
Protein GI218248884 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATCG CTACGGTTAA CCCAGCAACG GGGGAAACCC TAAAAACCTT TGAACCGCTA 
ACGCCATCGG AAATTGAAGC GAAACTTGCT CTCGCGGATG CAACCTTTAA ACAATACCGC
AAAACCTCAA TGGTGCAGCG TAGCCAATGG TTAAAGCAAG CAGCAGATAT TTTAGACAAA
GATAGCCAAA AATGGGGCGA ATTAATGACC TTAGAAATGG GGAAACCCAT TAAAGGCGCG
ATCGCAGAAG CTAAAAAATG CGCCCTCGTC TGCCGTTATT ATGCCGAAAA CGCTCCTGAA
TTTCTCAAAG ATACTCCCGT TTCTACCGAT GCTAGTCGTA GTTTTATTCG CTACCAACCA
TTAGGTATTA TTTTAGCGGT TATGCCTTGG AATTTTCCTT TCTGGCAAGT TTTTCGCTTT
GCAGCCCCCG CTTTAATGGC CGGAAACGTT GGTATCCTCA AACACGCTTC TAACGTCCCT
CAATGCGCTT TAGCCATTGA AACCATTCTT AAATCCGCCG GATTTCCTGA AGGAGCGTTT
CAAACGCTGT TAATTACCGC CAACCAAGTA GAAGCAGTGA TCAACGATGA TCGCGTCAAA
GCAGCAACCC TAACGGGAAG CGAATATGCA GGGGCAAGTC TAGCCTCAGC CGCCGGTAAA
CACATCAAAA AAACCGTCCT TGAATTGGGG GGGAGCGACC CGTTTATTGT TCTCGAAAGT
GCGGACTTGG AAGCAGCCGC GACGACTGCT GTTACCGCCC GAATGCTCAA TAACGGACAA
TCTTGTATCG CAGCAAAACG GTTTATCCTA GTAGATGCGA TCGCCGATCG CTTTGAACAG
TTGTTAGCCG AAAAATTCCA AACCTTGAAA GTGGGTGATC CCCAGTCAGA GGATACTGAC
ATTGGTCCGT TGGCCACTGC TTCTATTCGG CAAGAGATTG AACAGCAAGT TCAAGAAACG
GTAACAAAAG GGGCTAAAAT TGTGATTGGA GGCCAATCTT ACCGCGATCG TCCTGGTAAC
TTCTACCCGC CTACCATTTT AAAAGATATT CCGATGGATT CCCCTGGTTA TAGCGATGAG
TTCTTTGGAC CGGTGGCTTT ACTCTTTCGG GTTAAAGATA TCGACGAAGC AATCGAATTA
GCTAATAGTA CCATTTTTGG TTTGGGCGCG AGTGGGTGGA CTCACGATGC GACAGAACAA
GAACGGTTAA TCGAGGAAAT TGAGTCTGGA TGCGTCTTTA TCAATGGGAT GGTTAAATCC
GATCCCCGTC TGCCCTTTGG GGGAATCAAG CGATCGGGTT ACGGACGGGA ATTGAGTAGC
CAAGGCATTC AGGAATTTGT TAATGTCAAA ACCGTTTGGA TTAAGTGA
 
Protein sequence
MGIATVNPAT GETLKTFEPL TPSEIEAKLA LADATFKQYR KTSMVQRSQW LKQAADILDK 
DSQKWGELMT LEMGKPIKGA IAEAKKCALV CRYYAENAPE FLKDTPVSTD ASRSFIRYQP
LGIILAVMPW NFPFWQVFRF AAPALMAGNV GILKHASNVP QCALAIETIL KSAGFPEGAF
QTLLITANQV EAVINDDRVK AATLTGSEYA GASLASAAGK HIKKTVLELG GSDPFIVLES
ADLEAAATTA VTARMLNNGQ SCIAAKRFIL VDAIADRFEQ LLAEKFQTLK VGDPQSEDTD
IGPLATASIR QEIEQQVQET VTKGAKIVIG GQSYRDRPGN FYPPTILKDI PMDSPGYSDE
FFGPVALLFR VKDIDEAIEL ANSTIFGLGA SGWTHDATEQ ERLIEEIESG CVFINGMVKS
DPRLPFGGIK RSGYGRELSS QGIQEFVNVK TVWIK