Gene Cyan8802_4209 details

Gene Information

Locus tag: Cyan8802_4209
Symbol:
ID: 8393560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4343370
End bp: 4344737
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 47%
IMG OID: 644982121
Product: Aldehyde Dehydrogenase
Protein accession: YP_003139833
Protein GI: 257061945
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
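
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4343370
END_BP = 4344737
GENE_LENGTH_BP = 1368
PROTEIN_LENGTH_AA = 455


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 4344737 - 4343370 + 1 = 1368 bp, and 1368 / 3 = 456 codons, of which one is the stop codon, leaving 455 aa.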


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTATCG CTACGGTTAA CCCAGCAACG GGGGAAACCC TAAAAACCTT TGAACCGCTA 
ACGCCATCGG AAATTGAAGC GAAACTTGCT CTCGCGGATG CAACCTTTAA ACAATACCGC
AAAACCTCAA TGGTGCAGCG TAGCCAATGG TTAAAGCAAG CAGCAGATAT TTTAGACAAA
GATAGCCAAA AATGGGGCGA ATTAATGACC TTAGAAATGG GGAAACCCAT TAAAGGCGCG
ATCGCAGAAG CTAAAAAATG CGCCCTCGTC TGCCGTTATT ATGCCGAAAA CGCTCCTGAA
TTTCTCAGAG ATACTCCCGT TTCTACCGAT GCTAGTCGTA GTTTTATTCG CTACCAACCA
TTAGGTATTA TTTTAGCGGT TATGCCTTGG AATTTTCCTT TCTGGCAAGT TTTTCGCTTT
GCAGCCCCCG CTTTAATGGC CGGAAACGTT GGTATCCTCA AACACGCTTC TAACGTCCCT
CAATGCGCTT TAGCCATTGA AACCATTCTT AAATCCGCCG GATTTCCTGA AGGAGCGTTT
CAAACGCTGT TAATTACCGC CAACCAAGTA GAAGCAGTGA TCAACGATGA TCGCGTCAAA
GCAGCAACCC TAACGGGAAG CGAATATGCA GGGGCAAGTC TAGCCTCAGC CGCCGGTAAA
CACATCAAAA AAACCGTCCT TGAATTGGGG GGGAGCGACC CGTTTATTGT TCTCGAAAGT
GCGGACTTGG AAGCAGCCGC GACGACTGCT GTTACCGCCC GAATGCTCAA TAACGGACAA
TCTTGTATCG CAGCAAAACG GTTTATCCTA GTAGATGCGA TCGCCGATCG CTTTGAACAG
TTGTTAGCCG AAAAATTCCA AACCTTGAAA GTGGGTGATC CCCAGTCAGA GGATACTGAC
ATTGGTCCGT TGGCCACTGC TTCTATTCGG CAAGAGATTG AACAGCAAGT TCAAGAAACG
GTAACAAAAG GGGCTAAAAT TGTGATTGGA GGCCAATCTT ACCGCGATCG TCCTGGTAAC
TTCTACCCGC CTACCATTTT AAAAGATATT CCGATGGATT CCCCTGGTTA TAGCGATGAG
TTCTTTGGAC CGGTGGCTTT ACTCTTTCGG GTTAAAGATA TCGACGAAGC AATCGAATTA
GCTAATAGTA CCATTTTTGG TTTGGGCGCG AGTGGGTGGA CTCACGATGC GACAGAACAA
GAACGGTTAA TCGAGGAAAT TGAGTCTGGA TGCGTCTTTA TCAATGGGAT GGTTAAATCC
GATCCCCGTC TGCCCTTTGG GGGAATCAAG CGATCGGGTT ACGGACGGGA ATTGAGTAGC
CAAGGCATTC AGGAATTTGT TAATGTCAAA ACCGTTTGGA TTAAGTGA
 
Protein sequence
MGIATVNPAT GETLKTFEPL TPSEIEAKLA LADATFKQYR KTSMVQRSQW LKQAADILDK 
DSQKWGELMT LEMGKPIKGA IAEAKKCALV CRYYAENAPE FLRDTPVSTD ASRSFIRYQP
LGIILAVMPW NFPFWQVFRF AAPALMAGNV GILKHASNVP QCALAIETIL KSAGFPEGAF
QTLLITANQV EAVINDDRVK AATLTGSEYA GASLASAAGK HIKKTVLELG GSDPFIVLES
ADLEAAATTA VTARMLNNGQ SCIAAKRFIL VDAIADRFEQ LLAEKFQTLK VGDPQSEDTD
IGPLATASIR QEIEQQVQET VTKGAKIVIG GQSYRDRPGN FYPPTILKDI PMDSPGYSDE
FFGPVALLFR VKDIDEAIEL ANSTIFGLGA SGWTHDATEQ ERLIEEIESG CVFINGMVKS
DPRLPFGGIK RSGYGRELSS QGIQEFVNVK TVWIK
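
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for amino acids (the two differ in permitted start codons). The helper names are hypothetical, and only the first line of the gene sequence is used as input here.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGTATCG CTACGGTTAA CCCAGCAACG GGGGAAACCC TAAAAACCTT TGAACCGCTA"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MGIATVNPATGETLKTFEPL
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.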