Gene PCC8801_1010 details


Gene Information

Locus tag: PCC8801_1010
Symbol:
ID: 7104235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1064033
End bp: 1065151
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 45%
IMG OID: 643474102
Product: NADH dehydrogenase subunit H
Protein accession: YP_002371242
Protein GI: 218245871
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
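
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the page does not state this) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1064033
end_bp = 1065151
gene_length_bp = 1119
protein_length_aa = 372

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the
# stop codon, which contributes no amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

The check does not depend on the strand, which is blank in this record.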


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAG GAATTGACCT ACAAGGCAGT TTTATCGAAT CTCTCAAACA ATTGGGACTT 
CCTGACGGAG TAGCCAAAGC CCTCTGGATT CCCTTACCCT CTTTTTTAAT GATTATTGGA
GCTACCGTCG GCGTATTAGT CGTGGTTTGG TTAGAACGGA AGATCTCCGC AGCCGCCCAA
CAACGCATCG GACCCGAATA TGCTGGACCG TTGGGGGTAC TTCAACCTGT AGCCGACGGG
ATCAAATTAG TGTTTAAGGA AGACATTATT CCGGCCAAAG CTGACCCTTG GCTATTTACC
CTGGGACCCG TTTTAGTGGT GCTCCCTGTT TTTGTTTCCT ATCTCATTGT TCCCTTTGGT
CAGAATTTAG TGATAACTGA CCTCAATGTT GGCATTTTTC TCTGGATTTC TCTGTCAAGC
ATTGCCCCCA TCGGGTTATT GATGTCCGGA TATGCTTCTA ATAATAAATA TTCCCTTCTG
GGGGGCTTAA GGGCAGCAGC GCAGTCTATT AGCTACGAAA TTCCCCTTGC GTTTTCTGTC
CTAGCGATCG CTATGATGTC CAATAGCCTA AGTACCATCG ATATCGTGCA ACAACAGTCA
GGATACGGTA TTTTAGGCTG GAATGTCTGG CGACAACCCG TTGGCTTAAT TATCTTTTGG
ATTGCTGCCT TAGCTGAGTG CGAACGCCTT CCCTTTGACC TTCCTGAAGC GGAAGAAGAA
ATCGTCGCAG GGTATCAAAC CGAATATTCT GGGATGAAAT TTGGGTTATT TTACGTTGGA
TCTTACGTTA ACTTGGTGTT ATCCGCCTTA GTCTTTGCTA TTCTCTATCT AGGCGGTTGG
GAATTTCCCG TTCCCCTCGA TAAATTAGCA GGATGGTTAG GAGTTAATGA TAACAGTCCT
TGGTTACAGG TGATCACGGC ATCTCTGGGG ATTACCATGA CCGTCCTTAA AGCTTATTTT
CTGGTATTTA TTGCCGTTTT GTTGCGCTGG ACAGTACCGA GGGTTCGTAT TGACCAACTC
CTGAATTTAG GCTGGAAATT CTTGCTTCCC GTATCCTTAG TAAATCTGTT ATTAACGGCA
GCCCTAAAAT TAGCGTTTCC CGTTGCTTTT GGTGGCTAA
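
The gene sequence is printed in blocks of ten bases, so any calculation on it first needs the spaces and line breaks removed. The sketch below shows one way to compute a GC percentage from such text. It assumes GC content means the simple fraction of G and C bases; the page does not say how its 45% figure was calculated or rounded. The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the ten-base grouping and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, used here only as sample input.
first_row = "ATGAACACAG GAATTGACCT ACAAGGCAGT TTTATCGAAT CTCTCAAACA ATTGGGACTT"
print(round(gc_percent(first_row), 1))
```

Passing all rows of the gene sequence as one string gives the value for the whole gene, which can then be compared with the GC content field.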
 
Protein sequence
MNTGIDLQGS FIESLKQLGL PDGVAKALWI PLPSFLMIIG ATVGVLVVVW LERKISAAAQ 
QRIGPEYAGP LGVLQPVADG IKLVFKEDII PAKADPWLFT LGPVLVVLPV FVSYLIVPFG
QNLVITDLNV GIFLWISLSS IAPIGLLMSG YASNNKYSLL GGLRAAAQSI SYEIPLAFSV
LAIAMMSNSL STIDIVQQQS GYGILGWNVW RQPVGLIIFW IAALAECERL PFDLPEAEEE
IVAGYQTEYS GMKFGLFYVG SYVNLVLSAL VFAILYLGGW EFPVPLDKLA GWLGVNDNSP
WLQVITASLG ITMTVLKAYF LVFIAVLLRW TVPRVRIDQL LNLGWKFLLP VSLVNLLLTA
ALKLAFPVAF GG
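
The protein sequence can be related to the gene sequence by translating it with translation table 11, as listed in the Gene Information section. The following is a minimal sketch rather than a full implementation: it builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G), stops at the first stop codon, and does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for NCBI translation table 11, in the standard codon order
# TTT, TTC, TTA, TTG, TCT, ... (each position cycling through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAACACAG GAATTGACCT ACAAGGCAGT"))  # MNTGIDLQGS
```

The output matches the first ten residues of the protein sequence above. Translating the full gene sequence the same way should give a protein that can be compared against the listed 372 aa.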