Gene Cyan8802_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1039 
Symbol 
ID8390348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1064263 
End bp1065381 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content45% 
IMG OID644979054 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_003136807 
Protein GI257058919 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000216191 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACACAG GAATTGACCT ACAAGGCAGT TTTATCGAAT CTCTCAAACA ATTGGGACTT 
CCTGACGGAG TAGCCAAAGC CCTCTGGATT CCCTTACCCT CTTTTTTAAT GATTATTGGA
GCTACCGTCG GCGTATTAGT CGTGGTTTGG TTAGAACGGA AGATCTCCGC AGCCGCCCAA
CAACGCATCG GACCCGAATA TGCTGGACCG TTGGGGGTAC TTCAACCTGT AGCCGACGGG
ATCAAATTAG TGTTTAAGGA AGACATTATT CCGGCCAAAG CTGACCCTTG GCTATTTACC
CTGGGACCCG TTTTAGTGGT GCTCCCTGTT TTTGTTTCCT ATCTCATTGT TCCCTTTGGT
CAGAATTTAG TGATAACTGA CCTCAATGTT GGCATTTTTC TCTGGATTTC TCTGTCAAGC
ATTGCCCCCA TCGGGTTATT GATGTCCGGA TATGCTTCTA ATAATAAATA TTCCCTTCTG
GGGGGCTTAA GGGCAGCAGC GCAGTCTATT AGCTACGAAA TTCCCCTTGC GTTTTCTGTC
CTAGCGATCG CTATGATGTC CAATAGCCTA AGTACCATCG ATATCGTGCA ACAACAGTCA
GGATACGGTA TTTTAGGCTG GAATGTCTGG CGACAACCCG TTGGCTTAAT TATCTTCTGG
ATTGCTGCCT TAGCTGAGTG CGAACGTCTT CCCTTTGACC TTCCTGAAGC GGAAGAAGAA
ATCGTCGCAG GGTATCAAAC CGAATATTCT GGGATGAAAT TTGGGTTATT TTACGTTGGA
TCTTACGTTA ACTTGGTGTT ATCCGCCTTA GTCTTTGCTA TTCTCTATCT AGGCGGTTGG
GAATTTCCCG TTCCCCTCGA TAAATTAGCA GGATGGTTAG GAGTTAATGA TAACAGTCCT
TGGTTACAGG TGATCACGGC ATCTCTGGGG ATTACCATGA CCGTCCTTAA AGCTTATTTT
CTGGTATTTA TTGCCGTTTT GTTGCGCTGG ACAGTACCGA GGGTTCGTAT TGACCAACTC
CTGAATTTAG GCTGGAAATT CTTGCTTCCC GTATCCTTAG TAAATCTGTT ATTAACGGCA
GCCCTAAAAT TAGCGTTTCC CGTTGCTTTT GGTGGCTAA
 
Protein sequence
MNTGIDLQGS FIESLKQLGL PDGVAKALWI PLPSFLMIIG ATVGVLVVVW LERKISAAAQ 
QRIGPEYAGP LGVLQPVADG IKLVFKEDII PAKADPWLFT LGPVLVVLPV FVSYLIVPFG
QNLVITDLNV GIFLWISLSS IAPIGLLMSG YASNNKYSLL GGLRAAAQSI SYEIPLAFSV
LAIAMMSNSL STIDIVQQQS GYGILGWNVW RQPVGLIIFW IAALAECERL PFDLPEAEEE
IVAGYQTEYS GMKFGLFYVG SYVNLVLSAL VFAILYLGGW EFPVPLDKLA GWLGVNDNSP
WLQVITASLG ITMTVLKAYF LVFIAVLLRW TVPRVRIDQL LNLGWKFLLP VSLVNLLLTA
ALKLAFPVAF GG