Gene Cyan8802_4450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4450 
Symbol 
ID8393802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4599915 
End bp4601495 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content45% 
IMG OID644982358 
ProductNAD(P)H-quinone oxidoreductase subunit 4 
Protein accessionYP_003140069 
Protein GI257062181 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.487224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTG CCAATTTTCC CTGGTTAACT ACAATAATCC TGTTTCCTAT TGTTGCCGCC 
TTGTTTATTC CTATTATTCC AGACAAGGAC GGGAAAACCG TTAGATGGTA CTCTTTAACG
ATTGGACTCA TCGATTTTGC GGTCATTGTT TATGCTTTTT GCACAGGCTA TGACTTCAAT
AATCCCAAGC TGCAATTATT TGAAAGTTAT GCTTGGGTTC CTCAACTTGA TTTGAATTGG
TCGGTGGGGG CTGATGGCTT ATCGATGCCC CTCATTTTGC TGACCGGGTT TATCACCACG
TTAGCCATTA TGGCTGCCTG GCCAGTGACG TTTAAACCCA AGTTATTCTA TTTCCTGATG
TTGTTGATGT ACGGGGGACA AATCGCCGTT TTTGCGGTAC AAGATATGTT ACTCTTCTTC
CTCGTTTGGG AATTGGAGTT AGTTCCCGTC TATCTCATCC TCTCTATCTG GGGAGGAAAA
CGCCGTCTTT ACGCAGCAAC CAAGTTTATC CTCTACACCG CCGGAGGCTC GTTATTTATC
CTAGTTGCAG CCTTAACCAT GGCCTTCTAT GGAGATAATA CCACCTTTGA CATGGTAGCG
ATCGCGGGTA AAGACTTCCC CCTCAAACTG CAATTATTCC TCTATGGAGG CTTTCTCATC
GCCTACGGGG TCAAATTACC GATTTTTCCC CTCCATACAT GGCTACCGGA TGCCCACGGA
GAAGCAACCG CCCCTGCCCA TATGTTACTC GCGGGTATTC TCCTAAAAAT GGGAGGCTAT
GCCTTATTAC GGATGAATGT CGGGATGTTA CCCGATGCCC ATGGGGTTTT TGCCCCCATT
TTGGTTATTT TAGGGGTTGT CAATATTGTT TATGCGGCCT TAACCTCCTT TGCCCAACGG
AACCTCAAAC GAAAAATCGC CTATTCTTCG ATTTCTCACA TGGGGTTTGT CTTAATTGGG
ATGGCTTCCT TTACCTCTTT AGGAACCAGT GGGGCGATGT TACAGATGAT TTCCCACGGA
CTCATTGGGG CAAGTCTCTT CTTTATGGTC GGCTGTACCT ACGATCGCAC CCATACCCTG
ATGTTAGATG AAATGGGCGG GGTGGGCAAA AAGATGAAGA AAGTCTTTGC CATGTGGACA
ACCTGTTCCA TGGCCTCCTT AGCCCTCCCT GGAATGAGTG GTTTTGTGGC AGAATTAATG
GTTTTTGTGG GATTTGCTAC CAGTGATGCC TACAATTCTA CTTTTAAGGT CATTGCTATC
TTTTTAGCTG CCGTTGGGGT CATTTTAACG CCAATTTATC TCCTCTCCAT GCTACGCGAA
ATGCTTTATG GACCAGAAAA TGAAGAATTA GTTTCTCATA CCAAGTTAAT TGATGCCGAA
CCGCGGGAAG TTTTCATTAT TGGTTGCTTA TTAATTCCCA TCATTGGTAT TGGCTTGTAC
CCGAAAATTG TTACCCAAAT TTATGATACA ACCACCAATC AACTAACGGC CTTAATGCGC
GGTTCTGTCC CCAGTTTAGT CCAAAAAGCG GAACTTTCCC CTAGTCATCA AATAGCTTTC
CAAGCCCCTG CAATTAAGTA G
 
Protein sequence
MDIANFPWLT TIILFPIVAA LFIPIIPDKD GKTVRWYSLT IGLIDFAVIV YAFCTGYDFN 
NPKLQLFESY AWVPQLDLNW SVGADGLSMP LILLTGFITT LAIMAAWPVT FKPKLFYFLM
LLMYGGQIAV FAVQDMLLFF LVWELELVPV YLILSIWGGK RRLYAATKFI LYTAGGSLFI
LVAALTMAFY GDNTTFDMVA IAGKDFPLKL QLFLYGGFLI AYGVKLPIFP LHTWLPDAHG
EATAPAHMLL AGILLKMGGY ALLRMNVGML PDAHGVFAPI LVILGVVNIV YAALTSFAQR
NLKRKIAYSS ISHMGFVLIG MASFTSLGTS GAMLQMISHG LIGASLFFMV GCTYDRTHTL
MLDEMGGVGK KMKKVFAMWT TCSMASLALP GMSGFVAELM VFVGFATSDA YNSTFKVIAI
FLAAVGVILT PIYLLSMLRE MLYGPENEEL VSHTKLIDAE PREVFIIGCL LIPIIGIGLY
PKIVTQIYDT TTNQLTALMR GSVPSLVQKA ELSPSHQIAF QAPAIK