Gene PCC8801_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4388 
Symbol 
ID7104839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4609222 
End bp4610802 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content45% 
IMG OID643477367 
ProductNAD(P)H-quinone oxidoreductase subunit 4 
Protein accessionYP_002374466 
Protein GI218249095 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATTG CCAATTTTCC CTGGTTAACT ACAATAATCC TGTTTCCTAT TGTTGCCGCC 
TTGTTTATTC CTATTATTCC AGACAAGGAC GGGAAAACCG TTAGATGGTA CTCTTTAACG
ATTGGACTCA TCGATTTTGC GGTCATTGTT TATGCTTTTT GCACAGGCTA TGACTTCAAT
AATCCCAAGC TGCAATTATT TGAAAGTTAT GCTTGGGTTC CTCAACTTGA TTTGAATTGG
TCGGTGGGGG CTGATGGCTT ATCGATGCCC CTCATTTTGC TGACCGGGTT TATCACCACG
TTAGCCATTA TGGCTGCCTG GCCAGTGACG TTTAAACCCA AGTTATTCTA TTTCCTGATG
TTGTTGATGT ACGGGGGACA AATCGCCGTT TTTGCGGTAC AAGATATGTT ACTCTTCTTC
CTCGTTTGGG AATTGGAGTT AGTTCCCGTC TATCTCATCC TCTCTATCTG GGGAGGAAAA
CGCCGTCTTT ACGCAGCAAC CAAGTTTATC CTCTACACCG CCGGAGGCTC GTTATTTATC
CTAGTTGCAG CCTTAACCAT GGCCTTCTAT GGAGATAATA CCACCTTTGA CATGGTAGCG
ATCGCGGGTA AAGACTTCCC CCTTAAACTG CAATTATTCC TCTATGGAGG CTTTCTGATC
GCCTACGGGG TCAAATTACC GATTTTTCCC CTCCATACAT GGCTACCGGA TGCCCACGGA
GAAGCAACCG CCCCTGCCCA TATGTTACTC GCGGGTATTC TCCTAAAAAT GGGAGGCTAT
GCCTTATTAC GGATGAATGT CGGGATGTTA CCCGATGCCC ATGGGGTTTT TGCCCCCATT
TTGGTTATTT TAGGGGTTGT CAATATTGTT TATGCGGCCT TAACCTCCTT TGCCCAACGG
AACCTCAAAC GAAAAATCGC CTATTCTTCG ATTTCTCACA TGGGGTTTGT CTTAATTGGG
ATGGCTTCCT TTACCTCTTT AGGAACCAGT GGGGCGATGT TACAGATGAT TTCCCACGGA
CTCATTGGGG CAAGTCTCTT CTTTATGGTC GGCTGTACCT ACGATCGCAC CCATACCCTG
ATGTTAGATG AAATGGGCGG GGTGGGCAAA AAGATGAAGA AAGTCTTTGC CATGTGGACA
ACCTGTTCCA TGGCCTCCTT AGCCCTCCCT GGAATGAGTG GTTTTGTGGC AGAATTAATG
GTTTTTGTGG GATTTGCTAC CAGTGATGCC TACAATTCTA CTTTTAAGGT CATTGCTATC
TTTTTAGCTG CCGTTGGGGT CATTTTAACG CCAATTTATC TCCTCTCCAT GCTACGCGAA
ATGCTTTATG GACCAGAAAA TGAAGAATTA GTTTCTCATA CCAAGTTAAT TGATGCCGAA
CCGCGGGAAG TTTTCATTAT TGGTTGCTTA TTAATTCCCA TCATTGGTAT TGGCTTGTAC
CCGAAAATTG TTACCCAAAT TTATGATACA ACCACCAATC AACTAACGGC CTTAATGCGC
GGTTCTGTCC CCAGTTTAGT CCAAAAAGCG GAACTTTCCC CTAGTCATCA AATAGCTTTC
CAAGCCCCTG CAATTAAGTA G
 
Protein sequence
MDIANFPWLT TIILFPIVAA LFIPIIPDKD GKTVRWYSLT IGLIDFAVIV YAFCTGYDFN 
NPKLQLFESY AWVPQLDLNW SVGADGLSMP LILLTGFITT LAIMAAWPVT FKPKLFYFLM
LLMYGGQIAV FAVQDMLLFF LVWELELVPV YLILSIWGGK RRLYAATKFI LYTAGGSLFI
LVAALTMAFY GDNTTFDMVA IAGKDFPLKL QLFLYGGFLI AYGVKLPIFP LHTWLPDAHG
EATAPAHMLL AGILLKMGGY ALLRMNVGML PDAHGVFAPI LVILGVVNIV YAALTSFAQR
NLKRKIAYSS ISHMGFVLIG MASFTSLGTS GAMLQMISHG LIGASLFFMV GCTYDRTHTL
MLDEMGGVGK KMKKVFAMWT TCSMASLALP GMSGFVAELM VFVGFATSDA YNSTFKVIAI
FLAAVGVILT PIYLLSMLRE MLYGPENEEL VSHTKLIDAE PREVFIIGCL LIPIIGIGLY
PKIVTQIYDT TTNQLTALMR GSVPSLVQKA ELSPSHQIAF QAPAIK