Gene PCC8801_3836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3836 
Symbol 
ID7102128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4014029 
End bp4015711 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content43% 
IMG OID643476841 
ProductNAD(P)H-quinone oxidoreductase subunit 4 
Protein accessionYP_002373942 
Protein GI218248571 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATCA ATCAATTCCC TTGGTTGACG GCAATGATCC TGCTACCACT TGTAGCTGCT 
TGTGGTATTC CCTTATTGCC TGATAAAGAT GGTAAATTAG TCCGTTGGTA CGCCCTAGGG
GTAGGTCTTG CGGATTTTAT CCTAATGTGC TATGTCTTTT GGAATAATTA TGATATCAGT
AACCCCACCT TTCAACTAAC AGAAAAATAC GCTTGGTTGC CCCAAATTGG ACTAAGTTGG
GCGGTTTCCG TCGATGGAAT CTCGATGCCT TTGGTCTTAT TAGCGGGACT TGTTACTACC
CTTTCTATCT TTGCAGCTTG GCAAGTTGAT CGCAAGCCAA AACTTTTTTA CTTCTTGATG
CTGCTATTAT ATTCAGCCCA AATCGGCGTA TTTGTGGCTC AGGACTTACT GTTACTCTTT
ATTATGTGGG AACTGGAATT AGTTCCCGTT TATCTTTTGG TTTCTATCTG GGGAGGACAA
AAACGCCGTT ATGCAGCAAT GAAGTTCCTT CTCTACACCG CAGCAGCTTC TATCTTTATC
TTAATAGCAG CCTTAGCCAT GGGAATCTAT GGTGGCGGTC AAATGACCTT TGATATAGTA
GAACTCGCGG CTAAAAATTA TCCTCTTGCC TTAGAATTAC CTCTGTATGC TGGATTATTG
ATTGCCTTTG GTGTCAAGTT AGCCATTTTC CCTCTACATA CTTGGTTGCC TGATGCCCAC
GGTGAAGCAT CGTCTCCTGT ATCCATGATT TTAGCCGGAG TTCTCCTAAA AATGGGAGCC
TATGGCTTAA TTCGCCTGAA CTTAGAAATG CTTTCCGATG CCCATGTCTA TTTTGCCCCG
ATTTTAGTAG TTTTAGGGGT GATTAACATT GTTTACGGCG GTTTTGCTTC TTTTGCTCAA
TCGAACATGA AACGCCGTTT AGCCTATTCC TCCGTGTCTC ACATGGGGTT TGTTTTAATC
GGTATTGCCT CCTTTAGCGA TATTGGCATC AGTGGGGCAA TGTTACAGTT AATTTCCCAC
GGGTTAATCG CTGCGGTGTT GTTCTTCCTC GCGGGGGTTA CTTACGATCG CACCCATACT
ATGCTGTTAG ACGAAATGGG AGATATTGGT CAAGTAATAC CCAAAGTTTT CGCCCTATTT
ACTATCGGTG CGATGGCTTC TTTAGCTCTC CCTGGAATGA GTGGGTTTGC CAGTGAAATT
TCGGTTTTTG TCGGTGTAAC GAGTGGTGAT GTCTATAGCT CTGTTTTCCG CACTGTAACG
GTGTTTTTAG CAGCAGTAGG GCTGATTTTA ACCCCCATCT ACTTACTATC CATGCTACGG
CAACTTTTCT ACGGGTCTGA TAAGGTATTA ACCTGTGGGT TGACCAATAC TCAATCCCTC
AATCCAGGTG AAGAAAAAGC CGTCTGTTTC GGAAATAGTT GTGTTTTACC AAGTGAGGCA
CATTTTAGCG ATGCTAAACC CCGTGAACTG TTAATTGCCT TGAGTTTCTT GGTCTTAATT
ATTGGGATTG GCTTCTATCC TAAGCTATTT ACCCAAATGT ATGACGTGAA GACCGTGGCC
ATTAATACTC AAGTCCGTCA ATCCTATCAA CAAATCGCCG AAATCAATCC TGACGTTTAT
GCCTTTGCTT CCTCCACTAA AAAAGTAACT GAAGTGGCAT CAGGGTTAGG GATTTCTCGT
TAA
 
Protein sequence
MIINQFPWLT AMILLPLVAA CGIPLLPDKD GKLVRWYALG VGLADFILMC YVFWNNYDIS 
NPTFQLTEKY AWLPQIGLSW AVSVDGISMP LVLLAGLVTT LSIFAAWQVD RKPKLFYFLM
LLLYSAQIGV FVAQDLLLLF IMWELELVPV YLLVSIWGGQ KRRYAAMKFL LYTAAASIFI
LIAALAMGIY GGGQMTFDIV ELAAKNYPLA LELPLYAGLL IAFGVKLAIF PLHTWLPDAH
GEASSPVSMI LAGVLLKMGA YGLIRLNLEM LSDAHVYFAP ILVVLGVINI VYGGFASFAQ
SNMKRRLAYS SVSHMGFVLI GIASFSDIGI SGAMLQLISH GLIAAVLFFL AGVTYDRTHT
MLLDEMGDIG QVIPKVFALF TIGAMASLAL PGMSGFASEI SVFVGVTSGD VYSSVFRTVT
VFLAAVGLIL TPIYLLSMLR QLFYGSDKVL TCGLTNTQSL NPGEEKAVCF GNSCVLPSEA
HFSDAKPREL LIALSFLVLI IGIGFYPKLF TQMYDVKTVA INTQVRQSYQ QIAEINPDVY
AFASSTKKVT EVASGLGISR