Gene A9601_04901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_04901 
SymbolndhB 
ID4717188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp423947 
End bp425467 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content36% 
IMG OID640078202 
ProductNAD(P)H-quinone oxidoreductase subunit 2 
Protein accessionYP_001008885 
Protein GI123968027 
COG category[C] Energy production and conversion 
COG ID[COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N) 
TIGRFAM ID[TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.592214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCAACG AAATCTTTAC AATTAATTTA AATGCTCAAG CCATTATTCC AGAGGCTTTT 
ATTTTACTAG GTATTGTTGG AACACTTCTT GTAGATTTAG CTGGAGAAAA AACTGCATCG
AAGTGGGCAC CAATAATTTG CTATTTGTCA ATCGGAAGCT CTCTTGTTAG CTTGGCATTG
CAGTGGAGTA ATCCGGTAGA AAACGCATTC CTTGGATCCT TTAATTCAGA TAATTTGGCA
ATCGCATTTA GAGCAATAAT TTCTTTATCA ACCTTGATTT CTTTACTTAT AAGTTGGCGG
TATACAGAAC AAAGTGGAAG CCCAATTGGA GAGTTTGCTG CGATAGTTCT TTCAGCCACA
CTTGGAGCAA TGCTTTTATG TGGATCTACT GACCTTATTA GTGTATTTAT TTCTCTTGAA
ACTTTATCTG TAGCAAGCTA CTTACTTTCT GGCTACCTCA AGAGAGATCC AAGAAGTTCA
GAAGCGGCCT TAAAATACCT CCTTGTCGGA TCAGCTGCTG CTGCTGTCTA TTTGTATGGA
TCCTCTTTTC TTTATGGATT AAGTGGTTCA ACAAACTTAG CGACAATAGG TTTAGAGATT
ATCAATAAGC CATCCTTCAT AACTTCACTA GCACTTGTAT TTGTCTTATC AACAGTTGCA
TTTAAAATTG CTGCTGTTCC CTTTCATCAA TGGACTCCTG ACGTATATGA GGGTTCACCT
ACTCCTGTAG TAGCTTTTTT ATCTGTTGGT TCAAAAACAG CGGGCTTTGC ATTTGCGATA
AGAATATTAA GCACAACTTT CTCTTCTTTT GACGAAGAAT GGAAACTTTT ATTTACCATT
TTGGCAATAT TGAGCATGGC TCTAGGAAAT GTTGTAGCTC TAGCTCAAAC CTCAATGAAA
AGGATGCTAG CTTACAGTTC TATTGGACAA GCAGGATTTG TAATGATTGG AATAGTATCT
GGCACACAAG ATGGTTTATC AGCTGCTGTT TTATATTTGG CTGCATATTT ATTTATGAAT
TTGGGTGCAT TTTCTTGTGT AATACTTTTC TCACTAAGAA CTGGTTCTGA CAGAATTCTT
GATTACTCAG GACTTTACCA AAAAGATCCT CTCATTACAT TAGGCTTAAG CCTTTGTCTT
CTATCACTTG GAGGTTTACC TCCAATGTTG GGATTTTTTG GAAAAATATA CTTGTTCTTT
GCAGGTTGGG CCAATCATCA ATATCTACTA GTTATTGTTG GATTAGTAAC TTCAGTTATA
TCTATTTATT ACTACATTTC AGTGATAAAA ATGATGGTAG TTAAAGAACC ACAGGAAGCT
TCTGAAATAG TCAAATCATA TCCTGAAATT AATTGGGGAA TTGAAGGATT ACCTCCCTTG
AGAATTGCAC TTTACACTTG CGTAGCGGTA ACTGCTCTTG GAGGAATCCT ATCTAATCCT
CTTTTTAAAT TAGCCAATAC AGCAGTTTCA GAAACTCCTT TTTTACAAGA TATTATTGCT
ATAGCAAACA ATATTTCCTA G
 
Protein sequence
MPNEIFTINL NAQAIIPEAF ILLGIVGTLL VDLAGEKTAS KWAPIICYLS IGSSLVSLAL 
QWSNPVENAF LGSFNSDNLA IAFRAIISLS TLISLLISWR YTEQSGSPIG EFAAIVLSAT
LGAMLLCGST DLISVFISLE TLSVASYLLS GYLKRDPRSS EAALKYLLVG SAAAAVYLYG
SSFLYGLSGS TNLATIGLEI INKPSFITSL ALVFVLSTVA FKIAAVPFHQ WTPDVYEGSP
TPVVAFLSVG SKTAGFAFAI RILSTTFSSF DEEWKLLFTI LAILSMALGN VVALAQTSMK
RMLAYSSIGQ AGFVMIGIVS GTQDGLSAAV LYLAAYLFMN LGAFSCVILF SLRTGSDRIL
DYSGLYQKDP LITLGLSLCL LSLGGLPPML GFFGKIYLFF AGWANHQYLL VIVGLVTSVI
SIYYYISVIK MMVVKEPQEA SEIVKSYPEI NWGIEGLPPL RIALYTCVAV TALGGILSNP
LFKLANTAVS ETPFLQDIIA IANNIS