Gene A9601_01781 details


Gene Information

Locus tag: A9601_01781
Symbol: ndhA
ID: 4716862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 164936
End bp: 166054
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 36%
IMG OID: 640077877
Product: NADH dehydrogenase subunit H
Protein accession: YP_001008573
Protein GI: 123967715
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
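
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relation. It assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed length, and that the final codon of the CDS is a stop codon that is not counted as a residue. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 164936
end_bp = 166054


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


gene_len = gene_length_bp(start_bp, end_bp)
print(gene_len)                     # 1119
print(protein_length_aa(gene_len))  # 372
```

Both results agree with the Gene Length (1119 bp) and Protein Length (372 aa) fields.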


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.236332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAATACG GATTAGATCT CGAATATAGT TTTAATGAAT TTTTAAAAGG CTTTGGCCTT 
TCCAGCGAAA TCGCCCATAT AATTTGGCTC CCTCTGCCAA TGCTTTTGGT TTTGGTAGCG
GCAGTAGTTG GTGTTTTAGT AACAGTTTGG CTTGAAAGAA AAATATCTGC TGCTGCTCAG
CAAAGAATAG GTCCTGAATA TGCAGGAGCA CTTGGCGTAC TCCAACCAAT TGCAGATGGG
CTTAAGTTAC TTGTTAAAGA GGATATTATT CCTGCTAAAG CGGATGGAAT TCTCTTTACT
GCAGGACCTA TATTAGTACT TGTCCCAGTG ATTCTATCCT GGTTAATTGT TCCTTTTGGA
CAAAATCTTT TAATAAGTAA TGTTGGTATT GGAATTTTCC TATGGATTGC TTTAAGCAGT
ATCCAGCCAA TAGGACTTCT CATGAGCGGA TATGCATCAA ATAATAAGTA TTCTTTATTA
GGAGGTTTAA GAGCAGCAGC TCAATCAATT AGTTATGAAA TTCCTCTAGC TTTATCTGTA
CTAGCTATTG TACTAATGAC AAATTCTCTA AGTACTATTG ACATTGTCAA CCAACAAAGT
GGTGCTGGAA TCTTAAGTTG GAATATATGG AGACAACCAG TTGGTTTTAT AGTCTTTTGG
ATTTGTGCTC TTGCAGAATG TGAGAGACTT CCATTTGACT TACCCGAAGC TGAAGAAGAA
TTAGTTGCAG GATATCAAAC TGAATATGCA GGGATGAAAT TCGCATTGTT CTACCTAGGT
AGTTACATTA ATCTAATCCT TTCAGCTTTA TTGGTATCAA TACTTTATTT GGGAGGATGG
GGTTTTCCTG TTCCAGTTGA ATTAATAGCT AAGTTTCTAA ACTTGCCCAT TAATGCACCC
TTTTTACAAG TGTTCACTGC ATCAATAGGA ATTGTAATGA CTGTATTGAA AGCATATCTT
TTAGTTTTCA TTGCAATATT ATTGCGTTGG ACAACTCCTA GAGTAAGAAT AGATCAACTA
TTAGACCTTG GATGGAAGTT TCTTCTTCCA ATTTCTCTTG CTAATCTTTT GATAACTGCA
GGATTAAAAC TTGCTTTTCC GCAATTCTTT GGTGGTTAA
 
Protein sequence
MEYGLDLEYS FNEFLKGFGL SSEIAHIIWL PLPMLLVLVA AVVGVLVTVW LERKISAAAQ 
QRIGPEYAGA LGVLQPIADG LKLLVKEDII PAKADGILFT AGPILVLVPV ILSWLIVPFG
QNLLISNVGI GIFLWIALSS IQPIGLLMSG YASNNKYSLL GGLRAAAQSI SYEIPLALSV
LAIVLMTNSL STIDIVNQQS GAGILSWNIW RQPVGFIVFW ICALAECERL PFDLPEAEEE
LVAGYQTEYA GMKFALFYLG SYINLILSAL LVSILYLGGW GFPVPVELIA KFLNLPINAP
FLQVFTASIG IVMTVLKAYL LVFIAILLRW TTPRVRIDQL LDLGWKFLLP ISLANLLITA
GLKLAFPQFF GG
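
The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted start codon, and an initiator codon is read as methionine. The sketch below illustrates this on the first three 10-base blocks of the gene sequence. It is a minimal hand-built translator, not the tool that produced this record: `translate_cds` and the constant names are hypothetical, and the codon table is written out from the standard code, which table 11 shares for all non-initiator codons.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiator codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # initiator codon is translated as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
first_blocks = "TTGGAATACG GATTAGATCT CGAATATAGT"
print(translate_cds(first_blocks))  # MEYGLDLEYS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence through the same function should give the 372-residue protein, since the sequence ends in the stop codon TAA.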