Gene A9601_01671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_01671 
Symbol 
ID4716851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp156647 
End bp158251 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content35% 
IMG OID640077866 
ProductNAD(P)H-quinone oxidoreductase subunit 4 
Protein accessionYP_001008562 
Protein GI123967704 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGGGAA CTTTGGGCGC TGGATTGTCT AATTTTCCTT GGTTATCTGC TTCAATTTTA 
TTCCCAATTG GTAGTGCATT TGTGATACCT TTTTTTCCAG ATAAAGGGGA TGGCAAAGAG
GTGAGATGGT TTGCATTGTC TATTGCATTA ATTACTTTTT TAATAACTGT AGGTTCATAC
ATAAATGGCT TTGATATTAG TAATGAAAAT GTTCAACTTA AAGAAAATAT TAGTTGGCTC
CCTGATTTAG GTCTTACTTG GTCTGTTGGC GCTGATGGTA TGTCTATGCC GTTAATATTA
TTGACTAGTT TTATAACTGC TTTAGCAGTT CTTGCTGCAT GGCCAGTAAA GTTCAAACCA
AAGTTATTTT TCTTTTTAAT ATTGGTTATG GATGGTGGGC AAATCGCTGT GTTTGCCGTA
CAAGATATGC TTTTATTCTT TCTAACTTGG GAACTTGAGT TAATTCCTGT TTATTTATTA
CTCGCTATAT GGGGTGGCAA AAATCGACAA TATGCTGCGA CAAAATTCAT TATCTATACA
GCTGGTAGTT CTATCTTTAT TCTTCTTGCC GCGTTAGCAA TGGGTTTCTA TGGTACAGAA
ATTCCTAACT TTGAGTTTTC TCACTTGGCA GCTCAAGATT TTAGTCAAAA ATTCCAAATT
TTATGCTATG TAGGGCTTTT AATTGCATTT GGTGTGAAAC TTCCAATAGT ACCCCTGCAT
ACTTGGCTTC CAGATGCTCA TGGAGAGGCT ACAGCTCCAG TTCATATGCT TCTAGCGGGA
ATTTTATTAA AGATGGGAGG ATATGCTCTT TTAAGATTTA ATGCACAATT ATTACCCGTC
GCTCATGCTC AATTTGCTCC ATTATTGATA GTTCTAGGGG TAGTCAATAT CATTTATGCT
GCATTAACTT CTTTTGCTCA AAGAAATCTT AAAAGAAAAA TTGCATATAG TTCGATAAGT
CATATGGGTT TCGTTCTTAT TGGAATAGGC AGTTTCAGTA GCCTTGGAAC AAGTGGAGCT
ATGCTGCAAA TGGTTAGTCA TGGATTAATC GGTGCAAGTT TATTTTTTCT TGTTGGTGCT
ACCTATGACA GAACAAAAAC TCTTAAACTT GATGAAATGA GTGGTGTAGG ACAAAAAATG
AGAATCATGT TTGCCTTATG GACTGCTTGC TCATTGGCTT CTCTTGCTTT GCCTGGTATG
AGCGGATTTG TTTCCGAATT GATGGTTTTT ACAGGATTTG TTACTGATGA AGTGTATACT
CTTCCTTTTA GGGTAGTGAT GGCTTCTTTA GCAGCTATCG GTGTAATACT TACTCCTATT
TATCTACTTT CAATGTTACG AGAAATTTTC TTTGGTAAAG AAAATCCTAA ATTAATAGAA
GAACGAAAAC TCATAGATGC AGAGCCAAGG GAAGTTTATA TTATTGCCTG TTTACTTTTA
CCGATTATTG GAATAGGTTT ATACCCAAGA TTAGTTACTG AAAGTTATAT TGCATCTATC
AATAATTTAG TCGATAGAGA TTTAACTGCC ATTAAAAGTG CTGCTAAAGC AAATATTTTT
TCAGGAACTA AAAAAAATGA TATCCTAAAA GCTCCAACAA TATAA
 
Protein sequence
MLGTLGAGLS NFPWLSASIL FPIGSAFVIP FFPDKGDGKE VRWFALSIAL ITFLITVGSY 
INGFDISNEN VQLKENISWL PDLGLTWSVG ADGMSMPLIL LTSFITALAV LAAWPVKFKP
KLFFFLILVM DGGQIAVFAV QDMLLFFLTW ELELIPVYLL LAIWGGKNRQ YAATKFIIYT
AGSSIFILLA ALAMGFYGTE IPNFEFSHLA AQDFSQKFQI LCYVGLLIAF GVKLPIVPLH
TWLPDAHGEA TAPVHMLLAG ILLKMGGYAL LRFNAQLLPV AHAQFAPLLI VLGVVNIIYA
ALTSFAQRNL KRKIAYSSIS HMGFVLIGIG SFSSLGTSGA MLQMVSHGLI GASLFFLVGA
TYDRTKTLKL DEMSGVGQKM RIMFALWTAC SLASLALPGM SGFVSELMVF TGFVTDEVYT
LPFRVVMASL AAIGVILTPI YLLSMLREIF FGKENPKLIE ERKLIDAEPR EVYIIACLLL
PIIGIGLYPR LVTESYIASI NNLVDRDLTA IKSAAKANIF SGTKKNDILK APTI