Gene NATL1_06501 details


Gene Information

Locus tag: NATL1_06501
Symbol:
ID: 4779230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 594579
End bp: 596153
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 39%
IMG OID: 640083928
Product: NAD(P)H-quinone oxidoreductase subunit 4
Protein accession: YP_001014477
Protein GI: 124025361
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
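
The coordinate and length fields in this record are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon which is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(594579, 596153)
print(length)                     # 1575
print(protein_length_aa(length))  # 524
```

Both results match the Gene Length (1575 bp) and Protein Length (524 aa) fields listed in the record.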


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACT TGCCCCAAAT TATGGATGCC AATCTGCCTA TGAGTATTGA TTCTCAGACG 
GTATTTCCAT GGCTCTCACT CATTGTGCTC CTTCCCATAG TTGGAGCATT GATAATGCCT
TTCCTCCCAA CGCAAGAAAG TAAAAATTTT AATCTTCCGA GAAATATAGC CCTAGTAATT
TTATTAGCCG ATTTTCTAAT AGTTGTATTG GCTTTTGCCA ATTTATTTGA TCCAAATAGC
GAAAGCTTGC AGTTGGTCGA AAGACTTCAG TGGCTTCCGT CTATAGGACT TGAATGGTCA
CTGGGAGTAG ATGGAATATC CGCTCCGCTT GTAGTCCTAA GCGGGCTAAT CACTTTTTTG
TCCGCAGCTG CAAGTTGGAA AGTAAAACAA AAAACAAGGC TTTACTTTGC TCTATTACTT
ATACAAGCTT CAGCCCAAGC TTTAGTTTTC CTATCGCAAG ATTTCTTGCT ATTCTTTTTA
GCTTGGGAGC TTGAGCTAGT TCCAGTCTAT CTTTTGATTG CTATTTGGGG AGGCAAAAGG
AAACTCTATG CAGCAACAAA ATTCATTCTT TACACCGCAC TTGCTTCCCT TTTGATATTA
ATTAGTGGGC TTGCTTTAGC TCTTTCTGGA GATCAATTCA CTTTCAATTT AAGTGAAATT
GCAGCCAAAT CCTTTACCGG GAATTTTGGG ATATTTTGTT ATTTAGGTTT TTTAATTGGT
TTTGGAGTAA AGCTTCCTAT CTTTCCACTT CATACCTGGC TGCCCGATGC TCATGGGGAG
GCAAATGCAC CGGTTTCAAT GCTATTAGCT GGTGTTTTAT TAAAAATGGG AGGTTACGCT
CTTATAAGAT TTAACGTCCA AATTTTACCT GATACTCATT TAATACTTGC TCCAGCTCTA
ATAATCATTG GGATAGTAAA TATTATTTAT GGAGCTCTAA ATGCTTTTGC ACAAGACAAT
GTAAAAAGGC GCATCGCTTG CAGTTCTGTG AGTCACATGG GTTTTGTTCT TGTTGGAATT
GGCGCAGTTA ATGCACTTGG AATCAGTGGA GCGATGCTCC AAATGATTAG CCATGGACTG
ATTGCTGCCG CAATGTTTTT TGTTACTGGA AGTTTTTATG AAAGAACCAA TACTCTTTCA
ATCCCTAACA TGGGCGGTTT AGCAAAGGCT TTACCAATTA CTTTCGCATT CTTCTTGGCC
AGTTCTTTAG CGTCACTTGC CTTACCAGGG ATGAGTGGAT TTATAAGTGA AATCACAGTT
TTTCTAGGCA TCACCAGTCA AGAAGGATTC ACTTCTTTAT TTAGAGCGAT AACAATTCTT
TTAGCAGCTA TCGGACTGGT TTTAACTCCT ATCTACCTTC TTTCAATGTG TAGAAGAGTA
TTTTTTGGTC CTAGAATCCC TGCATTATCC ATAGTAAAAG AAATGGATGC TAGAGAGCTT
TCAATAGGTT TAAGCCTCTT AGTTCCTACA TTAGTAATAG GTTTTTGGCC AAGAATAGCT
ATTGATTTAT ATGAGGTATC AACAAATGCT CTCGCACAAT CATTAATTAC CAATAACCTT
GTACCAATCA GTTAG
 
Protein sequence
MADLPQIMDA NLPMSIDSQT VFPWLSLIVL LPIVGALIMP FLPTQESKNF NLPRNIALVI 
LLADFLIVVL AFANLFDPNS ESLQLVERLQ WLPSIGLEWS LGVDGISAPL VVLSGLITFL
SAAASWKVKQ KTRLYFALLL IQASAQALVF LSQDFLLFFL AWELELVPVY LLIAIWGGKR
KLYAATKFIL YTALASLLIL ISGLALALSG DQFTFNLSEI AAKSFTGNFG IFCYLGFLIG
FGVKLPIFPL HTWLPDAHGE ANAPVSMLLA GVLLKMGGYA LIRFNVQILP DTHLILAPAL
IIIGIVNIIY GALNAFAQDN VKRRIACSSV SHMGFVLVGI GAVNALGISG AMLQMISHGL
IAAAMFFVTG SFYERTNTLS IPNMGGLAKA LPITFAFFLA SSLASLALPG MSGFISEITV
FLGITSQEGF TSLFRAITIL LAAIGLVLTP IYLLSMCRRV FFGPRIPALS IVKEMDAREL
SIGLSLLVPT LVIGFWPRIA IDLYEVSTNA LAQSLITNNL VPIS
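
The sequence blocks above are laid out in space-separated groups of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, in plain Python with no external libraries, of how such a block could be cleaned, checked for GC fraction, and translated. The helper names are hypothetical. The codon table is the standard one. NCBI translation table 11 uses the same codon-to-amino-acid assignments and differs in which start codons it permits, which does not matter here because this gene starts with ATG.

```python
# Standard codon table built from the NCBI base ordering T, C, A, G.
# Translation table 11 shares these assignments (it differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGGCTGACT TGCCCCAAAT TATGGATGCC AATCTGCCTA TGAGTATTGA TTCTCAGACG"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MADLPQIMDANLPMSIDSQT
```

The translated fragment agrees with the first twenty residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.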