Gene NATL1_02221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02221 
Symbol 
ID4779479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp207250 
End bp208866 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content38% 
IMG OID640083487 
ProductNAD(P)H-quinone oxidoreductase subunit 4 
Protein accessionYP_001014051 
Protein GI124024935 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAATTC CTGAGCCAAT TCAAGCCGAT TTTCCTTGGT TAAGTTTATC CATATTGTTT 
CCCATAGTTG GAGCATTGAT TGTTCCTTTT ATTCCAGATA AAGGTGAGGG AAAAGAAGTT
CGATGGTATG CACTCATAAT TTCTTTAATT ACTTTTCTAA TTACTGTCGC TGCTTACTTC
AAAGGTTTTG ATCCAAGTCT AGAAGGCTTG CAATTATATG AAAAAGTTAG TTGGCTTCCT
GATCTTGGGT TGACTTGGTC TGTTGGTGCA GATGGACTAT CAATGCCATT GATATTGCTT
ACAAGCTTTA TAACTTCTCT TGCCGTTTTG GCGGCATGGC CAGTTAGCTA TAAACCAAAA
TTATTTTTCT TTTTAATTCT CGCCATGGAT GGCGGCCAGA TAGCTGTCTT TGCAGTTCAA
GATATGTTGC TTTTCTTCCT GGCTTGGGAA TTGGAATTGT TCCCCGTTTA TTTATTCCTT
GCAATCTGGG GCGGAAAGAA GAGGCAATAT GCGGCGACCA AATTTATTAT TTATACAGCA
GGAAGCTCGT TATTTATTCT CCTTGCTGGT CTTGCGATGG GTTTCTTTCA AGGAGGAGGG
GTGCCAGATT TCGGTTATAC CCATTTAGCT CAGCAAAATT TTGGGAGGGG TTTTCAACTG
CTTTGTTATT CAGGTTTGTT AATAGCATTT GGGGTCAAAC TTCCTATTGT TCCCCTACAT
ACTTGGTTGC CTGATGCACA TGGAGAGGCT ACTGCTCCAG TACATATGCT CTTGGCAGGA
ATCTTGTTAA AGATGGGAGG ATATGCTCTC CTTAGATTTA ATGCACAATT ACTGCCAGAT
GCTCATGCTC AATTTGCTCC ATTGTTAATT GTATTGGGAG TGGTAAATAT TATTTATGCA
GCTTTAACCT CTTTTGCTCA AAGAAATTTA AAAAGGAAAA TTGCCTACAG CTCAATAAGT
CATATGGGTT TTGTTTTAAT AGGCATTGGA AGCTTTAGCT CTTTAGGTAC TAGTGGTGCA
ATGTTGCAAA TGGTAAGCCA CGGTTTAATA GGAGCAAGCT TGTTTTTCCT TGTTGGTGCA
ACTTACGACA GAACTCATAC CCTTCAATTA GATGAGATGG GTGGTATTGG TCAAAATATG
AGGATTATGT TTGCGTTATG GACTGCATGC GCTTTCGCTT CTCTTGCTTT GCCTGGGATG
AGTGGATTTA TCTCGGAATT AATGGTTTTT GTTGGGTTTG TAACTGATGA AGTTTATACT
CTTCCATTTA GGATTGTTGT TGCTTCGTTG GCAGCAATTG GAGTCATTTT GACACCTATA
TATTTATTAT CGATGCTCAG AGAAATCTTC TTCGGAAAAG AGAATGCAAA GTTAATATCT
AAAGCAAAGT TAGTAGATGC TGAACCTAGA GAAATCTATA TTATTGCTTG TTTATTAGTT
CCCATTATTG GAATTGGTTT GTATCCAAAA ATTATGACTG ATACTTATAT TTCATCAATT
GATGGATTGG TTAAGAGAGA TTTGTTAGCC GTTGAGAGAA TTAGAAGTGA TCGGGCAACA
ATTATGAGCA ATACAAGCTT ATCAATTGGG ACTATTGAGG CTCCTCTTTT AGATTGA
 
Protein sequence
MLIPEPIQAD FPWLSLSILF PIVGALIVPF IPDKGEGKEV RWYALIISLI TFLITVAAYF 
KGFDPSLEGL QLYEKVSWLP DLGLTWSVGA DGLSMPLILL TSFITSLAVL AAWPVSYKPK
LFFFLILAMD GGQIAVFAVQ DMLLFFLAWE LELFPVYLFL AIWGGKKRQY AATKFIIYTA
GSSLFILLAG LAMGFFQGGG VPDFGYTHLA QQNFGRGFQL LCYSGLLIAF GVKLPIVPLH
TWLPDAHGEA TAPVHMLLAG ILLKMGGYAL LRFNAQLLPD AHAQFAPLLI VLGVVNIIYA
ALTSFAQRNL KRKIAYSSIS HMGFVLIGIG SFSSLGTSGA MLQMVSHGLI GASLFFLVGA
TYDRTHTLQL DEMGGIGQNM RIMFALWTAC AFASLALPGM SGFISELMVF VGFVTDEVYT
LPFRIVVASL AAIGVILTPI YLLSMLREIF FGKENAKLIS KAKLVDAEPR EIYIIACLLV
PIIGIGLYPK IMTDTYISSI DGLVKRDLLA VERIRSDRAT IMSNTSLSIG TIEAPLLD