Gene Information |
Locus tag | NATL1_02221 |
Symbol | |
ID | 4779479 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 207250 |
End bp | 208866 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083487 |
Product | NAD(P)H-quinone oxidoreductase subunit 4 |
Protein accession | YP_001014051 |
Protein GI | 124024935 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
|
|
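The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that adds no amino acid. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 207250
end_bp = 208866
gene_length_bp = 1617
protein_length_aa = 538

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last codon is assumed
# to be the stop codon, which contributes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_aa = codons - 1

print(span_bp, remainder, expected_aa)  # 1617 0 538
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.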
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAATTC CTGAGCCAAT TCAAGCCGAT TTTCCTTGGT TAAGTTTATC CATATTGTTT CCCATAGTTG GAGCATTGAT TGTTCCTTTT ATTCCAGATA AAGGTGAGGG AAAAGAAGTT CGATGGTATG CACTCATAAT TTCTTTAATT ACTTTTCTAA TTACTGTCGC TGCTTACTTC AAAGGTTTTG ATCCAAGTCT AGAAGGCTTG CAATTATATG AAAAAGTTAG TTGGCTTCCT GATCTTGGGT TGACTTGGTC TGTTGGTGCA GATGGACTAT CAATGCCATT GATATTGCTT ACAAGCTTTA TAACTTCTCT TGCCGTTTTG GCGGCATGGC CAGTTAGCTA TAAACCAAAA TTATTTTTCT TTTTAATTCT CGCCATGGAT GGCGGCCAGA TAGCTGTCTT TGCAGTTCAA GATATGTTGC TTTTCTTCCT GGCTTGGGAA TTGGAATTGT TCCCCGTTTA TTTATTCCTT GCAATCTGGG GCGGAAAGAA GAGGCAATAT GCGGCGACCA AATTTATTAT TTATACAGCA GGAAGCTCGT TATTTATTCT CCTTGCTGGT CTTGCGATGG GTTTCTTTCA AGGAGGAGGG GTGCCAGATT TCGGTTATAC CCATTTAGCT CAGCAAAATT TTGGGAGGGG TTTTCAACTG CTTTGTTATT CAGGTTTGTT AATAGCATTT GGGGTCAAAC TTCCTATTGT TCCCCTACAT ACTTGGTTGC CTGATGCACA TGGAGAGGCT ACTGCTCCAG TACATATGCT CTTGGCAGGA ATCTTGTTAA AGATGGGAGG ATATGCTCTC CTTAGATTTA ATGCACAATT ACTGCCAGAT GCTCATGCTC AATTTGCTCC ATTGTTAATT GTATTGGGAG TGGTAAATAT TATTTATGCA GCTTTAACCT CTTTTGCTCA AAGAAATTTA AAAAGGAAAA TTGCCTACAG CTCAATAAGT CATATGGGTT TTGTTTTAAT AGGCATTGGA AGCTTTAGCT CTTTAGGTAC TAGTGGTGCA ATGTTGCAAA TGGTAAGCCA CGGTTTAATA GGAGCAAGCT TGTTTTTCCT TGTTGGTGCA ACTTACGACA GAACTCATAC CCTTCAATTA GATGAGATGG GTGGTATTGG TCAAAATATG AGGATTATGT TTGCGTTATG GACTGCATGC GCTTTCGCTT CTCTTGCTTT GCCTGGGATG AGTGGATTTA TCTCGGAATT AATGGTTTTT GTTGGGTTTG TAACTGATGA AGTTTATACT CTTCCATTTA GGATTGTTGT TGCTTCGTTG GCAGCAATTG GAGTCATTTT GACACCTATA TATTTATTAT CGATGCTCAG AGAAATCTTC TTCGGAAAAG AGAATGCAAA GTTAATATCT AAAGCAAAGT TAGTAGATGC TGAACCTAGA GAAATCTATA TTATTGCTTG TTTATTAGTT CCCATTATTG GAATTGGTTT GTATCCAAAA ATTATGACTG ATACTTATAT TTCATCAATT GATGGATTGG TTAAGAGAGA TTTGTTAGCC GTTGAGAGAA TTAGAAGTGA TCGGGCAACA ATTATGAGCA ATACAAGCTT ATCAATTGGG ACTATTGAGG CTCCTCTTTT AGATTGA
|
Protein sequence | MLIPEPIQAD FPWLSLSILF PIVGALIVPF IPDKGEGKEV RWYALIISLI TFLITVAAYF KGFDPSLEGL QLYEKVSWLP DLGLTWSVGA DGLSMPLILL TSFITSLAVL AAWPVSYKPK LFFFLILAMD GGQIAVFAVQ DMLLFFLAWE LELFPVYLFL AIWGGKKRQY AATKFIIYTA GSSLFILLAG LAMGFFQGGG VPDFGYTHLA QQNFGRGFQL LCYSGLLIAF GVKLPIVPLH TWLPDAHGEA TAPVHMLLAG ILLKMGGYAL LRFNAQLLPD AHAQFAPLLI VLGVVNIIYA ALTSFAQRNL KRKIAYSSIS HMGFVLIGIG SFSSLGTSGA MLQMVSHGLI GASLFFLVGA TYDRTHTLQL DEMGGIGQNM RIMFALWTAC AFASLALPGM SGFISELMVF VGFVTDEVYT LPFRIVVASL AAIGVILTPI YLLSMLREIF FGKENAKLIS KAKLVDAEPR EIYIIACLLV PIIGIGLYPK IMTDTYISSI DGLVKRDLLA VERIRSDRAT IMSNTSLSIG TIEAPLLD
|
| |
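The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a hedged example, not the method used to produce this record. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ only in which start codons are permitted, and this gene begins with ATG). The helper `translate` and the constant names are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order. Assumption: for elongation codons,
# translation table 11 matches the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base grouping used in the
    record can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
first_bases = "ATGCTAATTC CTGAGCCAAT TCAAGCCGAT"
last_bases = "GCTCCTCTTTT AGATTGA"

print(translate(first_bases))  # MLIPEPIQAD
print(translate(last_bases))   # APLLD
```

The two printed fragments correspond to the first ten and last five residues of the protein sequence in the record. Passing the full gene sequence to `translate` would be the natural next check.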