Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07121 |
Symbol | |
ID | 4780964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 654667 |
End bp | 656310 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640083986 |
Product | glucose-methanol-choline (GMC) oxidoreductase:NAD binding site |
Protein accession | YP_001014535 |
Protein GI | 124025419 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.569172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.139328 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGACA AGCCCTATGA AGTCATAATT ATCGGTTCAG GTGCAACCGG GGGAATGGCA GCCTTAACAA TGGCTAAGGC TGGAGTAAGA GTACTAGTAA TAGAAAGAGG TCCTGAACTA GAGATCAAGC AGGCAAACGG AACAGAGCCT TGCAATATGA TTCGGAGACT TTTAGGGGTA ACAACTGGAA ATTATCAAAA TCAACCTCAA CATCCAGGGT TCTGGAAATC AAATCCCTTA CTGTACGCAA ACAAAAAAGC AAATCCTTAT ACACACCCGC CAAAAGCCCC CTTCATGTGG ACGCAAGGCA ATCAAGTTGG TGGAAGGAGC CTTACTTGGG GAGGGATAAC CTTAAGATTA TCTGAGGAAG ATTTTGAAGC CTCAAAAGAA AAAGAATACA ATCTCGAATG GCCTATTAGT TACAAAGACC TTGAGTCACA TTATTCGGAA ATAGAGAGTT TTCTAAAAAT ATACGGCAAC AAAGACGATC TCAATCAACT ACCTAATGGT GAATATATTG GCAAACTTCC ATTTACAGAA AGTGAATCAA GATTTGCCTC CAATATAAAA GAAAATTTAA ATCTTCCCTT TATACATTCC AGAGGGTTTG GACCAAATGA AGATAAAACG AAATGGCCAA GATATAGCAG TTTAGGCAGC ACATTAAAAG AAGCCGCTAG GCTAGGTAAA ATAGAAATAC TTTCAAATCA TATTGTAGAT AAATTAGTTT TGAATAAGGA TAGGAAGTCT GCAAAAAGTA TTATTGTAGT AAACCAAAAA AATGGAGAAA GAAGTGAATT AGAAAGTAAA TTAATAATAC TATGTTCATC AACAATCCAA ACAATTAGAA TTCTATTAAG TTCCGAAGAA AGTAATAATT CAAATGGTCT AATTGACCCC TCTAATTCAT TAGGAATGAA CTTAATGGAT CATATATCAA CCTGTAGGTT TTTCACTGTC CCTATCGATA AGAATTTCAA TGATTATTCA GATAAAAATA ATAATCATCT TTTAACAGGA GCAGGTAGCT TCTTTATTCC CATCGGTAGA GAGAGCTCAA CTAAAAAAAA CTTTGTTGGT GGATATGGCA TCTGGGGAGG AATAGATAGA TTTGAACCAC CAGAAGTTTT TAAGAAATAT AAAAACACAA AAACTGGTTT CCTTATTGGT CATGGAGAAG TACTCCCAAA TAAAAAAAAC ACTGTTTCTC TCTCAAATAC CAATGATCTA TATGGTATTT CCATACCGCA CATATCAATA GTTTGGCGAG AAAATGAGAA ACGAATGGTT TCAGAAATGA ACAGAATGAT TGAACTTATT ATTAATTCTG GTAATGGCAA AATTATTCCA GTGAATGAGA TTCTCAATAT TCCATTTACC AAACAAATTT TAAGCAAATC AGTTGCTATT AAAAGCGATG CTCCACCCCC TGGTTACTAC ATACACGAGG TAGGAGGAGC ACCAATGGGA AATGACAAAG GAAATAGTGT TTTAGACAAC TGGAATCGTC TATGGGAATG CAACAATGTA TTAGTAGTTG ATGGAGCTTG CTGGCCTACT TCATCTTGGC AAAGCCCAAC ATTAACAATG ATGGCAATAA CTAAGAGAGC TTGCGAAAAG GCAATTAGAG ACTTTAAAGG TTAA
|
Protein sequence | MIDKPYEVII IGSGATGGMA ALTMAKAGVR VLVIERGPEL EIKQANGTEP CNMIRRLLGV TTGNYQNQPQ HPGFWKSNPL LYANKKANPY THPPKAPFMW TQGNQVGGRS LTWGGITLRL SEEDFEASKE KEYNLEWPIS YKDLESHYSE IESFLKIYGN KDDLNQLPNG EYIGKLPFTE SESRFASNIK ENLNLPFIHS RGFGPNEDKT KWPRYSSLGS TLKEAARLGK IEILSNHIVD KLVLNKDRKS AKSIIVVNQK NGERSELESK LIILCSSTIQ TIRILLSSEE SNNSNGLIDP SNSLGMNLMD HISTCRFFTV PIDKNFNDYS DKNNNHLLTG AGSFFIPIGR ESSTKKNFVG GYGIWGGIDR FEPPEVFKKY KNTKTGFLIG HGEVLPNKKN TVSLSNTNDL YGISIPHISI VWRENEKRMV SEMNRMIELI INSGNGKIIP VNEILNIPFT KQILSKSVAI KSDAPPPGYY IHEVGGAPMG NDKGNSVLDN WNRLWECNNV LVVDGACWPT SSWQSPTLTM MAITKRACEK AIRDFKG
|
| |