Gene Information |
Locus tag | NATL1_03071 |
Symbol | met3 |
ID | 4780277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 285555 |
End bp | 286772 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640083572 |
Product | ATP-sulfurylase |
Protein accession | YP_001014136 |
Protein GI | 124025020 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2046] ATP sulfurylase (sulfate adenylyltransferase) |
TIGRFAM ID | [TIGR00339] ATP sulphurylase |
|
|
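The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(285555, 286772)
protein = expected_protein_length_aa(length)
print(length, protein)  # prints "1218 405"
```

Under these assumptions both results agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields.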
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCCCC TATCTGGGAG AATTTCTCGG TTACTTAAAG AGAAAATGAC TAGCAAGCAA AGTTCTAATA AAAATCTCGC AGGTTTAATC AAACCTTATG GTGGAGAACT TATAAACCTA ATGGCTTCTG ATCAAGAAGC AAAAGAGTTA AAAAAAAATT CTTTTAAAAC TTTAAATTGT TCTGATAGAA ATGCTTGTGA TATTGAACTT CTTTTGATAG GTGCTTTTTC TCCTTTAAAT GGGTTCATGA GTGAGAAAAA TTACAACTCA GTCGTTAAAC AAAATCGACT TGAATCAGGT TTGCTTTTTG GTTTGCCCAT TGTGATGGAT ACAGATAGAG AAGATATAAA TCCAGGAGAT TCAGTTGTAC TTAATTACAA AGATCAAGAA CTAGCAATTT TAGAAATACA AGAGAAATGG ACTCCTGACA AAGTTATTGA AGCCAAATTT TGCTATGGAA CAACTTCTTT GGAGCATCCT GCAGTAAGAA TGATATCTAT GGAGAGGAAA AAATATTATT TAGGAGGCTC AATAAAAGGT TTAGAATTAC CTAAAAGAGT TTTTACTTGC CAAACTCCTG CTCAAGTAAG AAAGAACCTT CCTTCTGGAG AAGATGTAGT CGCATTCCAG TGCAGAAATC CAATTCATAG AGCTCATTAT GAGCTTTTCA CAAGAGCCCT AGAAGCCAAT AATGTCAGTA AAAATGGTGT AGTTCTTGTT CACCCAACTT GTGGACCAAC TCAAGAAGAT GACATCCCTG GATCAGTAAG ATTTCAAACC TATGAAAAAC TTGCCTCTGA AGTTAATAAT CCAAAAATCA GGTGGTCATA TCTTCCTTAT TCGATGCATA TGGCTGGGCC AAGAGAGGCT TTACAGCACA TGATTATTAG AAGGAATTAT GGATGTACTC ATTTTATTAT TGGAAGAGAT ATGGCAGGCT GTAAGTCCTC TCTAAATGGT GAAGATTTTT ATGGTCCATA TGATGCTCAA AATTTTGCAA ACGAGTGCTG CCAAGAATTA GAAATGCAAA CAGTTCCATC TCTAAATCTT GTATTTACAG AGGAGGAAGG CTATGTAACC GCCGATTATG CTAAAGAAAA AGGATTACAC ATAAAAAAAT TGAGTGGCAC TCAATTCAGA AAAATGCTCA GAAGTGGAGA AGAAATTCCT GAATGGTTTG CATTTAAAAG CGTCGTTGAT GTACTAAGAG CCGCATAG
|
Protein sequence | MGPLSGRISR LLKEKMTSKQ SSNKNLAGLI KPYGGELINL MASDQEAKEL KKNSFKTLNC SDRNACDIEL LLIGAFSPLN GFMSEKNYNS VVKQNRLESG LLFGLPIVMD TDREDINPGD SVVLNYKDQE LAILEIQEKW TPDKVIEAKF CYGTTSLEHP AVRMISMERK KYYLGGSIKG LELPKRVFTC QTPAQVRKNL PSGEDVVAFQ CRNPIHRAHY ELFTRALEAN NVSKNGVVLV HPTCGPTQED DIPGSVRFQT YEKLASEVNN PKIRWSYLPY SMHMAGPREA LQHMIIRRNY GCTHFIIGRD MAGCKSSLNG EDFYGPYDAQ NFANECCQEL EMQTVPSLNL VFTEEEGYVT ADYAKEKGLH IKKLSGTQFR KMLRSGEEIP EWFAFKSVVD VLRAA
|
| |
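The sequences above are printed in space-separated blocks of ten characters, so they usually need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not expected to match the 36% GC content reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First three 10-base blocks of the gene sequence, as printed in the record.
sample = clean_sequence("ATGGGCCCCC TATCTGGGAG AATTTCTCGG")
print(len(sample))                    # prints 30
print(round(gc_fraction(sample), 3))  # prints 0.567
```

Pasting the full Gene sequence field into `clean_sequence` should, if the record is internally consistent, give a string of 1218 bases.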