Gene Information |
Locus tag | PMN2A_1596 |
Symbol | |
ID | 3606994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 267762 |
End bp | 269012 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637688474 |
Product | ATP-sulfurylase |
Protein accession | YP_292787 |
Protein GI | 72383432 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2046] ATP sulfurylase (sulfate adenylyltransferase) |
TIGRFAM ID | [TIGR00339] ATP sulphurylase |
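The length fields above follow from the coordinates by simple arithmetic. Here is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the values in this record, but the record does not state them). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, when has_stop is True,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(267762, 269012)
print(length)                  # 1251
print(protein_length(length))  # 416
```

Strand does not affect these counts; a gene on the minus strand still spans the same number of bases.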
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTTGAAT TTAATGAAAA CAATTTCTTA AATATGGGCC CCCTATCTGG GAGAATTTCC CGGTTACTTA AAGAGAAAAT GACTAGCAAG CAAAGTTCTA ATAAAAATCT CGCAGGTTTA ATCAAACCTT ATGGTGGAGA ACTTATAAAC CTAATGGCTT CTGATCAAGA AGCAAAAGAG TTAAAAAAAA ATTCTTTTAA AACTTTAAAT TGTTCTGATA GAAATGCTTG TGATATTGAA CTTCTTTTGA TAGGTGCTTT TTCTCCTTTA AATGGGTTCA TGAATGAGAA AAATTACAAC TCAGTCGTTA AACAAAATCG ACTTGAATCA GGTTTGCTTT TTGGTTTGCC GATTGTGATG GATACAGATA GAGAAGATAT AAATCCAGGA GATTCAGTTC TACTCAATTA CAAAGATCAA GAACTAGCAA TTTTAGAAAT ACAAGAAAAA TGGACTCCTG ACAAAGTTAT TGAAGCCAAA TTTTGCTATG GAACAACTTC TTTGGAGCAT CCTGCAGTAA GAATGATATC TATGGAGAGG AAAAAATATT ATTTAGGAGG CTCAATAAAA GGTTTAGAAT TACCTAAAAG AGTTTTTACT TGCCAAACTC CTGCTCAAGT AAGAGAGAAC CTTCCTTCTG GAGAAGATGT AGTCGCATTC CAGTGCAGAA ATCCAATTCA TAGAGCCCAT TATGAGCTTT TTACAAGAGC CTTAGAAGCC AATAATGTCA GTAAAAATGG AGTTGTTCTT GTTCACCCAA CTTGTGGACC AACTCAAGAA GATGACATCC CTGGATCAGT AAGATTTCAA ACCTATGAAA AACTTGCCTC TGAAGTCAAT AATCCAAAAA TTAGGTGGTC ATACCTTCCT TATTCGATGC ATATGGCTGG GCCAAGAGAG GCTTTGCAGC ACATGATTAT CAGAAGGAAT TATGGATGTA CTCATTTTAT TATCGGAAGA GATATGGCCG GCTGTAAGTC CTCTCTAGAT GGTGAAGATT TTTATGGTCC ATATGATGCT CAAAATTTTG CAAACGAGTG CTGCCAAGAA TTAGAAATGC AAACAGTTCC ATCTCTAAAT CTTGTATTTA CAGAGGAGGA AGGCTATGTA ACCGCCGATT ATGCTAAAGA AAAAGGATTA CACATAAAAA AATTGAGTGG CACTCAATTC AGAAAAATGC TCAGAAGTGG AGAAGAAATT CCTGAATGGT TTGCATTTAA AAGCGTCGTT GATGTACTAA GAGCCGCATA G
|
Protein sequence | MVEFNENNFL NMGPLSGRIS RLLKEKMTSK QSSNKNLAGL IKPYGGELIN LMASDQEAKE LKKNSFKTLN CSDRNACDIE LLLIGAFSPL NGFMNEKNYN SVVKQNRLES GLLFGLPIVM DTDREDINPG DSVLLNYKDQ ELAILEIQEK WTPDKVIEAK FCYGTTSLEH PAVRMISMER KKYYLGGSIK GLELPKRVFT CQTPAQVREN LPSGEDVVAF QCRNPIHRAH YELFTRALEA NNVSKNGVVL VHPTCGPTQE DDIPGSVRFQ TYEKLASEVN NPKIRWSYLP YSMHMAGPRE ALQHMIIRRN YGCTHFIIGR DMAGCKSSLD GEDFYGPYDA QNFANECCQE LEMQTVPSLN LVFTEEEGYV TADYAKEKGL HIKKLSGTQF RKMLRSGEEI PEWFAFKSVV DVLRAA
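The relationship between the gene and protein sequences above can be checked with a short script. This is a sketch, not the database's own tooling. It assumes translation table 11 (bacterial, archaeal and plant plastid code), whose codon assignments match the standard code, and it treats an alternative start codon such as TTG as methionine when it is the first codon. The helper names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("TTGGTTGAAT TTAATGAAAA CAATTTCTTA"))  # MVEFNENNFL
```

Passing the full gene sequence to `translate` and `gc_content` is one way to compare the result with the protein sequence and GC content fields listed in the record.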