Gene NATL1_03071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03071 
Symbolmet3 
ID4780277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp285555 
End bp286772 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content36% 
IMG OID640083572 
ProductATP-sulfurylase 
Protein accessionYP_001014136 
Protein GI124025020 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCCCC TATCTGGGAG AATTTCTCGG TTACTTAAAG AGAAAATGAC TAGCAAGCAA 
AGTTCTAATA AAAATCTCGC AGGTTTAATC AAACCTTATG GTGGAGAACT TATAAACCTA
ATGGCTTCTG ATCAAGAAGC AAAAGAGTTA AAAAAAAATT CTTTTAAAAC TTTAAATTGT
TCTGATAGAA ATGCTTGTGA TATTGAACTT CTTTTGATAG GTGCTTTTTC TCCTTTAAAT
GGGTTCATGA GTGAGAAAAA TTACAACTCA GTCGTTAAAC AAAATCGACT TGAATCAGGT
TTGCTTTTTG GTTTGCCCAT TGTGATGGAT ACAGATAGAG AAGATATAAA TCCAGGAGAT
TCAGTTGTAC TTAATTACAA AGATCAAGAA CTAGCAATTT TAGAAATACA AGAGAAATGG
ACTCCTGACA AAGTTATTGA AGCCAAATTT TGCTATGGAA CAACTTCTTT GGAGCATCCT
GCAGTAAGAA TGATATCTAT GGAGAGGAAA AAATATTATT TAGGAGGCTC AATAAAAGGT
TTAGAATTAC CTAAAAGAGT TTTTACTTGC CAAACTCCTG CTCAAGTAAG AAAGAACCTT
CCTTCTGGAG AAGATGTAGT CGCATTCCAG TGCAGAAATC CAATTCATAG AGCTCATTAT
GAGCTTTTCA CAAGAGCCCT AGAAGCCAAT AATGTCAGTA AAAATGGTGT AGTTCTTGTT
CACCCAACTT GTGGACCAAC TCAAGAAGAT GACATCCCTG GATCAGTAAG ATTTCAAACC
TATGAAAAAC TTGCCTCTGA AGTTAATAAT CCAAAAATCA GGTGGTCATA TCTTCCTTAT
TCGATGCATA TGGCTGGGCC AAGAGAGGCT TTACAGCACA TGATTATTAG AAGGAATTAT
GGATGTACTC ATTTTATTAT TGGAAGAGAT ATGGCAGGCT GTAAGTCCTC TCTAAATGGT
GAAGATTTTT ATGGTCCATA TGATGCTCAA AATTTTGCAA ACGAGTGCTG CCAAGAATTA
GAAATGCAAA CAGTTCCATC TCTAAATCTT GTATTTACAG AGGAGGAAGG CTATGTAACC
GCCGATTATG CTAAAGAAAA AGGATTACAC ATAAAAAAAT TGAGTGGCAC TCAATTCAGA
AAAATGCTCA GAAGTGGAGA AGAAATTCCT GAATGGTTTG CATTTAAAAG CGTCGTTGAT
GTACTAAGAG CCGCATAG
 
Protein sequence
MGPLSGRISR LLKEKMTSKQ SSNKNLAGLI KPYGGELINL MASDQEAKEL KKNSFKTLNC 
SDRNACDIEL LLIGAFSPLN GFMSEKNYNS VVKQNRLESG LLFGLPIVMD TDREDINPGD
SVVLNYKDQE LAILEIQEKW TPDKVIEAKF CYGTTSLEHP AVRMISMERK KYYLGGSIKG
LELPKRVFTC QTPAQVRKNL PSGEDVVAFQ CRNPIHRAHY ELFTRALEAN NVSKNGVVLV
HPTCGPTQED DIPGSVRFQT YEKLASEVNN PKIRWSYLPY SMHMAGPREA LQHMIIRRNY
GCTHFIIGRD MAGCKSSLNG EDFYGPYDAQ NFANECCQEL EMQTVPSLNL VFTEEEGYVT
ADYAKEKGLH IKKLSGTQFR KMLRSGEEIP EWFAFKSVVD VLRAA