Gene P9301_02491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02491 
Symbolmet3 
ID4911533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp232658 
End bp233833 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content34% 
IMG OID640159815 
ProductATP-sulfurylase 
Protein accessionYP_001090473 
Protein GI126695587 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.23115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAC AACAAAAAAC AAAAACTGAC CCTAATGGAC TAATACCGCC TTATGGAGGG 
GAACTAAAAA ATTTAATAAT TAAAGATAAT AGCTTTAAAA ATGACCTTAT CTCCAAAGCT
ACTTATGAAT TTGAATGTAG CGAGAGAAAT GCATGTGATG TAGAACTTTT GATGGTTGGT
GCTTTTTCTC CTTTGGAAGG TTTTATGGAT GAAAATAACT ACAAATCGGT TATCGAAAAT
AACAGAGATA CAAGCGGTTT GCTTTTTGGC TTACCTATTG TCTTTGATTC AAATAATGAT
GAAGTAAAAG CTGGAGAAAC AATCTTGCTT ACCTACAAAA ACCAAAAAAT TGCAATTTTA
GAAGTAAGTT CTATTTGGGA GCCTGATAAA TCTTTAGAAG CCGAATTTTG TTATGGTACT
AATTCTTTAG ATCATCCTGC TGTTAAGATG ATTTTTAATG AAAGGGGAAG ATTCTATATA
GGAGGGAAAG TTTATGGTTT CGAACTACCA GTTAGAGAAT TTCCCTGCAA AACCCCTGAA
GAAGTTAGAT CTTCACTGCC ATCAAATTAT GATGTAGTTG CATTTCAATG CAGAAATCCA
ATTCATAGAG CACATTATGA GTTATTTACT AATGCCCTAC TCTCAGATAA TGTCTCTTCT
AACTCAGTGG TTTTGGTACA TCCAACTTGT GGGCCAACTC AACAAGACGA TATACCTGGA
AAAGTTAGAT ATTTGACCTA TAAAGAATTA GAAGAGGAAA TATCTGATGA AAGAATAAAA
TGGGCTTTTT TACCTTATTC AATGCATATG GCAGGGCCAA GGGAAGCTCT TCAACACATG
ATAATCAGAA GAAATTATGG CTGCACCCAC TTTATTATTG GTAGAGATAT GGCTGGTTGT
AAGTCATCAT CAACTGGTGA AGATTTTTAT GGCCCATATG ACGCCCAGAA TTTTGCTAAT
AAGTGTGCAG ATGAATTAAT GATGCAGACT GTTCCTTCAA AAAATTTAGT TTATACGAAG
GAAAAAGGAT ATATAACAGC TGAAGAAGCT AAAGAATTTA ATTATGAAAT TATGAAACTT
AGTGGTACTG AATTTAGAAA GAAATTGAGG AATGGCGAAC CAATTCCTGA ATGGTTTGCA
TTCAAAAGTG TAGTAGATGT TCTAAGACGC TCTTAA
 
Protein sequence
MELQQKTKTD PNGLIPPYGG ELKNLIIKDN SFKNDLISKA TYEFECSERN ACDVELLMVG 
AFSPLEGFMD ENNYKSVIEN NRDTSGLLFG LPIVFDSNND EVKAGETILL TYKNQKIAIL
EVSSIWEPDK SLEAEFCYGT NSLDHPAVKM IFNERGRFYI GGKVYGFELP VREFPCKTPE
EVRSSLPSNY DVVAFQCRNP IHRAHYELFT NALLSDNVSS NSVVLVHPTC GPTQQDDIPG
KVRYLTYKEL EEEISDERIK WAFLPYSMHM AGPREALQHM IIRRNYGCTH FIIGRDMAGC
KSSSTGEDFY GPYDAQNFAN KCADELMMQT VPSKNLVYTK EKGYITAEEA KEFNYEIMKL
SGTEFRKKLR NGEPIPEWFA FKSVVDVLRR S