Gene A9601_02481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02481 
Symbolmet3 
ID4716932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp231696 
End bp232871 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content34% 
IMG OID640077947 
ProductATP-sulfurylase 
Protein accessionYP_001008643 
Protein GI123967785 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.256568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAC AACAAAAAAC TAAAACAGAT AATAATGGAC TAATACCGCC TTATGGAGGG 
GAACTAAAAA ATTTAATTAT CAAAGATAAA AGCCTTAAAA ATGAACTTAT TTCTAAAGCT
ACTTATGAGT TTGAATGTAG CGAGAGAAAT GCATGCGATG TAGAACTTTT AATGGTTGGA
GCTTTTTCTC CATTGGAAGG TTTTATGGAT GCAAATAACT ACAATTCGGT GATTAAGAAT
AATAGAAATA CAAGCGGGTT GCTTTTTGGC TTGCCTATTG TATTTGATTC CAATAATGAA
AAAGTAAAAA CTGGAGAGAC AATATTACTT ACCTATAAAA AACAAAAAAT AGCAGTTTTA
GAAGTTAGCT CTAAATGGGA GCCTGACAAA TCCTTAGAAG CTGAACTTTG TTATGGTACT
AATTCTTTAG ATCATCCTGC TGTTAAGATG ATTTTTAACG AGAGAGGTAG ATTTTATATA
GGAGGAAGAG TTTATGGTTT CGAACTGCCA ACTAGAGAAT TCCCCTGCAA AACTCCAGAA
GAAGTTAGAT CTACACTGCC ACCAAATCAT GATGTAGTTG CATTTCAATG CAGAAATCCA
ATTCATAGAG CACATTATGA ATTATTTACT AATGCCTTAC TTTCAGAAAA TGTCTCCTCT
AAATCAGTTG TTTTAGTTCA TCCAACTTGT GGACCAACTC AACAAGATGA TATCCCGGGG
AAAGTTAGAT ATTTGACATA TAAAGAATTA GAAGAGGAAA TATCTGATGA AAGAATAAAA
TGGGCTTTTT TACCTTATTC AATGCATATG GCGGGGCCAA GAGAAGCTTT GCAACATATG
ATAATCAGAA GAAATTATGG CTGCACCCAC TTTATTATTG GTAGAGATAT GGCTGGTTGT
AAGTCTTCAT CAACTGGTGA GGATTTTTAT GGTCCATATG ACGCCCAGAA TTTTGCAAAT
AAGTGCGCAG ATGAATTGAT GATGCAAACT GTTCCTTCAA AAAATTTAGT TTATACGAAG
GAAAAAGGAT ATATAACAGC TGAAGAAGCC AAAGAATTAA ATTATGAAAT TATGAAACTT
AGTGGTACTG AATTTAGAAA GAAATTAAGG AATGGCGAAC CAATTCCTGA ATGGTTTGCA
TTCAAAAGTG TAGTAGATGT TCTAAGACGC TCTTAA
 
Protein sequence
MELQQKTKTD NNGLIPPYGG ELKNLIIKDK SLKNELISKA TYEFECSERN ACDVELLMVG 
AFSPLEGFMD ANNYNSVIKN NRNTSGLLFG LPIVFDSNNE KVKTGETILL TYKKQKIAVL
EVSSKWEPDK SLEAELCYGT NSLDHPAVKM IFNERGRFYI GGRVYGFELP TREFPCKTPE
EVRSTLPPNH DVVAFQCRNP IHRAHYELFT NALLSENVSS KSVVLVHPTC GPTQQDDIPG
KVRYLTYKEL EEEISDERIK WAFLPYSMHM AGPREALQHM IIRRNYGCTH FIIGRDMAGC
KSSSTGEDFY GPYDAQNFAN KCADELMMQT VPSKNLVYTK EKGYITAEEA KELNYEIMKL
SGTEFRKKLR NGEPIPEWFA FKSVVDVLRR S