Gene P9303_23841 details


Gene Information

Locus tag: P9303_23841
Symbol: met3
ID: 4776428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2101472
End bp: 2102644
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 55%
IMG OID: 640087904
Product: ATP-sulfurylase
Protein accession: YP_001018382
Protein GI: 124024075
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.114558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCCA GTCCCTCTGC ATCTGCCCAG TCTCCCGGCG TGATCGCGCC CTATGGAGGG 
ACACTGGTGG ATTTGATGGT GGCTACTGAT CAGCAGGAAG CTGTCAAGGC CAGTGCCAAC
CATGTGTTGG AGTGCTCAGA TCGCAATGCT TGCGATCTGG AGTTGCTTGT CGGTGGAGGC
TTTTCGCCTG AGCGGGGCTT TATGCATCAG GGTGATTACG ACGCTGTTGT TGCAGGCCAT
CGCACTCTTT CCGGCTATCT TTTCGGCCTG CCAATCGTGA TGGATACCGA TCGAGAGGAT
GTAGCGATCG GTGATCGGGT GTTGCTGAGT TACAAGGGTC AGGATTTGGC AGTTCTTCAA
GTCGAGGACA AATGGGAGCC CGACAAGGTG GTGGAAGCCA AAGGTTGCTA TGGCACTACC
TCTCTAGAAC ATCCCGCTGT GCGCATGATT GCCACTGAAC GCAAGCGCTT TTATCTCGGG
GGCACCTTGC AGGGTTTGGA GTTGCCTAAG CGTATTTTTC CTTGCAAGAG CCCTGCTCAG
GTTCGGGCGG AACTTCCTGC CGGGGAGGAC GTTGTTGCCT TTCAGTGTCG CAATCCCATT
CATCGCGCTC ACTACGAGTT GTTTACGCGA GCCTTGCATG CCAGCAATGT GAGCGAGAAC
GCTGTTGTGT TAGTGCATCC AACCTGTGGA CCAACTCAGC AGGATGATAT CCCTGGTGGC
GTACGTTTTC AGACCTATGA GCGGTTGGCT GCTGAGGTAG ATAATCCCCG CATTCGCTGG
GCCTATCTTC CCTATGCCAT GCATATGGCA GGTCCGCGCG AAGCCCTGCA ACACATGATT
ATTCGCCGCA ATTATGGATG TACCCATTTC ATCATCGGTC GTGACATGGC CGGATGTAAG
TCCTCCCTTA GCGGCGATGA CTTCTATGGC CCTTACGACG CGCAGAACTT TGCACAGGAA
TGTGCAGGAG AGCTGGCAAT GGAAACGGTC CCCTCGTTGA ATCTTGTTTT CACTGAAGAG
GAGGGCTACG TCACTGCCGA GCATGCTGAG GCTCGTGGAT TACATGTCAA GAAGCTCAGC
GGTACGCAGT TCCGCAAGAT GTTGAGAAGT GGCGAGGAGA TCCCTGAGTG GTTCGCCTTC
CGTAGCGTGG TTGAGGTGCT GAGAGCCACG TGA
 
Protein sequence
MIASPSASAQ SPGVIAPYGG TLVDLMVATD QQEAVKASAN HVLECSDRNA CDLELLVGGG 
FSPERGFMHQ GDYDAVVAGH RTLSGYLFGL PIVMDTDRED VAIGDRVLLS YKGQDLAVLQ
VEDKWEPDKV VEAKGCYGTT SLEHPAVRMI ATERKRFYLG GTLQGLELPK RIFPCKSPAQ
VRAELPAGED VVAFQCRNPI HRAHYELFTR ALHASNVSEN AVVLVHPTCG PTQQDDIPGG
VRFQTYERLA AEVDNPRIRW AYLPYAMHMA GPREALQHMI IRRNYGCTHF IIGRDMAGCK
SSLSGDDFYG PYDAQNFAQE CAGELAMETV PSLNLVFTEE EGYVTAEHAE ARGLHVKKLS
GTQFRKMLRS GEEIPEWFAF RSVVEVLRAT
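
The length and GC fields in this record can be cross-checked against the coordinates and sequence. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the listed sequence ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Coordinates taken from the Gene Information fields of this record.
length = gene_length(2101472, 2102644)
print(length)                  # 1173
print(protein_length(length))  # 390
```

Passing the full gene sequence (spaces and line breaks included) to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.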