Gene P9303_04271 details


Gene Information

Locus tag: P9303_04271
Symbol:
ID: 4776326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 426729
End bp: 429089
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 48%
IMG OID: 640085931
Product: sulfatase
Protein accession: YP_001016444
Protein GI: 124022137
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
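
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 426729
END_BP = 429089
GENE_LENGTH_BP = 2361
PROTEIN_LENGTH_AA = 786


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 429089 - 426729 + 1 = 2361, matching the Gene Length field.
    print(span_length(START_BP, END_BP))
    # 2361 / 3 = 787 codons; minus the stop codon gives 786, matching Protein Length.
    print(expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.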


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCA GAATGCTTCT TTGGCTTGCA TGTATCATAA GTTTCACAGC AACATCCGCC 
ACCAACGCAG AAGAGATTAA TCGCCGGCGA CTTCCGATCG CAGATCCATA TCCTCAGAAA
GTTGACGAGC GCTTACCAAG TCAAGTCAAA ATGCCACGAC AACATGTCGT CGAGGCACCC
AAAGGAGCAC CCAATGTTGT TGTCATCTTG TTGGATGATG TTGGCTTTGG TGCTCCTTCG
CCCTTTGGCG GAGTGGTGCA AATGCCGGCC CTGCAGGAAC TAGCAAACAA TGGCCTGAGT
TATAACCGAT TTCATACTTC AGCCCTATGT GCACCTACAC GAGCTGCTCT TAAAGCAGGC
CGTAATCACC ATGTGATGAA CATGGGTTCA ATCCCTGAAA TTGCGACTGG ATATGCGGGC
AATACCACAT TTGTGCCTAA CTATGCCGAA CCTGTAGCCG AGATCCTAAG GCTCAATGGC
TACAACACAG GCGCTTTTGG TAAATGGCAT GAAACCCCTG GTCGTGAGAC CACTGCTGCG
GGCCCACAAA CCCGCTGGCC AACAAGGCAA GGATTTGAAA AATTTTATGG CTTCATCGGC
GCAGAAGAAA ATATGTATGA GCCATCACTT CACGATGGGG TCACAATCAT TGATTACCCA
GATCGAGAGG ATTACCACTT CCTTGAAGAC ATGACAGATC AGGCTATTGC CTGGATGCGC
CAACAACAGG GACTACGCCC AGACAAGCCT TTCTTTATCT ATTATGCATC TGCTGGCTCA
CATGCACCAC ACCATGTAAG ACCTGAATGG ATTAAAAAGT ACAAGGGAAA GTTTGATAAG
GGCTGGGATG TTATTCGAGA AGAAACGCTC GCAAATCAGA TCGCAAAAGG TGTTGTACCC
CCAAACACAC AACTGGCAAA AAAACCAGCA AGTGTGCCGA ACTGGGATGA TCTTAGTGAT
GTACAAAAAC GCATGTTTGC GCGACAAGCA GAGGTGTTCG CAGCTTTTAC AGAATACACA
GACTATCAAG CTGGGCGGCT GATCCAGGCC ATTGATGATC TTGGTGAACT TGACAACACG
CTGGTTATTT ATATCAGTGG TGATAACGGC ACAAGCTCAG AAGGGAACCA AACCGGCAAC
TGGAACTGGG GGCACATGTT TAACGGCATC CCTGAAACAC TTGAGGCTCA GTTAGAGCAT
TATGACAATT GGGGTGATCA GAACACCTAT CCACATATGG CAGTCGGCTG GGCGATTGCC
TTTGATACTC CATTTGCATT TACCAAACAA ATTGCCGGGG ATTTTGGCGG TACACGCAAT
GGCACTGTGA TTCATTGGCC TGAGGCCATC AAATCAAAAG GAGAGATACG CGATCAATTC
TCCCATGTGA TTGATGTTGC GCCAACGATC CTTGAAGCTG CTGGCTTACC CATGCCTGAA
CAGATCAATG GGATTGCACA AATCCCTATG CAAGGAACCA GCCTTGTTTA TTCTTTTGAT
AACGCTGATG CACCTGAGCG TCACAAGGTT CAATATTTTG AAGTTGTTGG TAATCGTGGA
ATCTATCAAG ATGGCTGGAT GGCAAGAGCC ACCGTTGGCC TGCCATGGGA AGCACCCAAG
AAGATGCATA GCGTAGCCAA AGATGATGGA TGGGAGTTAT ACGACACGCG CAATGATTTC
AGCCTAGCTA ACAACCTGGC AACTCAGTAT CCAGAACGGC TCGAGGCAAT GAAACGACTC
TTCTTGAAAG AAGCAATCGC CAATCAAGTG CTCCCCTTAG ATGATCGTCT GTTGCAACGT
TTAGTGCCTT CTGTTGCAGG CCGGCCAACG ATCATGGGCT CGTCGAGGAC ACAACTTGAT
CTCTACCCAG GCTCCTGGGA TCTTGTTGAG GATGCCATCC TCAATGTCAA GAATGTTTCC
AACAGCATTA CAGCCAAGGT CGACATTGAC TCAAGCCAAG ATGCCAACGG CGTGATCATG
GCCCAAGGGA GCCGCTTTGG CGGATGGTCA CTTTATGTAG AAGATGGTTA TCCGGCCTAT
GCCTATAACT ACCTGGGCAA TCTTCATACC TTCCGCAGCA AGGAGAAACT CTCTTCTGGT
AACAGAAAAA TTCGCTTTGA GATGGATTAC GACGGGGGAG GCGCTGGTAA GGGTGCAGAT
GTGCGCCTGC TAGTTGATGA CAAAGTCACA TCAACTGGCA GGGTTGAGGC AACCGTTGGT
ACACGTTTTT CGATTGACGA AGGTGCTGAT GTTGGCATGG ATCGTGGATC ACCTGTTGCT
AAAAGAGTGA TTGGGGATCA GCGCTTCAGT GCTTTCAATG GGACCATCAA CAAGGTGACA
TTGGAGATCT ATCCACAGTG A
 
Protein sequence
MKFRMLLWLA CIISFTATSA TNAEEINRRR LPIADPYPQK VDERLPSQVK MPRQHVVEAP 
KGAPNVVVIL LDDVGFGAPS PFGGVVQMPA LQELANNGLS YNRFHTSALC APTRAALKAG
RNHHVMNMGS IPEIATGYAG NTTFVPNYAE PVAEILRLNG YNTGAFGKWH ETPGRETTAA
GPQTRWPTRQ GFEKFYGFIG AEENMYEPSL HDGVTIIDYP DREDYHFLED MTDQAIAWMR
QQQGLRPDKP FFIYYASAGS HAPHHVRPEW IKKYKGKFDK GWDVIREETL ANQIAKGVVP
PNTQLAKKPA SVPNWDDLSD VQKRMFARQA EVFAAFTEYT DYQAGRLIQA IDDLGELDNT
LVIYISGDNG TSSEGNQTGN WNWGHMFNGI PETLEAQLEH YDNWGDQNTY PHMAVGWAIA
FDTPFAFTKQ IAGDFGGTRN GTVIHWPEAI KSKGEIRDQF SHVIDVAPTI LEAAGLPMPE
QINGIAQIPM QGTSLVYSFD NADAPERHKV QYFEVVGNRG IYQDGWMARA TVGLPWEAPK
KMHSVAKDDG WELYDTRNDF SLANNLATQY PERLEAMKRL FLKEAIANQV LPLDDRLLQR
LVPSVAGRPT IMGSSRTQLD LYPGSWDLVE DAILNVKNVS NSITAKVDID SSQDANGVIM
AQGSRFGGWS LYVEDGYPAY AYNYLGNLHT FRSKEKLSSG NRKIRFEMDY DGGGAGKGAD
VRLLVDDKVT STGRVEATVG TRFSIDEGAD VGMDRGSPVA KRVIGDQRFS AFNGTINKVT
LEIYPQ
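
The sequences above are listed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction helper and a stop-codon check. The function names are hypothetical, and the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA). Only the last line of the gene sequence listing is used as sample input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Last line of the gene sequence listing, copied verbatim.
tail = clean_sequence("TTGGAGATCT ATCCACAGTG A")
print(len(tail), ends_with_stop(tail))  # prints: 21 True
```

Applied to the full cleaned gene sequence, `gc_fraction` could be compared against the GC content field reported in the Gene Information section.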