Gene Information |
Locus tag | P9303_04271 |
Symbol | |
ID | 4776326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 426729 |
End bp | 429089 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640085931 |
Product | sulfatase |
Protein accession | YP_001016444 |
Protein GI | 124022137 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
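The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length, codons - 1  # drop the stop codon


# Values taken from the record above.
gene_length, protein_length = cds_lengths(426729, 429089)
print(gene_length, protein_length)  # 2361 786
```

Under these assumptions the result matches the listed Gene Length (2361 bp) and Protein Length (786 aa).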
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCA GAATGCTTCT TTGGCTTGCA TGTATCATAA GTTTCACAGC AACATCCGCC ACCAACGCAG AAGAGATTAA TCGCCGGCGA CTTCCGATCG CAGATCCATA TCCTCAGAAA GTTGACGAGC GCTTACCAAG TCAAGTCAAA ATGCCACGAC AACATGTCGT CGAGGCACCC AAAGGAGCAC CCAATGTTGT TGTCATCTTG TTGGATGATG TTGGCTTTGG TGCTCCTTCG CCCTTTGGCG GAGTGGTGCA AATGCCGGCC CTGCAGGAAC TAGCAAACAA TGGCCTGAGT TATAACCGAT TTCATACTTC AGCCCTATGT GCACCTACAC GAGCTGCTCT TAAAGCAGGC CGTAATCACC ATGTGATGAA CATGGGTTCA ATCCCTGAAA TTGCGACTGG ATATGCGGGC AATACCACAT TTGTGCCTAA CTATGCCGAA CCTGTAGCCG AGATCCTAAG GCTCAATGGC TACAACACAG GCGCTTTTGG TAAATGGCAT GAAACCCCTG GTCGTGAGAC CACTGCTGCG GGCCCACAAA CCCGCTGGCC AACAAGGCAA GGATTTGAAA AATTTTATGG CTTCATCGGC GCAGAAGAAA ATATGTATGA GCCATCACTT CACGATGGGG TCACAATCAT TGATTACCCA GATCGAGAGG ATTACCACTT CCTTGAAGAC ATGACAGATC AGGCTATTGC CTGGATGCGC CAACAACAGG GACTACGCCC AGACAAGCCT TTCTTTATCT ATTATGCATC TGCTGGCTCA CATGCACCAC ACCATGTAAG ACCTGAATGG ATTAAAAAGT ACAAGGGAAA GTTTGATAAG GGCTGGGATG TTATTCGAGA AGAAACGCTC GCAAATCAGA TCGCAAAAGG TGTTGTACCC CCAAACACAC AACTGGCAAA AAAACCAGCA AGTGTGCCGA ACTGGGATGA TCTTAGTGAT GTACAAAAAC GCATGTTTGC GCGACAAGCA GAGGTGTTCG CAGCTTTTAC AGAATACACA GACTATCAAG CTGGGCGGCT GATCCAGGCC ATTGATGATC TTGGTGAACT TGACAACACG CTGGTTATTT ATATCAGTGG TGATAACGGC ACAAGCTCAG AAGGGAACCA AACCGGCAAC TGGAACTGGG GGCACATGTT TAACGGCATC CCTGAAACAC TTGAGGCTCA GTTAGAGCAT TATGACAATT GGGGTGATCA GAACACCTAT CCACATATGG CAGTCGGCTG GGCGATTGCC TTTGATACTC CATTTGCATT TACCAAACAA ATTGCCGGGG ATTTTGGCGG TACACGCAAT GGCACTGTGA TTCATTGGCC TGAGGCCATC AAATCAAAAG GAGAGATACG CGATCAATTC TCCCATGTGA TTGATGTTGC GCCAACGATC CTTGAAGCTG CTGGCTTACC CATGCCTGAA CAGATCAATG GGATTGCACA AATCCCTATG CAAGGAACCA GCCTTGTTTA TTCTTTTGAT AACGCTGATG CACCTGAGCG TCACAAGGTT CAATATTTTG AAGTTGTTGG TAATCGTGGA ATCTATCAAG ATGGCTGGAT GGCAAGAGCC ACCGTTGGCC TGCCATGGGA AGCACCCAAG AAGATGCATA GCGTAGCCAA AGATGATGGA TGGGAGTTAT ACGACACGCG CAATGATTTC AGCCTAGCTA ACAACCTGGC AACTCAGTAT CCAGAACGGC TCGAGGCAAT GAAACGACTC TTCTTGAAAG AAGCAATCGC CAATCAAGTG CTCCCCTTAG ATGATCGTCT GTTGCAACGT 
TTAGTGCCTT CTGTTGCAGG CCGGCCAACG ATCATGGGCT CGTCGAGGAC ACAACTTGAT CTCTACCCAG GCTCCTGGGA TCTTGTTGAG GATGCCATCC TCAATGTCAA GAATGTTTCC AACAGCATTA CAGCCAAGGT CGACATTGAC TCAAGCCAAG ATGCCAACGG CGTGATCATG GCCCAAGGGA GCCGCTTTGG CGGATGGTCA CTTTATGTAG AAGATGGTTA TCCGGCCTAT GCCTATAACT ACCTGGGCAA TCTTCATACC TTCCGCAGCA AGGAGAAACT CTCTTCTGGT AACAGAAAAA TTCGCTTTGA GATGGATTAC GACGGGGGAG GCGCTGGTAA GGGTGCAGAT GTGCGCCTGC TAGTTGATGA CAAAGTCACA TCAACTGGCA GGGTTGAGGC AACCGTTGGT ACACGTTTTT CGATTGACGA AGGTGCTGAT GTTGGCATGG ATCGTGGATC ACCTGTTGCT AAAAGAGTGA TTGGGGATCA GCGCTTCAGT GCTTTCAATG GGACCATCAA CAAGGTGACA TTGGAGATCT ATCCACAGTG A
|
Protein sequence | MKFRMLLWLA CIISFTATSA TNAEEINRRR LPIADPYPQK VDERLPSQVK MPRQHVVEAP KGAPNVVVIL LDDVGFGAPS PFGGVVQMPA LQELANNGLS YNRFHTSALC APTRAALKAG RNHHVMNMGS IPEIATGYAG NTTFVPNYAE PVAEILRLNG YNTGAFGKWH ETPGRETTAA GPQTRWPTRQ GFEKFYGFIG AEENMYEPSL HDGVTIIDYP DREDYHFLED MTDQAIAWMR QQQGLRPDKP FFIYYASAGS HAPHHVRPEW IKKYKGKFDK GWDVIREETL ANQIAKGVVP PNTQLAKKPA SVPNWDDLSD VQKRMFARQA EVFAAFTEYT DYQAGRLIQA IDDLGELDNT LVIYISGDNG TSSEGNQTGN WNWGHMFNGI PETLEAQLEH YDNWGDQNTY PHMAVGWAIA FDTPFAFTKQ IAGDFGGTRN GTVIHWPEAI KSKGEIRDQF SHVIDVAPTI LEAAGLPMPE QINGIAQIPM QGTSLVYSFD NADAPERHKV QYFEVVGNRG IYQDGWMARA TVGLPWEAPK KMHSVAKDDG WELYDTRNDF SLANNLATQY PERLEAMKRL FLKEAIANQV LPLDDRLLQR LVPSVAGRPT IMGSSRTQLD LYPGSWDLVE DAILNVKNVS NSITAKVDID SSQDANGVIM AQGSRFGGWS LYVEDGYPAY AYNYLGNLHT FRSKEKLSSG NRKIRFEMDY DGGGAGKGAD VRLLVDDKVT STGRVEATVG TRFSIDEGAD VGMDRGSPVA KRVIGDQRFS AFNGTINKVT LEIYPQ
|
| |
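The sequences above are displayed in space-separated blocks of ten characters, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand). The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration only: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids (table 11 differs in which codons may act as starts, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding-orientation sequence up to the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGAAATTCA GAATGCTTCT TTGGCTTGCA"))  # MKFRMLLWLA
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would check the whole record; `gc_percent` on the full gene sequence can likewise be compared with the listed GC content.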