Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28011 |
Symbol | |
ID | 4777733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2466782 |
End bp | 2468791 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640088324 |
Product | hypothetical protein |
Protein accession | YP_001018796 |
Protein GI | 124024489 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATGATC AGCATCATCA CTCAGCAAGA TTCGACCAGC AGGGATCGGA TGCTCCAGAA AATTTAATTA TTTTCATTCG AGATCAAGTC AAGCCGGAAG ATCTTTGGCT TCCGCGTGAA TGGGCAGCAG AAAACCTGCC AACTCGTCAA TGGCTGGTCG ATAACGGACT GTCATTTACG AACTCATTCA CGAATACTGC AATGTGTTCG GTCTCAAGGT CGACTTTTTT TACAAGTAAA TTTCCGGCAC AACATCAAGC CGACCTTTTG CTTTCTGATA TTGATAGTTC ATATCTTAAT GATCAGGTAC AACTCAATCC GGATTTTCCC AACCTGGCGA CAATCCTTAA AGACCAGGGA TATGATGTTT CCTTTTTTGG CAAGGCACAT CTAAGCAAGA CATTTACATT GAACGATGGA GAAGTTGTTT ACCAAGATAT GAATGCATAT GGCTTTGACG ATTGGGTGGG GCCAGATGCA GGGCAGGACA TGAAGCCCGA AAATGCCGGC GGCGGTCCTA ATGATAATGA CGGACGCTTT ACTTCTGAGG CAAAACAATG GCTACAAAAT CGTATTCAGT CAGACAATGA AAAACCTTAT GCTTTGGTTG TTTCCTTAGT AAACCCTCAT GACGTGTTGT CTTATCCAAA GACCTATAAC ACAGACTTTG AATATGATCG AAAATGGATT CATGGAGATA TCGAGATTCT CCCTCCAACG GTAGATGAAG ATAAGGAAGA AACCTTGAAA CCAAGCGTTC AACGGCAATG GATGATTCCA CAAAATGCTG GTCAGCCAAT GCCAACCGAT AAGATGAAAT TGAATTACCT CAACTTCTAT GGAAATTTAA TGAAGAGAGC CGATTGGCAA ATGGGAGAAA TACTTGATGT TATTCGCGAT TCAGACAATC CAGATGATGT TAATAATACG ATGATTGTCA GTACCAGTGA CCACGGTGAG ATGGGCATGT CTCATGGTGG CATGGTTCAG AAGATGTTTA ATGCCTATGA TGAGAGTCTT AAAGTGCCAA TGATTTGGTC AAATCCATCT TATTTTAAAG GCTCTCAAGA GAGTGACGCA CTCATCTCTT TAATTGACTT TTTGCCTACA TATGCAAATT TCCAGGATTT TTCAGAAGAC TATATTGCTC AACAGGATCT TCGCGGTGTA GATTATTCTT CGATTTTAAG GCGCGCCAGG GAAGGTGAGT CCAAGAGCCT AGAGGGCTTG GATGTACAAG ACTCTATTTT ATACACTTAC GATGATATCT ACGCTGGCCA AGATCCAGCT CTCTGCGAAG ATCCAGTTCA TGGCTTATTA CCTGCAGCCA ACAGAATTCA GGCAGTTCGT ACAAAAGATT TTAAATACGC CCGCTATTAC TCTGGGGACC AAGATTATGA ACCCGCAAAT TGGGAAGGTG AGCTTTATGA TTTAAGGCCT GAAGGCGGCG ACTATTATCC AGATATTGAC CCAATTACCG GACAGCTAAA TCCTTTTAGA GCAGCACCTT TAGAAGTGAG AAACCTTGAC CCTAAGGCAG AAACTCGTCG CAGACTTTTG CAGAGATTTG GGATCGGTGA TGGCCCTATT GCAACCAAGA AACAGAAAAA GGCCTATTTA GAGATGTCGG AGTTGCTTGA TCAGCAGATT GCTGACCGGC TACAACCTCT ACCTGAATCA GATCCGATTA AACCATCTAT CTTTGTTTAT CAAGGCGGCT CTAGTGGTGA TCAGTCTGCC TATAAAGTTG GGGACTCGAT TGTTCGCTTC ATCCCAAATA GTGAAGATGA GAAAGGCCTA GAGTTGGCCT TTAATACAAG ATATGGTCAG ACATATAATC TTGTCTATTC GGAGCAAAGA GATCCTTATG CTAGTTACAC TTATCTACCC TTCGAAACTA TAATCGGTAC TAATGGACCT ACTTATCAGT ATCTTCCTGG CTTGTCAGCA GAGATGACCC TTGATCAGAT TTATATTCAG TGGTCTGAAG GATTTGTCCC TCTCGCTTAG
|
Protein sequence | MDDQHHHSAR FDQQGSDAPE NLIIFIRDQV KPEDLWLPRE WAAENLPTRQ WLVDNGLSFT NSFTNTAMCS VSRSTFFTSK FPAQHQADLL LSDIDSSYLN DQVQLNPDFP NLATILKDQG YDVSFFGKAH LSKTFTLNDG EVVYQDMNAY GFDDWVGPDA GQDMKPENAG GGPNDNDGRF TSEAKQWLQN RIQSDNEKPY ALVVSLVNPH DVLSYPKTYN TDFEYDRKWI HGDIEILPPT VDEDKEETLK PSVQRQWMIP QNAGQPMPTD KMKLNYLNFY GNLMKRADWQ MGEILDVIRD SDNPDDVNNT MIVSTSDHGE MGMSHGGMVQ KMFNAYDESL KVPMIWSNPS YFKGSQESDA LISLIDFLPT YANFQDFSED YIAQQDLRGV DYSSILRRAR EGESKSLEGL DVQDSILYTY DDIYAGQDPA LCEDPVHGLL PAANRIQAVR TKDFKYARYY SGDQDYEPAN WEGELYDLRP EGGDYYPDID PITGQLNPFR AAPLEVRNLD PKAETRRRLL QRFGIGDGPI ATKKQKKAYL EMSELLDQQI ADRLQPLPES DPIKPSIFVY QGGSSGDQSA YKVGDSIVRF IPNSEDEKGL ELAFNTRYGQ TYNLVYSEQR DPYASYTYLP FETIIGTNGP TYQYLPGLSA EMTLDQIYIQ WSEGFVPLA
|
| |