Gene Information |
Locus tag | P9211_10671 |
Symbol | |
ID | 5730338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 963757 |
End bp | 964899 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285434 |
Product | hypothetical protein |
Protein accession | YP_001550952 |
Protein GI | 159903608 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | |
|
|
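The coordinate and length fields above are mutually consistent, and a short Python sketch can check this. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(963757, 964899)
print(length)                  # 1143
print(protein_length(length))  # 380
```

Under these assumptions the values reproduce the Gene Length (1143 bp) and Protein Length (380 aa) fields listed in the record.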
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.43083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGGGG CTTTTTGGTG GCATATAGGG CTTGCATGGG GACTCCAATT AGCCGCTCAA AAAGCAAAGC GAAATGGTGA CGAGAAAGTC TGGAAAGAGA TTAGAAATAG CCTTGAAGAT AAAAGTTACT TACAGGAGGG GCCTGATCTA CTTAAAAAAT ATGATTCAAG AGGAATGGTA AATCAATGGT TTGAAGAATC AAATAAAGCA AACCCAATCT GGAAGACTCA TAAACCGCTG AAAACTTGGC TCCAACAACC TATCTTGCTA ATTGGAGGAT GGTGGGACCC TCACCTAAGA GGGATTCTTG ATCTTTATGA AAAATCACTC ATAGTAGGTG GTCAACCAGA TCTGCATATT GGACCAGCTT CTCATTTGCG ATGGTGGGAA GAAGTACAGC AAATTCATTT AGATTTCTTC AACAAATATC TTCAACCAAG CAATACTCTT AAGGTCCCAT CTAGCAGACA ACAAAAACTT TGGAATATTA CAAGCAAAAA ATGGTTTGAC CTACAACCTA TAGACACTAG AAATAGGATT TGGCATCTAA GTACTGGAGG GAATGCATGC ATAGATTCAA CCGATGGAGA ACTTACTCAA TTCGGCAAAG GGCAGGGGGA ACTCTCTATT GTTCATGATC CATGGAGACC CGTTTCTTCA ATAGGAGGTC ACCTAAGTCC AGACCCTGGT ATTGCAAATA GAGCCGAAAT TGACAAGAGA AACGATGTAG CAACTTTTAC CTCAAAACCA CTTGAAGAAA GAATTCAACT AAAAGGAGTT CCTAAGCTTG AAATTATTGC AATGGCTGAT AGGAATGGAT TTGATCTATG CGTTGCAATT TCAATTATTC AGCAAAATTC AAAAGAAGTA CTGCAGATTT CTACAGGAGT ACTTCGTCTA GTTGGGAATA AAGCCAAAAG TACACTCAAA AGAAATGTGA CGCTGCAGCC ATTATTCGCA GATATTCATA AAGGAGATCG CCTTAGATTG TCAATATCTG GAGCAGCTTG GCCAGCTATT GCTATAAACC CAGGAGACCC AAGTTATAAC TGTGGGTCTC CATCTCCATA TTGTCTAGTA ACGACAATCT CCCTAGAACT TTCTCAGGCC AAGCTAGAGA TCTGTCCACT CTTCTCAAAA TAA
|
Protein sequence | MGGAFWWHIG LAWGLQLAAQ KAKRNGDEKV WKEIRNSLED KSYLQEGPDL LKKYDSRGMV NQWFEESNKA NPIWKTHKPL KTWLQQPILL IGGWWDPHLR GILDLYEKSL IVGGQPDLHI GPASHLRWWE EVQQIHLDFF NKYLQPSNTL KVPSSRQQKL WNITSKKWFD LQPIDTRNRI WHLSTGGNAC IDSTDGELTQ FGKGQGELSI VHDPWRPVSS IGGHLSPDPG IANRAEIDKR NDVATFTSKP LEERIQLKGV PKLEIIAMAD RNGFDLCVAI SIIQQNSKEV LQISTGVLRL VGNKAKSTLK RNVTLQPLFA DIHKGDRLRL SISGAAWPAI AINPGDPSYN CGSPSPYCLV TTISLELSQA KLEICPLFSK
|
| |
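The gene sequence above is shown in space-separated blocks of 10 bases, so the spaces have to be removed before any computation. The Python sketch below shows one possible way to do that, translate the cleaned sequence, and compute a GC fraction. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code. The sketch does not handle alternative start codons (for example GTG or TTG), which would be emitted as V or L instead of M.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGGGGGGG CTTTTTGGTG GCATATAGGG"
print(translate(first_blocks))  # MGGAFWWHIG
```

The printed residues match the first block of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.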