Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_11021 |
Symbol | |
ID | 4777670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 984513 |
End bp | 985982 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640086611 |
Product | hypothetical protein |
Protein accession | YP_001017116 |
Protein GI | 124022809 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.879161 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGC AAAGCTTGCC TGCGATCGCT GGCTACGAAG CCGAAATTAT TGCCCTGGTT CAGCAAGCTA ATCCCGCTGT GCTCACGCAT AGCAATCGCA AGCTGGAGCA GTTTCAGTCA GCCTTTGCCT GTGCTCTCCA CATGCATCAG CCCACGATTC CAGCAGGTGC GAATGGAGAA CTGATCTCGC ATCTGCAATA CATGCTGGAG CACAGCGAAG AAGGCGATAA CCACAACGCC GAACCCTTTG CCCATTGCTA CAAGCGCCTC GCAGACATCA TTCCCCAACT GATTCAAGAG GGATGCAACC CTCGCATCAT GCTGGATTAC TCAGGCAACT TGCTCTGGGG TGTTGAGCAA ATGGACCGTG TCGACATTCT CGAGGCGCTT AAACGTCTGG CATGCGATCC CACGCTTCAA CCCCATGTGG AATGGCTCGG CACCTTTTGG AGCCATGCCG TTGCACCCTC AACTCCTATT CCAGATCTAA AGCTACAAAT CCTGGCCTGG CAGCATCAAT TCGCTGCCAT GTTTGGGCGG CAGGCATTGC AACGAGTGAA GGGTTTCTCG CCACCGGAAA TGCACCTTCC CAACCATCCA GACACCCTCT ATGAATTGGT GAAAGCCCTC AGAGATTGCG GATACCGTTG GCTCCTTGTT CAAGAAAACA GCGTTGAAAA CTTCGATGGC TCATGCCTTC GCCATGCACA GAAATACGGC CCCAATCAAC TTGTGGCTCG TAACTCCAGA GGGGAAACAG TCAGCATTGT GGCGTTGATC AAAACCCAGG GCTCAGACAC CAAATTGGTA GGGCAGATGC AGCCCTATCA CGAAGCATTA GGCCTGGGCA GACAATCACT GGCAGGCAAA TCGATTCCAT CATTGGTCTC TCAAATTGCC GACGGAGAAA ATGGAGGCGT AATGATGAAT GAGTTTCCAG CCGCCTTTAT CCAAGCCCAT CAAACCATTG CTTCCCAGGT TGATCCTGTA AGCACAGTTG CGCTCAACGG CACTGAATAT CTGGAGTTAC TGGAGGCCGC CGGTGTGGAA GCCTCTGATT ACCCAAAGAT TCAGGCGATA CAGCAACACA AGCTGTGGCA TAACACTGAC AGCCCCATCA ACCCCGAATC AATCGAAGCG GCCATCAGCG ACCTCAAAGA AACAGATCCC TCCTTCTCAA TGGACGGCGC ATCCTGGACC AACAATCTCA GCTGGATCAA GGGCTATGAA AATGTGCTGG AACCGATCAA CAGCCTCAGC GCAAAGTTTC ACCAGCTGTT TGATCCCTTG GTGACAAAGG ATCCAGCGAT CACGCAAACC CCGCACTATC AAGAAGCTCT GCTCTACCTG CTAATGCTAG AAACCAGTTG CTTCCGCTAT TGGGGGCAAG GCACCTGGAC CAACTACGCC AATGAGATCC ACCGGCGCGG TGAAGCCATG GTCGAAGCAG CAAACCAGGC ACTGAGATAG
|
Protein sequence | MPKQSLPAIA GYEAEIIALV QQANPAVLTH SNRKLEQFQS AFACALHMHQ PTIPAGANGE LISHLQYMLE HSEEGDNHNA EPFAHCYKRL ADIIPQLIQE GCNPRIMLDY SGNLLWGVEQ MDRVDILEAL KRLACDPTLQ PHVEWLGTFW SHAVAPSTPI PDLKLQILAW QHQFAAMFGR QALQRVKGFS PPEMHLPNHP DTLYELVKAL RDCGYRWLLV QENSVENFDG SCLRHAQKYG PNQLVARNSR GETVSIVALI KTQGSDTKLV GQMQPYHEAL GLGRQSLAGK SIPSLVSQIA DGENGGVMMN EFPAAFIQAH QTIASQVDPV STVALNGTEY LELLEAAGVE ASDYPKIQAI QQHKLWHNTD SPINPESIEA AISDLKETDP SFSMDGASWT NNLSWIKGYE NVLEPINSLS AKFHQLFDPL VTKDPAITQT PHYQEALLYL LMLETSCFRY WGQGTWTNYA NEIHRRGEAM VEAANQALR
|
| |