Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_05641 |
Symbol | |
ID | 4911874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 493302 |
End bp | 494525 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640160144 |
Product | Zn-dependent protease |
Protein accession | YP_001090788 |
Protein GI | 126695902 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.394997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAAGTT GGCAAATTTT TAAAATATGG GGAATTCCCT TTAAAGTTCA TCCCTATTGG TTTGTTATTC TCTTTTTATT CTCATGGAGT ATAAGTAATC AGATCAATTT ATCTTCTAGC GATATCTACA ATAATAAAGA AGCTTGGATA ATAGGGTTTT TAACTTCTTT TTTCTTATTA TCTTCAATTA TTTTTCATGA GGTTTTTCAT ACTTTTGTTT CACTCAATCA GGGTGTAAAA ATAAAAAAAA TCACTTTTTA TTTTTTAGGA GCAATTTTAC AAATAGATAA GTATTGTCAA ACTGCTTTAG GTAATATAAA AATTGCAATT GTTAGACCTC TTTTATGTTT CGCTACAGCA TCTATTCTAC TTTTAATTAG TAATAACAGT GCATCTCAAG AACAAATAGC AGTTAATGTA ATTTCAAGAG TAGGTATATT TAATTTATTC TTAGGCTTCT TAAATTTGAT TCCAATTGGT TCTTTAGATG GAGGGAATTT ATTAAAAAGC ATCATTTGGC ATTTCTCAGG GAGTAAAAAT AAAGGAAGAA ATTTCCTCAA TAAAGTAAAT TTATTATTAT CTTTTTGTGT TTTATTTTTT GGGATAATTT GTTTATTTAG ATTTAACTTT TACTTTGGTT TCATTCTTTC TTTTTTGGGC TTGTTTGGAG TTAATTCTTC AAAATCTGAA AGTCAATTTA TAAAAATTGA AAACATACTT AAATTTAGTA AAGTTTCTGA GATCAAATTA AAGCCTTTGA GGAAAATCGA ATACGATGCA AATTTCTCAG AATTTAATAA ATTAATAAAA AATAAGAAGG ATGCATCGGA TAAATATTTT TTTGTTACGA ATAATGGTAG ATGGACCGGT TTTGTTGATG ATAATATTTT AAAAGCTGTT TCCTTAAAAA AATGGGAACG GAACTTTGTT GGAGATTTTA AGAAACCAAT CGATAGTTTC GAAAGTGTAT CTTATAACGA TAAATTATGG AGAACTATAG AAAGACTTGA AGAAACAAAT GAAGGTTTTT TATTAGTTCT CAATGCTGCA GACATCCCTT TGGGGATTAT TGATAGGTCA AAAATTGGAA ATTTTGTATT GAATAAATTA GGTTTTAATT TGCCTTCAGA TATTGTTAAA AAATTAAACT TTAAAAATAA TTACCCCTTA GGAATTGAAT TGCCGAGAAT AATTACTTCA ATGAAGCAGA AAGGAGATCT TTAA
|
Protein sequence | MRSWQIFKIW GIPFKVHPYW FVILFLFSWS ISNQINLSSS DIYNNKEAWI IGFLTSFFLL SSIIFHEVFH TFVSLNQGVK IKKITFYFLG AILQIDKYCQ TALGNIKIAI VRPLLCFATA SILLLISNNS ASQEQIAVNV ISRVGIFNLF LGFLNLIPIG SLDGGNLLKS IIWHFSGSKN KGRNFLNKVN LLLSFCVLFF GIICLFRFNF YFGFILSFLG LFGVNSSKSE SQFIKIENIL KFSKVSEIKL KPLRKIEYDA NFSEFNKLIK NKKDASDKYF FVTNNGRWTG FVDDNILKAV SLKKWERNFV GDFKKPIDSF ESVSYNDKLW RTIERLEETN EGFLLVLNAA DIPLGIIDRS KIGNFVLNKL GFNLPSDIVK KLNFKNNYPL GIELPRIITS MKQKGDL
|
| |