Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15101 |
Symbol | |
ID | 4778518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1312552 |
End bp | 1314249 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640087018 |
Product | PDZ domain-containing protein |
Protein accession | YP_001017519 |
Protein GI | 124023212 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.185527 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGAGG TTGTTGACGT CCACCTTGAT CTGTGTGACA AGGCAAGTCA GACCTTTAAG GTCAGTCTGA AATGGAAACC AAGAACACAT CGTCAGAGCT GGTCTCTACC GATATGGACC CCTGGCTCAT ACACCATTAG AGACCATGTT CAGCATCTCC ACAGTCTGAG CCTTTCTCAA GCCTCAAATG ATTGTCAAGT GCAACGGATC GGCCCAAGTG GTTGGAAGGC TGATCTCGAC ACTCTTGATC TTGTCACCCT TTGCTATGTC ATTGAGGCTC GACAGCTCAC TGTGCGCACC TGCTACTTAG ATCCGGAGTT TGCATCCCTT TGTCTTGCGG CTGCTGTGAT GGAAATCGAT GGTCAGCGTT GGACTCCTCA TTGCCTCACC CTGGACCTAC CAGTGGGTTG GAATGCCTAC GTTCCTCTGG CTGGAGAGGA AACACTCTGG GCTAAGGATT TTGATCATCT TGTCGATGCT CCTGTTCATG CTGGATGTTT CGTTTCTCAG CCCTTTGTTG TCAAAAAAAA TTCTCATCAA CTTCTCTGTA TAGGCGATCC TCCTATGGGA TGGCCGGCAA ACCTTGTAAA TGATGTGAGC GCTGTTTGTA AAGCTACTTG TTGTCTAATG GATGAACATC CACCAGCCGG AGATCTCTAC CAATTAGTGA TTCATATGTT AGAAACTGGT TATGGCGGTC TTGAGCATGA TTATGGCGCA GTTTTGCATT ATTCCTGGCG TGCATTAACT GAACCTGATG GCTATCGGAA GCTTCTACAA TTAATAGGGC ATGAGTATTT GCATCAATGG AATGTGAGAC GTCTTCGACC CAGGGAATAT CGACCTTATG ATTATTCTCA AGCTGTGATA AGTGATGGAC TTTGGTTTGC CGAAGGAATC ACTAGCTACC TGGACCTCAC CCTACCATTC CTTGCTGGGC TGAGCGATCG CTCAACATTA CTAAAAGATT TATCTCTAGA GTTTTCACCT CTATTAATCA ACCCAGGTCG ACAATTACAG AGCCTGGCTG ACAGTTCACG TGAAGCCTGG GTGAAATTAT ATAAAGCAAC ACCGGCCAGT GCCGATTCAC AGGTCAGCTA CTACAAGCTT GGTGCTGCGA TGGCCTTTTG CCTGGATGTT CGCCTACGTC AACAAAACTC GTCATTAACG CAAGTACTCC GTGACCTTTG GCGGAAGTTT GGTCGTAGTC ATCGAGGCTA TTCAAGGTTA GACATCAAAG CTGCCATCGC CAAGTTCGAT CCCAATACTG CGAATGAGGT TGATGCATGG CTTGATCAAC CTGACTCTCT CCCGTTGACT TCGATAGTTA AAGATCTTGG ACTAAGGTTT GAAGAGAGAT ATTCAAACAA AAGAGAAACA GGGCTTACCT TAGTTGAACG AGAGGGTCTT GTTTTGGTGT CACGAGTTTC TCCATCTAGT CCCGCCCATA ATGCAGGTCT TGTCGTTGGG GATGAATTGC TTGCTGTCGG CGGATTTCGA TTGCGTAAGG TCGATGATTT ATGCAAACTT ATCTCAAATG AAGAGCCTGT ATCGATAATC TATTCAAGAC GAGGACGGCT TAGTGAAACT GAACTTTCGA GTGGTTTGCC CCAAGTTGAT CACTGGGAGA TTATTATTGA TTCTGAGGCA CCATCTGAGT TAAGCAATCT ACGGGATCAA TGGTTTCAGA TTATTTAA
|
Protein sequence | MVEVVDVHLD LCDKASQTFK VSLKWKPRTH RQSWSLPIWT PGSYTIRDHV QHLHSLSLSQ ASNDCQVQRI GPSGWKADLD TLDLVTLCYV IEARQLTVRT CYLDPEFASL CLAAAVMEID GQRWTPHCLT LDLPVGWNAY VPLAGEETLW AKDFDHLVDA PVHAGCFVSQ PFVVKKNSHQ LLCIGDPPMG WPANLVNDVS AVCKATCCLM DEHPPAGDLY QLVIHMLETG YGGLEHDYGA VLHYSWRALT EPDGYRKLLQ LIGHEYLHQW NVRRLRPREY RPYDYSQAVI SDGLWFAEGI TSYLDLTLPF LAGLSDRSTL LKDLSLEFSP LLINPGRQLQ SLADSSREAW VKLYKATPAS ADSQVSYYKL GAAMAFCLDV RLRQQNSSLT QVLRDLWRKF GRSHRGYSRL DIKAAIAKFD PNTANEVDAW LDQPDSLPLT SIVKDLGLRF EERYSNKRET GLTLVEREGL VLVSRVSPSS PAHNAGLVVG DELLAVGGFR LRKVDDLCKL ISNEEPVSII YSRRGRLSET ELSSGLPQVD HWEIIIDSEA PSELSNLRDQ WFQII
|
| |