Gene P9303_15101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_15101 
Symbol 
ID4778518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1312552 
End bp1314249 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content46% 
IMG OID640087018 
ProductPDZ domain-containing protein 
Protein accessionYP_001017519 
Protein GI124023212 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.185527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGAGG TTGTTGACGT CCACCTTGAT CTGTGTGACA AGGCAAGTCA GACCTTTAAG 
GTCAGTCTGA AATGGAAACC AAGAACACAT CGTCAGAGCT GGTCTCTACC GATATGGACC
CCTGGCTCAT ACACCATTAG AGACCATGTT CAGCATCTCC ACAGTCTGAG CCTTTCTCAA
GCCTCAAATG ATTGTCAAGT GCAACGGATC GGCCCAAGTG GTTGGAAGGC TGATCTCGAC
ACTCTTGATC TTGTCACCCT TTGCTATGTC ATTGAGGCTC GACAGCTCAC TGTGCGCACC
TGCTACTTAG ATCCGGAGTT TGCATCCCTT TGTCTTGCGG CTGCTGTGAT GGAAATCGAT
GGTCAGCGTT GGACTCCTCA TTGCCTCACC CTGGACCTAC CAGTGGGTTG GAATGCCTAC
GTTCCTCTGG CTGGAGAGGA AACACTCTGG GCTAAGGATT TTGATCATCT TGTCGATGCT
CCTGTTCATG CTGGATGTTT CGTTTCTCAG CCCTTTGTTG TCAAAAAAAA TTCTCATCAA
CTTCTCTGTA TAGGCGATCC TCCTATGGGA TGGCCGGCAA ACCTTGTAAA TGATGTGAGC
GCTGTTTGTA AAGCTACTTG TTGTCTAATG GATGAACATC CACCAGCCGG AGATCTCTAC
CAATTAGTGA TTCATATGTT AGAAACTGGT TATGGCGGTC TTGAGCATGA TTATGGCGCA
GTTTTGCATT ATTCCTGGCG TGCATTAACT GAACCTGATG GCTATCGGAA GCTTCTACAA
TTAATAGGGC ATGAGTATTT GCATCAATGG AATGTGAGAC GTCTTCGACC CAGGGAATAT
CGACCTTATG ATTATTCTCA AGCTGTGATA AGTGATGGAC TTTGGTTTGC CGAAGGAATC
ACTAGCTACC TGGACCTCAC CCTACCATTC CTTGCTGGGC TGAGCGATCG CTCAACATTA
CTAAAAGATT TATCTCTAGA GTTTTCACCT CTATTAATCA ACCCAGGTCG ACAATTACAG
AGCCTGGCTG ACAGTTCACG TGAAGCCTGG GTGAAATTAT ATAAAGCAAC ACCGGCCAGT
GCCGATTCAC AGGTCAGCTA CTACAAGCTT GGTGCTGCGA TGGCCTTTTG CCTGGATGTT
CGCCTACGTC AACAAAACTC GTCATTAACG CAAGTACTCC GTGACCTTTG GCGGAAGTTT
GGTCGTAGTC ATCGAGGCTA TTCAAGGTTA GACATCAAAG CTGCCATCGC CAAGTTCGAT
CCCAATACTG CGAATGAGGT TGATGCATGG CTTGATCAAC CTGACTCTCT CCCGTTGACT
TCGATAGTTA AAGATCTTGG ACTAAGGTTT GAAGAGAGAT ATTCAAACAA AAGAGAAACA
GGGCTTACCT TAGTTGAACG AGAGGGTCTT GTTTTGGTGT CACGAGTTTC TCCATCTAGT
CCCGCCCATA ATGCAGGTCT TGTCGTTGGG GATGAATTGC TTGCTGTCGG CGGATTTCGA
TTGCGTAAGG TCGATGATTT ATGCAAACTT ATCTCAAATG AAGAGCCTGT ATCGATAATC
TATTCAAGAC GAGGACGGCT TAGTGAAACT GAACTTTCGA GTGGTTTGCC CCAAGTTGAT
CACTGGGAGA TTATTATTGA TTCTGAGGCA CCATCTGAGT TAAGCAATCT ACGGGATCAA
TGGTTTCAGA TTATTTAA
 
Protein sequence
MVEVVDVHLD LCDKASQTFK VSLKWKPRTH RQSWSLPIWT PGSYTIRDHV QHLHSLSLSQ 
ASNDCQVQRI GPSGWKADLD TLDLVTLCYV IEARQLTVRT CYLDPEFASL CLAAAVMEID
GQRWTPHCLT LDLPVGWNAY VPLAGEETLW AKDFDHLVDA PVHAGCFVSQ PFVVKKNSHQ
LLCIGDPPMG WPANLVNDVS AVCKATCCLM DEHPPAGDLY QLVIHMLETG YGGLEHDYGA
VLHYSWRALT EPDGYRKLLQ LIGHEYLHQW NVRRLRPREY RPYDYSQAVI SDGLWFAEGI
TSYLDLTLPF LAGLSDRSTL LKDLSLEFSP LLINPGRQLQ SLADSSREAW VKLYKATPAS
ADSQVSYYKL GAAMAFCLDV RLRQQNSSLT QVLRDLWRKF GRSHRGYSRL DIKAAIAKFD
PNTANEVDAW LDQPDSLPLT SIVKDLGLRF EERYSNKRET GLTLVEREGL VLVSRVSPSS
PAHNAGLVVG DELLAVGGFR LRKVDDLCKL ISNEEPVSII YSRRGRLSET ELSSGLPQVD
HWEIIIDSEA PSELSNLRDQ WFQII