Gene P9301_01041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_01041 
Symbol 
ID4911170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp105099 
End bp106220 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content32% 
IMG OID640159669 
Productserine protease 
Protein accessionYP_001090328 
Protein GI126695442 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.741511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAGAT ACATCCATAA GCTATTAGTA ATCGTCTCAT TAATTCCAAT AGGCATTACA 
TATCCTGCAA AAGTAATATC CCAACCATTA AGCAAATCAA ACATTAATGA AGTTAGTTTA
TTTTCAAACA AATCTTTCAT AACAAAAGCT GTAGAGAGAA CCGGTGCAGC TGTGGTGACA
ATTGATACTC AAAGATATGT TAAAAAAAGA AAATTTCCAA GAAATTCTCA ACTATTTATA
GACCCATATT TTGAAAGATT TTTTGGATTA GATTTGCCTA ACGAAAACCG ACCAAGGATA
GAGCAAAACC AAGGCAGTGG ATTTATATTT GCAGATGGAC TTGTCATGAC CAATGCTCAT
GTAGTGAATG GATCAGATAA GGTAATTGTT GGTTTAACCA ACGGCAAAAA ATTAAACGCC
CAACTGATAG GTCAAGACTC TTTTACTGAT TTAGCTGTGT TAAAGATTGA AGGGAAAGGG
CCTTGGCCAA AAGCAAAATT GGGCGATTCT GCAAAGATTA AAGTTGGTGA TTGGGCTATA
GCAGTTGGAA ATCCATTCGG ACTGGAAAAC ACAGTTACTC TTGGTATTAT TAGTAATCTA
AATAGAAACG TAAATCAATT AGGAATATAT GATAAAAAAC TTGAACTGAT ACAAACAGAC
GCTGCTATTA ATCCTGGCAA TTCTGGAGGT CCACTGTTGA ATAGCGATGG TGAAGTAATT
GGTATTAATA CGTTGATAAG ATCAGGTCCA GGAGCGGGTT TGAGTTTTGC AATCCCAATT
AATAAAGCTA AGGAAATTGC CTATCAACTT TTACAAAATG GGAAAGTAAT ACATCCTATG
ATTGGAATTA GCCTAATAGA AGAAAGTATT TCTGAGAGAA AAAATAAGGT TGTAAAAGTT
GGATATGTAG TACCAAACAG TCCAGCTGAA AAAAGTGGAA TCAAGATAGA TGATATTTTA
ATTAAAATAG GAAATAAAGA TATTGAAACA GCATCAGACG TAATAGAACA AATTAGTAAA
AATGGTATCA AAAAACAAGT AAATATATTA TTGAAGCGTA AAAATAAATT TATTAAATTA
AAAGTAATAC CAACTGATAT TACTAATCTA CAAAATAACT AA
 
Protein sequence
MKRYIHKLLV IVSLIPIGIT YPAKVISQPL SKSNINEVSL FSNKSFITKA VERTGAAVVT 
IDTQRYVKKR KFPRNSQLFI DPYFERFFGL DLPNENRPRI EQNQGSGFIF ADGLVMTNAH
VVNGSDKVIV GLTNGKKLNA QLIGQDSFTD LAVLKIEGKG PWPKAKLGDS AKIKVGDWAI
AVGNPFGLEN TVTLGIISNL NRNVNQLGIY DKKLELIQTD AAINPGNSGG PLLNSDGEVI
GINTLIRSGP GAGLSFAIPI NKAKEIAYQL LQNGKVIHPM IGISLIEESI SERKNKVVKV
GYVVPNSPAE KSGIKIDDIL IKIGNKDIET ASDVIEQISK NGIKKQVNIL LKRKNKFIKL
KVIPTDITNL QNN