Gene P9303_00661 details

Gene Information

Locus tag: P9303_00661
Symbol:
ID: 4776987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 62605
End bp: 65856
Gene Length: 3252 bp
Protein Length: 1083 aa
Translation table: 11
GC content: 46%
IMG OID: 640085566
Product: hypothetical protein
Protein accession: YP_001016088
Protein GI: 124021781
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
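
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 62605
end_bp = 65856

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3252
print(protein_length)  # 1083
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.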


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.464249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAC AAGAAGCTGG CCACAAACAG CTCGTCAACT CGCGATTTGA TCGCTGGCAA 
GATGAGCTCA GCAAACCAAT GCAACAATCA ATCGCATTTT TCTCCAATCC TGAAAACTTA
ACCAACAAGG ATTCAAGCGA TGTGTTCCGC TGGGTGATTG CAGGACCAAC TCAAAAAAAC
CTTTCTGCAT TTAAACAACA AGCAGCCAGG GTCGGAATCA AAATTCAAGA AGAGAATCTG
AGCAATGCAG GATTCGGAAT TGATGAGCTA CAACTAATCG CCTTCGAACC TGGGTCCTTA
TCAGATTTTC AAAGGCTGCA AACAGCAGCG AAAGAACTTA ACCTGGAACT CTGGCAAGAA
GTGGTTCAAA CACACTCCGG TGATGGGCCT ATCGACACAG CTGCCATTAC AGAAGGAACC
TTCGATCTCA ATCTTAACGA CAACAATACA GCCACTGATG TCAATGGTGA CGGCACTCCA
GACCCTCTCT TTCATCTATT AGCAAGCAAT GATGCTAACG GTAGTTATGG CGTCAATGCT
GTTGGAGCCT GGAGCCAAGT CAGTGGAGAG GGTATCACTG TTGGAGTTCT AGATACATTC
TTTGATCTAA ATCATACAGA TCTAAATGCG GCTATGCCAA CTAATTTTGA CTGGGACAAT
GATGGTCAAA ATGATGGTGT AGACAACAAC AACAACAACA TTCCAGATCT CTTCGAAAGT
GAGCAATTCA CCCATGCCCT AGGTTCACCT AATTGGCCCG TAAATAACCC ACCACAACCA
ACACCCCCTA ATCAATCTCA TGGTACGGCT GTTAGTGGAA TCGCGGTAGG CAGAAGCAAT
GGAAACTCAG GCATTGGTGT TGCTCCTGAA TCCAACTGGA TCCCAGATGG CTTTCTTGAC
CATCAAAATC TATGGCCATC TCAGAATTAC TACAATTACG CTGATGTCGT TAACAATAGT
TGGGGGATGC CGAATACAGC CGGCGTTTTC CAAACATGGA CTCCTCAACG ACTTGCCAAC
TGGCAACTTG CCACAGATGG AGCTATTCAA GTTGTTACTG CAGGCAATGA CAGAGACCCA
GGCAATACAG CAAATCAGGG ATGGAGCAAT ACAAACAATT CTGAAAAGAC TAGAAGAGAA
AATATTGTTG TTGCAGCAAC AATGCGCAAT GGTGAAGTAG AACAATACAG CACACCTGGC
GCCTCAGTTT TTGTCAGCGC CCCTGTCAAT GGATCAAACT TCAGGTTTGC AAACTCATTT
TTCGCCAACG CAGGAGTCCA ACGCACAACA ACAGCCGATG TCACAGACAA TGTGGCGTCC
AATGCAGACA ATTCGGGTTA CATGAATGGG CCCACTGACA CCAGATTCAA TGGCACATCA
GCCGCAGCAC CGATGGTGAC AGGAGCTATC GCCTTGATGT TAGAAGCGAA TCCAACACTC
ACTGTAAGGG ATATTCAGCA CATCCTTACT GAAACATCAG TCAAAAATGG CCTAATAGAT
AGTGATGGAG ACGGTTTACT CGATGCCATT AACCCCAATG CAGGCGGTAA TGCAGCGTTC
CCAGGCGCAG CAGGAACAAT TGAACTAAGA AATTCACTAA CAGCGGGTGT TAACTCGACT
TTTAACATTG CGGATGGTCA CAACACCGGA TGGTTTGTCA ATGGTGCCGG TCATTGGGTT
AGTGATTCCT TCGGTTTCGG CATCGTTGAT GCAGGAGCAG CTGTTGCATT AGCAAACAAC
TGGACAAATG TAGGAGATGA GCTCAAAGTC ACCACTGACA CGATTCTAAA CAACCCATAC
ACCATTCAAG AAGGCATTCT AGGTGGACTC AATTCACTCA CAAATGCAGG CTCTTGGAAT
GTCAACAACC ACATCGAACT GGAATGGGTT GAACTCACTC TGAACTTGAA CCTGCCAGAA
CAAGATGAGG TGATGCTGGC GATTCAATCA CCATCTGGAA CCAGATCAGT GTTAATGGCT
CCTGGTGGAA GCGATGCAAC CGCATTTAAT GGTCAGCGCA CCCTGATTAC AAATCAGTTC
TGGGGTGAAA GCGCAAATGG ACAGTGGAAT ATCGAAGTTC TTGATGTGAA CAATGATGGT
GACAATGCCA CCATCTCAAA CGCAACTCTG GACCTCTACG GAACCTGTAA TCAAACTTGC
CCCCTTGAGG TTAAATCCTT CAAAGAACTA TCAAACAGTG GATTTGGTCT CGGCCAGTTA
GCCAATCAAT TCCTGCAAGA TGGTGGTGCA AACCAAGGTA GCTATCAACT TCACTCAGTG
ATGTCGATTG GCGATTGGGA ATCCTATGGA AATTTCACAG AAGGCTACAA CACAGGGCTG
AAGATTGATG AAGGATTGAT CTTAACCAGT GGTCGCGCCA AAGATGCTAT TGGACCAAAC
TCATCACCTA GCACGTCAAC AAATTGGCAA AATGTTGGCC ATCCACTACT AGGTGCAAAT
AGCAAGGATG CTTCAGGAAT GGAGATTCGT TTCTCCCCCA ATCAAGACAT GGTCTTGGAT
TGGAATGCAC AATTTGGTTC AGAAGAATTT GACGAGTATT CGCCCAGTAT TTTTGATGAC
AACGCCGGCA TCTTCTTTAC TGAAATTACT GATCAAAAGG ACCCACTAGT TGGATACAAC
CCAACAAACC TCCTCTCCGG TCCTAATCAA GGTCCCTTCT CAGTGAATGG CTTCAGCGAA
AACCCTGGCA TCTTTGAGAA ATGGATGAAT ATGACCGAGC CTTGTGGACC AGTGAGCTGG
GAATATGATG GAGGAACAAA CTTTGCAATC ACCTCCAAAA AAGCGGTGCT GGAAAAAGGC
AAGACTTATG TACTCGCACC AATAATTGGT GATGCAACCG ATCATATCTA TGACAGCGGC
ATCATCATCG GTCCAAACAA GCCAATTTTC AACTTACCCA AGCTCCCGAG GTTATGGGAG
CCACGGAAAA AATCTCTTCC ATTCCGCAAA GAAGATCTCA TTCACATTGA CGTGAAACCC
GAAAACACTG GAGCGAATGA CAACCTGGAT GCTCTGAGCA AACTTGGCCA AGTCTCCTTC
GCTGAGGCCT CTACACGTAG TCTCGAAGAC ATCGAAATCT TCACCGGTCG TTTGCTTGAG
GCCTTCTTCA CAGGTAACAA CCTCTCCAGA GAGCAAGTCA AAACCATGCT CACTGGCCTC
GATTCCGAAG ACGCCATGAA CAACCAGCTC TTGAGCAACC ACTTCGCCCC TGAAGTTGCA
AGAGTGATTT GA
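
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction helper; the function names `clean_sequence` and `gc_fraction` are illustrative and not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGAAAAGAC AAGAAGCTGG CCACAAACAG CTCGTCAACT CGCGATTTGA TCGCTGGCAA"
seq = clean_sequence(sample)

print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full sequence, `len(seq)` should match the Gene Length field and `gc_fraction(seq)` can be compared with the reported GC content.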
 
Protein sequence
MKRQEAGHKQ LVNSRFDRWQ DELSKPMQQS IAFFSNPENL TNKDSSDVFR WVIAGPTQKN 
LSAFKQQAAR VGIKIQEENL SNAGFGIDEL QLIAFEPGSL SDFQRLQTAA KELNLELWQE
VVQTHSGDGP IDTAAITEGT FDLNLNDNNT ATDVNGDGTP DPLFHLLASN DANGSYGVNA
VGAWSQVSGE GITVGVLDTF FDLNHTDLNA AMPTNFDWDN DGQNDGVDNN NNNIPDLFES
EQFTHALGSP NWPVNNPPQP TPPNQSHGTA VSGIAVGRSN GNSGIGVAPE SNWIPDGFLD
HQNLWPSQNY YNYADVVNNS WGMPNTAGVF QTWTPQRLAN WQLATDGAIQ VVTAGNDRDP
GNTANQGWSN TNNSEKTRRE NIVVAATMRN GEVEQYSTPG ASVFVSAPVN GSNFRFANSF
FANAGVQRTT TADVTDNVAS NADNSGYMNG PTDTRFNGTS AAAPMVTGAI ALMLEANPTL
TVRDIQHILT ETSVKNGLID SDGDGLLDAI NPNAGGNAAF PGAAGTIELR NSLTAGVNST
FNIADGHNTG WFVNGAGHWV SDSFGFGIVD AGAAVALANN WTNVGDELKV TTDTILNNPY
TIQEGILGGL NSLTNAGSWN VNNHIELEWV ELTLNLNLPE QDEVMLAIQS PSGTRSVLMA
PGGSDATAFN GQRTLITNQF WGESANGQWN IEVLDVNNDG DNATISNATL DLYGTCNQTC
PLEVKSFKEL SNSGFGLGQL ANQFLQDGGA NQGSYQLHSV MSIGDWESYG NFTEGYNTGL
KIDEGLILTS GRAKDAIGPN SSPSTSTNWQ NVGHPLLGAN SKDASGMEIR FSPNQDMVLD
WNAQFGSEEF DEYSPSIFDD NAGIFFTEIT DQKDPLVGYN PTNLLSGPNQ GPFSVNGFSE
NPGIFEKWMN MTEPCGPVSW EYDGGTNFAI TSKKAVLEKG KTYVLAPIIG DATDHIYDSG
IIIGPNKPIF NLPKLPRLWE PRKKSLPFRK EDLIHIDVKP ENTGANDNLD ALSKLGQVSF
AEASTRSLED IEIFTGRLLE AFFTGNNLSR EQVKTMLTGL DSEDAMNNQL LSNHFAPEVA
RVI
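
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in its amino-acid assignments). The `translate` helper is illustrative; it stops at the first stop codon and marks unrecognized codons as `X`.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAGAC AAGAAGCTGG CCACAAACAG"))  # MKRQEAGHKQ
```

The first ten residues match the start of the protein sequence, and the last four codons of the gene (`AGA GTG ATT TGA`) translate to the closing `RVI` followed by a stop.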