Gene P9303_20251 details

Gene Information

Locus tag: P9303_20251
Symbol:
ID: 4777760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1783257
End bp: 1784978
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 38%
IMG OID: 640087539
Product: hypothetical protein
Protein accession: YP_001018032
Protein GI: 124023725
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
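
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (both assumptions are consistent with the values in this record; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1783257, 1784978)
    print(length, "bp")                  # 1722 bp
    print(protein_length(length), "aa")  # 573 aa
```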


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.625545
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATGC CCTCGTTGCC TGAACAGAGA ACTGCTTCTG ATTTATACGC GCTAGCGGTA 
GAAAAGTACA AAAGTGAAGA ATATCAAGAA GCAATAGATG CATTCCGTAA ATCACTAGCA
CTGCAAGAAC ACTGGAATTC ATACCAAGGT CTTGGATGGG GACTATTTTA TACAAATCAA
TGTCAAGAAG CAATAGATGC ATTCCGTAAA TCACTAGCAC TACAAGAAGA CTGGAATTCA
TACCAAGGTC TTGGATGTGC ACTCTTGAGA GAAACAGTAT ACGCAGAAGC AATAGATGCA
TTCCGTAAAT CACTAGCACT ACAAGAAGAC TGGAATTCAT ACCAAGGTCT TGGATGGGCA
TTCTTTAGAG CAAACGTATA CACACAAGCA ATAGATGCAT TCCGTAAATC ACTTGCACTG
CATGAACATT GGAATGTATA CTTAGGTTTA GGACGATCAC TATTCAAGAC AAACCAATAT
CAAGAGGCAA TCGAGGCATT CAGAAAAGCG CTTGCACTTA ATAATTTAAA CTCCAATGAA
CTAACTGCTG AACTCCACAG AGAACTTGCC GATGCATATG AAGGTGCAGG CAAATCTGAT
GCTTCAATTG CTTCTTGGGA GGTCTACTTA TCTTATCTAG AACCCATCTC ATCCCTTGAT
CCATTCCTTG GAAATAGAGT TATTTACGAG CAAGTGGATC ATGAGCAGAT AGAAAGAATC
AAAAGTACAT GTGCTTCTAT TGGACTCGAC TTTAATCCCT CCCTAAAAGG GGATAATGAT
GCTTCAATCG AATCATGGAA ATATCTTATG TACTTGCATA TACCTAAATG CGGGGGGACT
TCATTTGAGA CACCCTTGTA TTTACTTAAA GAGCACTTAA AAGATAAGTC ATGTGATTTG
CCTAAAGTTA ATAGAACTAA CGATTATCTT GCAATAAGCA GATTGGCTTC AAATCATTCG
ATTGCAGCAT TCACAAATTT GATGTCATCC AATTCTTGTA ATGGTTTAAA GAACGCGTTT
CTTGGCCTTC ACGGTGCCAA ATGGAGTGCT TTGCATGATT ACATAGGCGA ATTAACCAAT
GCTTGTCCTA GAATTATTAC TACGGTACGT GACCCTCGTC AAAGGTTGTT ATCACACATC
AAGCATCAAG CGTTTCAATA TTGCACCTCA ATCGACGACC TTCTTACACT TGTAGATAAT
CAAAACAGTA TTTTCAATAA TTTAATGCAT AGACAAATCT TTGATTATGG ACTAGACGGC
GACAATCCCT GCGGAAACTC TGAACTTGGT AGCGAAAGAT TAGACTTGCT CCAAGACATG
GATTTTATTG ATATATCAGA CTCCACTACA AACTCAAAAG TCAAGTCTTC TTTTTTGAGC
GCATCTTTAT TCCCTAATAT TGTTCAAACT TCAAGATTTA ATGATTCCAA GGAACGTGAA
GAGATGTATG GTTTCAAGAT AAGTGGCAAC GACATTCAAT ATATTTTCAA GCATTGTGTG
GACAAAGGTT TTCTGGAGAA GGATCAGTCT ATTGACTATG ATTTTTTAAA AAATAGAACC
CTTGAAAGAT TGCATTTCCC TTCATTCATG GAGGCGCATA CCTGTTATAT TCACCCCTTG
ACATTTGTCA TTTTTGGTAT GAACAGATAT TCTATTGTTA CCACTAAAAA GTTTCTAGAT
AATCCTCACC ATCTGCTTCA GGAACTCAAT CAATCGCTTT AA
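
The gene sequence is displayed in blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the Gene Information fields. A minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool:

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First display line of the gene sequence, as shown on this page.
    first_line = "ATGAGCATGC CCTCGTTGCC TGAACAGAGA ACTGCTTCTG ATTTATACGC GCTAGCGGTA"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions over all of the sequence lines would give the values to compare with the reported gene length and GC content.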
 
Protein sequence
MSMPSLPEQR TASDLYALAV EKYKSEEYQE AIDAFRKSLA LQEHWNSYQG LGWGLFYTNQ 
CQEAIDAFRK SLALQEDWNS YQGLGCALLR ETVYAEAIDA FRKSLALQED WNSYQGLGWA
FFRANVYTQA IDAFRKSLAL HEHWNVYLGL GRSLFKTNQY QEAIEAFRKA LALNNLNSNE
LTAELHRELA DAYEGAGKSD ASIASWEVYL SYLEPISSLD PFLGNRVIYE QVDHEQIERI
KSTCASIGLD FNPSLKGDND ASIESWKYLM YLHIPKCGGT SFETPLYLLK EHLKDKSCDL
PKVNRTNDYL AISRLASNHS IAAFTNLMSS NSCNGLKNAF LGLHGAKWSA LHDYIGELTN
ACPRIITTVR DPRQRLLSHI KHQAFQYCTS IDDLLTLVDN QNSIFNNLMH RQIFDYGLDG
DNPCGNSELG SERLDLLQDM DFIDISDSTT NSKVKSSFLS ASLFPNIVQT SRFNDSKERE
EMYGFKISGN DIQYIFKHCV DKGFLEKDQS IDYDFLKNRT LERLHFPSFM EAHTCYIHPL
TFVIFGMNRY SIVTTKKFLD NPHHLLQELN QSL
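
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation, written without external libraries: table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which this sketch does not model (the gene here begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace used for display is ignored. Alternative start codons are
    not converted to M; that handling is outside this sketch.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("ATGAGCATGC CCTCGTTGCC TGAACAGAGA"))  # MSMPSLPEQR
```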