Gene P9303_28681 details

Gene Information

Locus tag: P9303_28681
Symbol:
ID: 4776250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2538048
End bp: 2539667
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 45%
IMG OID: 640088391
Product: hypothetical protein
Protein accession: YP_001018863
Protein GI: 124024556
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [R] General function prediction only
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
        [COG0457] FOG: TPR repeat
TIGRFAM ID:
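
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2538048
end_bp = 2539667
gene_length_bp = 1620
protein_length_aa = 539

# Assumption: Start bp and End bp are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under those assumptions all three checks print True: 2539667 - 2538048 + 1 = 1620 bp, and 1620 / 3 - 1 = 539 aa.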


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.71597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGCC GCACTAGTGC AATTGCTGCT GCCTTATCGC TGCTGCCAAT AGGACAACCA 
CTGCTATTGG GCACCCTTGG CATCACAACA GCAACCACTG CAGTCGTTCT TCAACAGACA
CCAGCAATTG CTCAAGATGC TTCTGCTGTT GCACGTATCG CCAAGGCAAT CACTGTTCGC
ATAGAAGGTG CCACCCAAGG TTCAGGGGTG CTCGTCAAGC AAGAAGGCAA TCGCTACACG
GTGCTCACGG CATGGCATGT AGTCAGTGGC AATAGACCAG GAGAAGAGGT TGGGATCTAT
ACCTCTGATG GGAATGAGCA CCAACTAGAG CAAGGCAGCA TCCAAAGGTT GGGAGAGGTT
GATATGGCAG TGCTCTCCTT CTCTAGTGGC AGTGCTTATG AGGTTGCTGA AGTCGGAGAC
GTCAAAAAGG TCAAGCATGA TCAACCGATT TATGTGGCAG GTTTTCCTTT AAATAACTCA
CAAAACCTTC GCTATGAAAC TGGAGAGGTT GTTGCTAATG CAGAAGTAGG AATTGATCAG
GGTTATCAAC TGCTATACGA CAACGAAACA GTCGCTGGAA TGAGTGGAGG CGTGCTGCTT
AATGCTGATG GAGATTTGGT GGGACTTCAT GGCAGGGGAG AGAAAGATGA ACAGGCATCA
AGTGGTGAGT TAGTAATGAA GACAGGAGTT AATCAAGGCG TGCCAATTAC TTACTACAAC
CTCTTTGCAA GTGGTGCTCC TGTTGTTGTT GCCAAGAACA CTGCAACCAC TGCTGATGAC
TATCTGGCGC AAGCAAAAGC ATCCCAGTCA AGGAAGGGAA GAGAACAGAC AGTTATTAAG
TTAACAACCC AGGCATTAGC ATTGCGATCC AGTGTGGAGG GATACTTTCT TCGTGCTTAT
GCCAAGTATG ACTTAAGAGA TTATCAAGAA GCAATTGCTG ATTACACAAA GACAATAGAG
ATTCATCCGC AGAACACCGT TTCCTACAAT AACCGTGGTA ATGCCAAGCA GAAATTAAAA
GATCATCAAG GGGCAATTGC TGATTTCAAC AAGGCAATAG CAATTGATCC GCAAAATCAC
ACTGCCTACA CCAACCGCGG TAGTGCCAAG GATGATTTAG GAGATTATCA AGGGGCAATT
GCTGATTACA ACAAGGCAAT AGCAATTAAT CCGCAGGATG ACGCTGCCTA CAACAACCGT
GGTAATGCTA AGCAGAAATT AAAAGATCAT CAAGGAGCAA TTTCTGATTA CAGCAAGGCA
ATTGCAATTA ATCCGCAGAA TGCCATTTCC TACACCAACC GTGGTAATAC CAAGGATGAT
TTAGGAGATT ATCAAGGAGC AATTGCTGAT TTCAACAAGG CAATAGAAAT TAAACCAGAT
TCTGCAAATG CCTACAACAA CCGTGGTAAT GCCAAGGATG ATTTAGGAGA TCATCAAGGG
GCAATTGCTG ATTACAACAA GGCAATAGAG ATTAATCCGC AGGATGCCGT TTCTCACGCT
AATCGTGGTA TTGCCAAGGA ATTAGTTGGA GACCTCAAAG GTGCTTGTGC TGATTGGAGA
AAGGCATCCT CGCTAGGTGT TCAAGTTGTT GCTAGTTGGG TAAGAAAGCA ATGCCAATAA
 
Protein sequence
MSRRTSAIAA ALSLLPIGQP LLLGTLGITT ATTAVVLQQT PAIAQDASAV ARIAKAITVR 
IEGATQGSGV LVKQEGNRYT VLTAWHVVSG NRPGEEVGIY TSDGNEHQLE QGSIQRLGEV
DMAVLSFSSG SAYEVAEVGD VKKVKHDQPI YVAGFPLNNS QNLRYETGEV VANAEVGIDQ
GYQLLYDNET VAGMSGGVLL NADGDLVGLH GRGEKDEQAS SGELVMKTGV NQGVPITYYN
LFASGAPVVV AKNTATTADD YLAQAKASQS RKGREQTVIK LTTQALALRS SVEGYFLRAY
AKYDLRDYQE AIADYTKTIE IHPQNTVSYN NRGNAKQKLK DHQGAIADFN KAIAIDPQNH
TAYTNRGSAK DDLGDYQGAI ADYNKAIAIN PQDDAAYNNR GNAKQKLKDH QGAISDYSKA
IAINPQNAIS YTNRGNTKDD LGDYQGAIAD FNKAIEIKPD SANAYNNRGN AKDDLGDHQG
AIADYNKAIE INPQDAVSHA NRGIAKELVG DLKGACADWR KASSLGVQVV ASWVRKQCQ
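
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal standard-library sketch of both calculations, not the database's own method. The helper names `clean`, `gc_content` and `translate` are hypothetical. The codon table is built from the NCBI amino-acid string in TCAG order, which table 11 shares with the standard code (the two differ only in which codons may act as start codons). Only the first and last 30 bases of the gene sequence are used here; pasting the full sequence into `clean` works the same way.

```python
from itertools import product

# NCBI codon ordering (TCAG); table 11 uses the same amino-acid assignments
# as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i] for i, codon in enumerate(product(BASES, repeat=3))
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, usable, 3))


# First and last 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGTCTCGCC GCACTAGTGC AATTGCTGCT"
gene_suffix = "GCTAGTTGGG TAAGAAAGCA ATGCCAATAA"

print(translate(gene_prefix))  # MSRRTSAIAA
print(translate(gene_suffix))  # ASWVRKQCQ*
```

The two outputs match the start (MSRRTSAIAA) and end (ASWVRKQCQ, followed by the stop codon) of the protein sequence listed above.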