Gene P9303_07871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_07871 
Symbol 
ID4778584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp721611 
End bp722879 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content54% 
IMG OID640086296 
ProductZn-dependent protease 
Protein accessionYP_001016803 
Protein GI124022496 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGAAG GCTGGGAGCT GATGAAGATT CGGGGAATCC CTTTAAGGGT TCATCCCAGC 
TGGTTTGTGA TCCTGTTGCT TTTTACCTGG ATTTCACAAA ATCAGGTGTC CGCTGCAGCT
GAATCTTCGC TTCCAGCTTG GATCAGCTGG GGTCTGGGGT TGATTACGGC CCTGCTGTTG
TTTTTGTCTG TTCTCCTGCA TGAGCTAGGC CACTCTTTGG TTGCACTGCG TGAGGGGGTC
AAGGTTCGCA GCATCACGCT TTTTTTCCTG GGAGGTGTTG CCAGCGTTGA GAGGGAGTGC
TCTACACCGA TGGCCTCTTT AAGGGTTGCT GCAGCCGGCC CACTGGTGAG CTTGGTGTTG
GCTGTTGCCT TGCTGACGGG AGGGGTGTAT GCAGCTGATC ACGTCAATCC GCTGCTTGCC
AATCTCGTTG GGCAGTTGGG TGGGCTCAAT TTGTTGCTTG CCCTTTTTAA CTTGCTACCT
GGGTTGCCTC TTGATGGAGG CTTGATCCTT AAGGCTTTGG TCTGGCAGTG GACTGGCAGT
CAGAGGAAGG GTGTCCAGGT CGCCACAGCA ACTGGCCGTG CCCTGTCTCT CTCGGCAATG
GTGTTGGGGG GTTGGTTGCT CTTCTTTAAA GGTGGTGGGA TCGGTGGGCT TTGGCTGTTG
ATGCTTGGTT GGTTTGGTCT CGGTGCATCT CGCTCTCAAA CCCAGCTACT TGCCTTGCAG
AAGGTCTTGC GTGAGCTCAA CGTGGGCCTG GCTGCTGGGC GCAACTTCCG TGTGCTTGAA
GATGACCAGT CGTTGCGCAG GCTTAGTCAG TTGCGTTTGT CTGGAAGCGA GGAGCAGTCT
CCTCCGGCGT GGGTTTTGGT TTGTCGCTCT GGTCGATGGG TTGGTTACAT GACGGACCAA
CCCTTAAAAG AATTGCCTGT GCAGCAATGG GATAGGCAAT GCCTGGCGGA TCACATGAAA
CCGATATCTG AGTTGCCTTC CATTGGCGAG AAAGCCCCTT TATGGCAGGC GGTGTTGGCA
CTAGAACAGG CTGAGGAGGG CAGGCTTCTT GTCTTTAATG TTGCTGGTCT TCCTTGCGGA
ACATTAGATC GAATTGATCT CTCCGAAGCT GTTCTTAAGC GTCTTGGGGT AAGGCTTCCT
GCTCAGTTTC TCGAAGCTGC TCGCCGTCAG AACACCTATC CCCTGGGTAT GGCACTGCCT
AAAGTTGTGG AGTCGATGAT CTCTGGCGGA TTGGTTGAGC AGCCTGAGGC ATCCAGCAGT
ACTTCATAG
 
Protein sequence
MGEGWELMKI RGIPLRVHPS WFVILLLFTW ISQNQVSAAA ESSLPAWISW GLGLITALLL 
FLSVLLHELG HSLVALREGV KVRSITLFFL GGVASVEREC STPMASLRVA AAGPLVSLVL
AVALLTGGVY AADHVNPLLA NLVGQLGGLN LLLALFNLLP GLPLDGGLIL KALVWQWTGS
QRKGVQVATA TGRALSLSAM VLGGWLLFFK GGGIGGLWLL MLGWFGLGAS RSQTQLLALQ
KVLRELNVGL AAGRNFRVLE DDQSLRRLSQ LRLSGSEEQS PPAWVLVCRS GRWVGYMTDQ
PLKELPVQQW DRQCLADHMK PISELPSIGE KAPLWQAVLA LEQAEEGRLL VFNVAGLPCG
TLDRIDLSEA VLKRLGVRLP AQFLEAARRQ NTYPLGMALP KVVESMISGG LVEQPEASSS
TS