Gene PMN2A_1021 details

Gene Information

Locus tag: PMN2A_1021
Symbol:
ID: 3606407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1516665
End bp: 1517810
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 38%
IMG OID: 637687890
Product: trypsin-like serine protease
Protein accession: YP_292214
Protein GI: 72382859
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
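
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1516665
end_bp = 1517810
gene_length_bp = 1146
protein_length_aa = 381

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the reported values.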


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.75122
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAGTC TCAGAAAGTT AGAAATTATG CGGCGATTCA TTAGTATATT TTCCCTTTTC 
TCAATCCTTT TTTGTTTTTT ATTTGAGCCA ATATCTGTAT TCGCGTTAGC AGATTTTCAA
GAAAGCGAAT CACATAGTTT TGTTGCCAAT GTAGCTAGCA AAGTTTCACC TTCGGTCGTA
AGGATTGATA TTGAAAGAGA GTTTCAAACA GATGAATTTG AATCTGATTT GTTAGATCCC
TTACTAAAAG ATCTTTTAGG GGATTTGGGA ACTTTTCCCA AAAAAGAGAG AGGGCAAGGC
TCTGGGGTCA TAATCGATAG CTCTGGATTA GTTCTCACCA ATGCTCATGT AGTGGAAAGA
GTTGATCGTG TGATAGTTAC TCTTCAAAAT GGAAATCAAG TGGATGGAAC AGTTGTTGGA
ACCGATCAAG TTACTGACTT AGCTTTGGTG AAAATTAAAG AATTTCCTGA TTTAGAAAGT
GCAAAATTGG GTGATTCAGA AGATATCCAA GTTGGGGACT GGGCAATTGC TCTTGGAACA
CCTTATGGCC TTGAAAGCAC TGTCACGCTC GGTATAGTCA GCAGTCTTCA TAGAGACATT
AATTCTCTTG GCTTTTCTGA TAAGAGATTG GATTTAATTC AAACTGACGC AGCAATAAAT
CCTGGGAATT CCGGAGGACC ATTGATAAAT GCAAATGGAG AAGTTATTGG AATTAATACT
TTGGTCAGAT CAGGTCCAGG GGCTGGACTT GGATTCGCAA TTCCAATCAA TCTTGCTTCA
AAAGTTACCA ATCAACTGCT CACTAACGGT GAGGTTATTC ATCCTTACTT GGGTGCTCAG
TTGGTTTTAT TGAATGAAAG AATAGCTAAA GAACATAATC AAGATCCAAA TGCATTGATT
TTTTTACCTG AACGATCAGG AGCACTTGTG CAGTCTGTTA TCCCTCAAAG TCCAGCAGAG
GAAGGAGGTC TAAGACGGGG TGATCTTGTA ATTAATGCAG GGGGTAATGC AATTAATGAT
CCTAGGTCTT TACTCATGCA AGTTGAAAAT GCTCAAATAG GAAAGCCATT TGAATTGGAA
GTGGTTCGAA ATAATAAAGA GATTAATCTT TCTATTAAGC CAGCTGCTTT ACCAGGAATT
AGCTAG
 
Protein sequence
MQSLRKLEIM RRFISIFSLF SILFCFLFEP ISVFALADFQ ESESHSFVAN VASKVSPSVV 
RIDIEREFQT DEFESDLLDP LLKDLLGDLG TFPKKERGQG SGVIIDSSGL VLTNAHVVER
VDRVIVTLQN GNQVDGTVVG TDQVTDLALV KIKEFPDLES AKLGDSEDIQ VGDWAIALGT
PYGLESTVTL GIVSSLHRDI NSLGFSDKRL DLIQTDAAIN PGNSGGPLIN ANGEVIGINT
LVRSGPGAGL GFAIPINLAS KVTNQLLTNG EVIHPYLGAQ LVLLNERIAK EHNQDPNALI
FLPERSGALV QSVIPQSPAE EGGLRRGDLV INAGGNAIND PRSLLMQVEN AQIGKPFELE
VVRNNKEINL SIKPAALPGI S
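
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. As a rough sketch, the helper functions below (hypothetical names, not part of any database tooling) strip the whitespace and compute a GC fraction; only the first line of the gene sequence is used here as sample input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only (sample input, not the full gene).
first_line = (
    "ATGCAGAGTC TCAGAAAGTT AGAAATTATG CGGCGATTCA TTAGTATATT TTCCCTTTTC"
)

seq = clean_sequence(first_line)
print(len(seq), "bases in the sample")
print(f"GC fraction of the sample: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence through the same two functions would give a value that can be compared with the GC content field.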