Gene NATL1_18911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18911 
Symbol 
ID4779194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1553275 
End bp1554420 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content39% 
IMG OID640085180 
Producttrypsin-like serine protease 
Protein accessionYP_001015711 
Protein GI124026596 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.58243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGCC TTAGAAAGTT AGAAATTATG CGGCGATTCA TTGGTATATT TTCCCTTTTC 
TCAATCCTTT TTTGTTTTTT GTTTGAGCCA ATATCTGTAT TCGCGCTAGC AGATTTTCAA
GAAAGCGAAT CACACAGTTT TGTTGCCAAT GTAGCTAGCA AAGTTTCACC TTCGGTCGTA
AGGATTGATA TTGAAAGAGA GTTTCAAACA GATGAATTTG AATCTGATTT GTTAGATCCC
TTACTAAAAG ATCTTTTAGG GGATTTGGGA ACTTTTCCCA AAAAAGAGAG AGGGCAAGGC
TCTGGGGTCA TAATCGATAG CTCTGGATTA GTTCTCACCA ATGCTCATGT AGTGGAAAGA
GTTGATCGTG TGATAGTTAC TCTTCAAAAT GGAAATCAAG TGGATGGAAC AGTTGTTGGA
ACCGATCAAG TTACTGACTT AGCTTTGGTG AAAATTAAAG AATTTCCTGA TTTAGAAAGT
GCAAAATTGG GTGATTCAGA AGATATCCAA GTTGGGGACT GGGCAATTGC TCTTGGAACA
CCTTATGGCC TTGAAAGCAC TGTCACGCTG GGTATAGTCA GCAGTCTTCA TAGAGACATT
AATTCTCTTG GCTTTTCTGA TAAGAGATTG GATTTAATTC AAACTGACGC AGCAATAAAT
CCTGGGAATT CCGGCGGACC ATTGATAAAT GCAAATGGAG AAGTTATTGG AATTAATACT
TTAGTCAGAT CAGGTCCAGG GGCTGGACTT GGATTCGCAA TCCCAATAAA TCTTGCTTCA
AAAGTTACCA ATCAACTGCT CACTAACGGT GAGGTTATTC ATCCTTACTT GGGTGCTCAG
TTGGTTTTAT TGAATGAAAG AATAGCTAAA GAACATAATC AAGATCCAAA TGCATTGATT
TTTTTGCCTG AAAGGTCAGG AGCCCTAGTT CAATCAGTTA TCCCTCAAAG TCCAGCAGAG
GAAGGAGGTT TAAGACGGGG TGATCTTGTA ATTAATGCAG GGGGTAATGC AATTAATGAT
CCTAGGTCTT TACTCATGCA AGTTGAAAAT GCTCAAATAG GAAAGCCATT TGAATTGGAA
GTGGTTCGAA ATAATAAAGA GATTAATCTT TCTATTAAGC CAGCTGCTTT ACCAGGAATT
AGCTAG
 
Protein sequence
MQSLRKLEIM RRFIGIFSLF SILFCFLFEP ISVFALADFQ ESESHSFVAN VASKVSPSVV 
RIDIEREFQT DEFESDLLDP LLKDLLGDLG TFPKKERGQG SGVIIDSSGL VLTNAHVVER
VDRVIVTLQN GNQVDGTVVG TDQVTDLALV KIKEFPDLES AKLGDSEDIQ VGDWAIALGT
PYGLESTVTL GIVSSLHRDI NSLGFSDKRL DLIQTDAAIN PGNSGGPLIN ANGEVIGINT
LVRSGPGAGL GFAIPINLAS KVTNQLLTNG EVIHPYLGAQ LVLLNERIAK EHNQDPNALI
FLPERSGALV QSVIPQSPAE EGGLRRGDLV INAGGNAIND PRSLLMQVEN AQIGKPFELE
VVRNNKEINL SIKPAALPGI S