Gene P9301_05201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_05201 
Symbol 
ID4912351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp451498 
End bp453003 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content32% 
IMG OID640160100 
Productcarboxypeptidase Taq (M32) metallopeptidase 
Protein accessionYP_001090744 
Protein GI126695858 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.322027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCTGAAA CTCATTGGAA AAAGCTGGGT GCTTACCTTA AAGAAACACA AATATTAGGT 
TCAATCCAAA ATACACTTTA TTGGGATCAG AATACTGGAA TGCCAAAAAA AGGGGCTTAT
TGGAGGTCTG AACAACTTAC TTATATTGCA AAAGTATTGC ATGAAAGAAA TTCTTCCGAG
GAATTTTCTA ATCTGATACA ATCTGCAAAA AATGAACTAG CAGATATTGA AAGAAATTCC
GATAATCAAC TTTTCATAAA AGATAAAGAA AGAAATATTA GTCTTTTATT GAAGGAATTT
AATAGAGAAA GAAATTTAGA TCCTAAATTA GTTGAGTCTT TAGCAAAGGC AAAATCTAAA
GGATATGAAA GCTGGCAAGA AGCTAAGGAA AAATCAGATT TTAAAATTTT TCTTCCTTTC
TTTGAAGAAT TAGTTAAATT GCGGATTGAA GAGGCAAAGC AAATATCTAT TAAATGTTCA
CCTTGGGAGA CATTAGCCCA ACCCTTTGAG CCTGAATTAA ATTTGAAATG GTTGAACAAA
ATTTTTCAAC CTTTGAAAGA AACCATCCCA GGCTTGATTA GAGGACTTAA CAAGTCCCAA
AAAAATCAAT GGGATTTAAG TCCAGAATCT CAAAAAAAAT TATGTTCTAA ATTACTTGAC
GAGTTTGGAA GAGATAGAGA TCTCGTAGTT GTTGGACAAT CTCCCCATCC TTTTTCGATT
ACATTAGGGC CAAATGATTT TAGGATCACT ACAAGAATTG TTGAAGGTGA ACCATTATCA
AGTTTTTTAG CAACCGCGCA TGAGTGGGGG CATTCTATTT ATGAGCAGGG TTTGCCATCA
CAAAGTCATC AATGGTTTGC TTGGCCTTTA GGTCAAGCAA CATCTATGGG TATTCATGAA
AGTCAATCTT TATTTTGGGA AAATAGAATA GTTAAATCCA AATCTTTTTC AAAAAGATTT
TTTAAAAAAT TTGTTTCGGC TGGATGTTCT CTTAATAATT ATTTAGAACT ATGGAAATCT
ATTAATCATT TGGAAGCAGG ATTAAATAGG GTGGAAGCGG ATGAATTGAC TTATGGCTTA
CACATATTAA TAAGAACCGA ACTTGAAATA GATTTAATTG AAAGAGGGTT ACCTGCTGAA
GATATTCCAA CAGAATGGAA TAAAAGATAT GGTGAACTCC TAGGAATTAA ACCATCTAAT
GATTCAGAAG GTTGTCTTCA AGATGTTCAT TGGAGTGAAG GGGCGTTTGG ATATTTCCCC
TCATATTTGT TAGGACATGT TATAAGTGCG CAAATATCTT CTCAAATGGA AAGAGAAATA
GGTTTGATTG ACAACTTAAT TGAAAATGGT GAATATCAAA AGATCATCTT TTGGTTAAAA
AATAATATAC ATAAATATGG CAGATCTGTT AATTGTATGG AGTTGGTAAG AGCTGTAACT
AATGAAGAAC TATCGCCAAA CTATTTTATT AATCATTTAA GGTCTAAAAT AAATGATTTT
TGCTGA
 
Protein sequence
MAETHWKKLG AYLKETQILG SIQNTLYWDQ NTGMPKKGAY WRSEQLTYIA KVLHERNSSE 
EFSNLIQSAK NELADIERNS DNQLFIKDKE RNISLLLKEF NRERNLDPKL VESLAKAKSK
GYESWQEAKE KSDFKIFLPF FEELVKLRIE EAKQISIKCS PWETLAQPFE PELNLKWLNK
IFQPLKETIP GLIRGLNKSQ KNQWDLSPES QKKLCSKLLD EFGRDRDLVV VGQSPHPFSI
TLGPNDFRIT TRIVEGEPLS SFLATAHEWG HSIYEQGLPS QSHQWFAWPL GQATSMGIHE
SQSLFWENRI VKSKSFSKRF FKKFVSAGCS LNNYLELWKS INHLEAGLNR VEADELTYGL
HILIRTELEI DLIERGLPAE DIPTEWNKRY GELLGIKPSN DSEGCLQDVH WSEGAFGYFP
SYLLGHVISA QISSQMEREI GLIDNLIENG EYQKIIFWLK NNIHKYGRSV NCMELVRAVT
NEELSPNYFI NHLRSKINDF C