Gene A9601_05941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05941 
Symbol 
ID4717294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp518895 
End bp520118 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content27% 
IMG OID640078306 
ProductZn-dependent proteases 
Protein accessionYP_001008987 
Protein GI123968129 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAAGTT GGCAAATTTT TAAAATATGG GGAATTCCCT TTAAAGTTCA TCCTTATTGG 
TTTGCTATTC TCTTTTTATT CTCATGGAGT ATAAGTAATC AGGTTAATTT GACCTCAAGT
GACATTTATA ATACCAAAGA AGCTTGGATT ATAGGATTTT TTACTTCTTT TTTCTTATTA
TCTTCAATTG TTTTTCATGA GATTTTTCGT ACTTTTGTTT CTCTTAATCA GGGTGTAAAA
ATAAAAAATA TTACTTTTTA TTTTCTCGGA GCAATTTTAC AAATAGATAA GTATTGTCAA
ACTGCTTTAG GTAATATAAA AATTGCAGTT GTTAAACCTC TTTTATGTTT CGCTACAGCA
TTTATCCTAT TATTAGTTAG TAGCAACAGT GCATCCCAAG AACAAATAGC AGTTAATGTA
ATTTCTAGAG TAGGTATATT TAATTTATTC TTAGGTTTCT TAAATTTGAT TCCAATTGGT
TCTTTAGATG GAGGAAATTT ATTAAAAAGT ATTATTTGGC ATTTTTCAGG AAGTAAAAAT
AAAGGAAGAA ACTTTCTCAA TAAAGTAAAT TTATTATTAT CTTTTTTTGT TCTATTTTTT
GGGATAGTTT GTTTATTTAG ATTTAACTTT TACTTTGGTT TTATTCTTTC TCTTTTGGGA
TTGTTTGGAG TTAATTCTTC AAAATCTGAA AGTCAATTTT TTAAAATTGA AAATATACTT
AAATTTAGTA AAGTTTCTGA GATTAAATTA AAGCCGTTGA GGAAAATTGA ATACGATTCA
AATTTCTCAG AATTTAATAC ACTTATAAAA AATAAAAAGG ATATATCGGA TAAATATTAT
TTTGTTACGA ATAATGGTAG ATGGACCGGT TTTGTTAATG AGAGTATTTT AAAAACTGTT
TCCTTAAATA AATGGGAACG GAACTTTGTT GGAGATTTTA AGAAACCAAT CGATAATTTT
GAGAGTGTAT CTTATAACGA TAAATTATGG AGAACTATAG AAAGACTTGA AGAAACAAAT
GAAGGTTTTT TGTTGGTCCT CAATGCTGCA GATATCCCTT TGGGGATAAT TGATAGGTCA
AAAATTGGGA ACTTTGTATT GCATAAATTA GGTTTAAATT TGCCTTCAGA GATTGTTAAC
AAATTAAACT TTAAAAATAA CTACCCCTTA GGAATTGAAT TGCCAAGAAT AATTAATTCA
ATGAAGCAGA AAGGAGATCT TTAA
 
Protein sequence
MRSWQIFKIW GIPFKVHPYW FAILFLFSWS ISNQVNLTSS DIYNTKEAWI IGFFTSFFLL 
SSIVFHEIFR TFVSLNQGVK IKNITFYFLG AILQIDKYCQ TALGNIKIAV VKPLLCFATA
FILLLVSSNS ASQEQIAVNV ISRVGIFNLF LGFLNLIPIG SLDGGNLLKS IIWHFSGSKN
KGRNFLNKVN LLLSFFVLFF GIVCLFRFNF YFGFILSLLG LFGVNSSKSE SQFFKIENIL
KFSKVSEIKL KPLRKIEYDS NFSEFNTLIK NKKDISDKYY FVTNNGRWTG FVNESILKTV
SLNKWERNFV GDFKKPIDNF ESVSYNDKLW RTIERLEETN EGFLLVLNAA DIPLGIIDRS
KIGNFVLHKL GLNLPSEIVN KLNFKNNYPL GIELPRIINS MKQKGDL