Gene P9301_12621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_12621 
Symbol 
ID4911397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1071397 
End bp1072776 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content33% 
IMG OID640160851 
Producthypothetical protein 
Protein accessionYP_001091486 
Protein GI126696600 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAT TAATACTTTT AGTAATTGTT TTGGGGTTTG GCTCTTTTTT TAATGCTCAA 
GAATTATTAG CCGTAAAGAT TAATTGTGAT TCACCTGTTC ACAAAAATAA AAAAAGATGT
AGTGAGAAAT ACCTGCGAAA TGTAGTTATT GATGAAGATA CAGGTTTAGA AGTTATTGAA
TATGAAAAAG ATGTCGATTG GAAAAAGAAA AATCCTAAGA TTGCTTGGTC CAAAATCATA
AAATACAAAT CATCTCTTAG AAATTCATAT GAATTAACAA TTTTTGATAG GGATTATGTT
TCTGATTTTA GTACCGGAGC AGTTAAAAGT TATGTCACGA AATGGAATAC CAACAAATTA
GAAGGAAGAA TATTAACTTG GGGTGGTTGT GGTTTTTGGA CATGTACTTA TGAAAGTGCT
AGATATTACG ATTTTCCTGG TTTTATTGAG ATATTTGTTG GAGACAAGAG GTTTAGATTA
AGAGGTAGTA GAGGTGAATT TCAATTCCCT TATGGATTTG CAAAAAGAAT AAAGAATATA
GGTGAGAATG AATCTATTAA TTTACAAATT AAAGCTATCC CAAATAGTGG TTTAGCAGAT
AAATTTATCC CTATAGGTGA TGAAACAATA AAAAATTTAA AACTATTATT CCAAAAGGAT
ACAAAAGAGT GGAACAAACC TAAATATGAA ATAGCTCGAG CCTCAATTAG TTCAAAAAAA
CTTGATATAG AGGAAATAGC ATCTATAACT CTTCCATCAG TAGTCAAATT AGAGGGAGAT
TCTGGATTAG GAAGTGGCTT TTTTATAAAT AATACAGGAC TTATTGTTAC TAATATGCAC
GTAGTTGCCG GTGGAGATAA AGAATTCACT ATCTCTGGAG ATAATGGGTT AAAAGATCAA
GGTGAAGTTA TTTATGTAGA CTCAAAGCTT GATTTCGCAT TAATACAATC AAATAACACT
AAGAATTCAA AAGCTCTTCC TCTTTGTTTT AGTAAATATC CAAGACCTGG TCAAAATGTA
ATTGCTCTTG GATCTCCTTT AGGTTTAGCG GGAACAGTGA CTAGAGGGAT TGTAAGTGCA
GTGAGACAAC CATCAAGTGA TTTAGAAGAC GTAGTCCCAT ATTATGTGAC TCTTATACAA
ACAGATGCAG CTATAAGTCC TGGTAATAGC GGAGGCCCTT TGGTAAATAG CAATGGTGAG
GTAGTTGGAG TGAATACCTG GAGTTTACCT GGAGACGAGG GTAGAGCTCA GAATTTAAAT
TTTGCCATCT CAATCGTAGA TATTTTAAGG TCGTTAAATA GTGAAATTCC AGTTGAAGCT
GAAAATACTA ATAGTTGCGG GAATTTTGTA GAGAAAGAGA GCATATTTAA ATTTTGGTAA
 
Protein sequence
MRKLILLVIV LGFGSFFNAQ ELLAVKINCD SPVHKNKKRC SEKYLRNVVI DEDTGLEVIE 
YEKDVDWKKK NPKIAWSKII KYKSSLRNSY ELTIFDRDYV SDFSTGAVKS YVTKWNTNKL
EGRILTWGGC GFWTCTYESA RYYDFPGFIE IFVGDKRFRL RGSRGEFQFP YGFAKRIKNI
GENESINLQI KAIPNSGLAD KFIPIGDETI KNLKLLFQKD TKEWNKPKYE IARASISSKK
LDIEEIASIT LPSVVKLEGD SGLGSGFFIN NTGLIVTNMH VVAGGDKEFT ISGDNGLKDQ
GEVIYVDSKL DFALIQSNNT KNSKALPLCF SKYPRPGQNV IALGSPLGLA GTVTRGIVSA
VRQPSSDLED VVPYYVTLIQ TDAAISPGNS GGPLVNSNGE VVGVNTWSLP GDEGRAQNLN
FAISIVDILR SLNSEIPVEA ENTNSCGNFV EKESIFKFW