Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_12621 |
Symbol | |
ID | 4911397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1071397 |
End bp | 1072776 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640160851 |
Product | hypothetical protein |
Protein accession | YP_001091486 |
Protein GI | 126696600 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAT TAATACTTTT AGTAATTGTT TTGGGGTTTG GCTCTTTTTT TAATGCTCAA GAATTATTAG CCGTAAAGAT TAATTGTGAT TCACCTGTTC ACAAAAATAA AAAAAGATGT AGTGAGAAAT ACCTGCGAAA TGTAGTTATT GATGAAGATA CAGGTTTAGA AGTTATTGAA TATGAAAAAG ATGTCGATTG GAAAAAGAAA AATCCTAAGA TTGCTTGGTC CAAAATCATA AAATACAAAT CATCTCTTAG AAATTCATAT GAATTAACAA TTTTTGATAG GGATTATGTT TCTGATTTTA GTACCGGAGC AGTTAAAAGT TATGTCACGA AATGGAATAC CAACAAATTA GAAGGAAGAA TATTAACTTG GGGTGGTTGT GGTTTTTGGA CATGTACTTA TGAAAGTGCT AGATATTACG ATTTTCCTGG TTTTATTGAG ATATTTGTTG GAGACAAGAG GTTTAGATTA AGAGGTAGTA GAGGTGAATT TCAATTCCCT TATGGATTTG CAAAAAGAAT AAAGAATATA GGTGAGAATG AATCTATTAA TTTACAAATT AAAGCTATCC CAAATAGTGG TTTAGCAGAT AAATTTATCC CTATAGGTGA TGAAACAATA AAAAATTTAA AACTATTATT CCAAAAGGAT ACAAAAGAGT GGAACAAACC TAAATATGAA ATAGCTCGAG CCTCAATTAG TTCAAAAAAA CTTGATATAG AGGAAATAGC ATCTATAACT CTTCCATCAG TAGTCAAATT AGAGGGAGAT TCTGGATTAG GAAGTGGCTT TTTTATAAAT AATACAGGAC TTATTGTTAC TAATATGCAC GTAGTTGCCG GTGGAGATAA AGAATTCACT ATCTCTGGAG ATAATGGGTT AAAAGATCAA GGTGAAGTTA TTTATGTAGA CTCAAAGCTT GATTTCGCAT TAATACAATC AAATAACACT AAGAATTCAA AAGCTCTTCC TCTTTGTTTT AGTAAATATC CAAGACCTGG TCAAAATGTA ATTGCTCTTG GATCTCCTTT AGGTTTAGCG GGAACAGTGA CTAGAGGGAT TGTAAGTGCA GTGAGACAAC CATCAAGTGA TTTAGAAGAC GTAGTCCCAT ATTATGTGAC TCTTATACAA ACAGATGCAG CTATAAGTCC TGGTAATAGC GGAGGCCCTT TGGTAAATAG CAATGGTGAG GTAGTTGGAG TGAATACCTG GAGTTTACCT GGAGACGAGG GTAGAGCTCA GAATTTAAAT TTTGCCATCT CAATCGTAGA TATTTTAAGG TCGTTAAATA GTGAAATTCC AGTTGAAGCT GAAAATACTA ATAGTTGCGG GAATTTTGTA GAGAAAGAGA GCATATTTAA ATTTTGGTAA
|
Protein sequence | MRKLILLVIV LGFGSFFNAQ ELLAVKINCD SPVHKNKKRC SEKYLRNVVI DEDTGLEVIE YEKDVDWKKK NPKIAWSKII KYKSSLRNSY ELTIFDRDYV SDFSTGAVKS YVTKWNTNKL EGRILTWGGC GFWTCTYESA RYYDFPGFIE IFVGDKRFRL RGSRGEFQFP YGFAKRIKNI GENESINLQI KAIPNSGLAD KFIPIGDETI KNLKLLFQKD TKEWNKPKYE IARASISSKK LDIEEIASIT LPSVVKLEGD SGLGSGFFIN NTGLIVTNMH VVAGGDKEFT ISGDNGLKDQ GEVIYVDSKL DFALIQSNNT KNSKALPLCF SKYPRPGQNV IALGSPLGLA GTVTRGIVSA VRQPSSDLED VVPYYVTLIQ TDAAISPGNS GGPLVNSNGE VVGVNTWSLP GDEGRAQNLN FAISIVDILR SLNSEIPVEA ENTNSCGNFV EKESIFKFW
|
| |