Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_16561 |
Symbol | |
ID | 4778526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1445036 |
End bp | 1446403 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640087165 |
Product | insulinase family protein |
Protein accession | YP_001017665 |
Protein GI | 124023358 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCG ACTCGACTTC TCGAGCAAAT GCTTCCATTG CTGCAACCAG AGCGCAGCTT CACCCTTGTT GCCAGACCGA TGGAAGCAGG CGAATGAATC CTCTTGATGT GGTTTTAGAT CCAATCGCCG CACCGGGAGT TATTGCCGCC AAGCTCTGGG TTAGAGGCGG TAGTGGTGCT GACCCAAAAG GGCAACGGGG AGTTCATCAA CTGCTCGGAG CCCTCTTGAC CAGGGGCTGT GGACCTTATG ACCACCTTGC TCTAGCCGAT CTCGTTGAAG GCTGCGGGGC AGGTTTGCGC TGCGATACCC ACGAAGACGG ATTGCTAATT AGCCTCAAAT GTGCAGATCG TGATGCCGAA CGACTCCTTG ATTTACTTGG CTGGATGCTG ATCGATCCGC ATCTGGATTC AAGTCAAGTA ACGCTGGAAA GGGATCTCAG TCTTCAGGCC TTGCAAAGAC AAAGAGAAGA CCCATTTCAC TTGGCTTATG ACGGTTGGCG GCATATGGCT TATGGCAGTG GCCCCTACGG CCACGATCCC CTTGGCCTTA GCGAGGACCT GAACCAACTT GGTCGTCAGC AATTAATTTC CTTAATCGAC GGGCTAACAG CACAATCACC TGTGCTTGCC CTCGCTGGGA CCCTTCCAGA GGATCTTGAA CAGCGGCTGG AGGCAATGGA ATCTTTCCAG CGCTGGCCCA ATCAGCCACC TCAGCAAGCG AGAAAGTCTG AATCAAGCAA GATCTCAACA GAGAACATTC AGATCGAATC CAACATTTGT CTTCAGCCTG AACCTACAAG TCAGGTGGTC ATGATGCTTG GACAGCCAAC CCTTGCTCAT GGCCATGAAG ACGATCTGGC ACTGCGTCTA CTGAACTGCC ACCTGGGATT AGGCATGTCG AGCTTGCTGT TCAGGCGTCT ACGAGAGCAA CACGGGGTGG CCTACGACGT AGGCACTCAT CACCCGGTAC GTAAGTGTGC CGCTCCATTT GTATTACATG CTTCGACAAG CGAAGACAAG GCAAAACTCA CCCTTCAGTT GCTTCTAGAC AGCTGGTGGG AACTCAGCCA GCAAGCGATA TCAGAAGAAG ACATTGAACT GGCACGCGCA AAATTCCATG GTCAACTCGC CCATGGAGCT CAAACCACTG GACAACGGGC AGAACGCCGA GCCCAATTAC GGGGACTAGG GCTGCCAGGC AACTATGACG AGCACAGCTT GGAGACAATC AAAAATCTTG ATGGAAGCGC TCTGCAAAAG GCAGCTCAAC GACATCTAAA AATGCCCTTG CTAAGTCTCT GTGGCCCAGA AACCAGCCTT CAAATCCTTG CCAAGGACTG GCAACAGCAA GTGGTTCAAA GCTCTTAA
|
Protein sequence | MDADSTSRAN ASIAATRAQL HPCCQTDGSR RMNPLDVVLD PIAAPGVIAA KLWVRGGSGA DPKGQRGVHQ LLGALLTRGC GPYDHLALAD LVEGCGAGLR CDTHEDGLLI SLKCADRDAE RLLDLLGWML IDPHLDSSQV TLERDLSLQA LQRQREDPFH LAYDGWRHMA YGSGPYGHDP LGLSEDLNQL GRQQLISLID GLTAQSPVLA LAGTLPEDLE QRLEAMESFQ RWPNQPPQQA RKSESSKIST ENIQIESNIC LQPEPTSQVV MMLGQPTLAH GHEDDLALRL LNCHLGLGMS SLLFRRLREQ HGVAYDVGTH HPVRKCAAPF VLHASTSEDK AKLTLQLLLD SWWELSQQAI SEEDIELARA KFHGQLAHGA QTTGQRAERR AQLRGLGLPG NYDEHSLETI KNLDGSALQK AAQRHLKMPL LSLCGPETSL QILAKDWQQQ VVQSS
|
| |