Gene Information |
Locus tag | P9211_10181 |
Symbol | |
ID | 5731840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 910844 |
End bp | 912094 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641285385 |
Product | insulinase family protein |
Protein accession | YP_001550903 |
Protein GI | 159903559 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0999203 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAGC TAAGAATAGT TCTAGATCCC AGGGATACAG CGGGTATAAT GTCGGCAAAA TTATGGATTA AAGAAGGTAG TAGAGCAGAC CCTAAAAACA AGCAGGGTCT TCATTATCTT TTAGGTTCAC TTCTTAGTAG GGGTTGTGGC CCCTACAACC GAATAGAGAT TGCTGATCTA ATAGAGGGGT GCGGAGCTGC CTTACGTTGC GACACCTTTG AAGATGGTAT ATTGCTCAGT CTTAAATGTA CAGAAAGAGA TCAATCAAAG CTATTACCAT TATTTGTGTG GATGGTTACA GCGCCACATA TAGCATCAGA TCAAATGAGT TTAGAACGTG AGCTATCTAT ACAACTACTA CAAAGACAAA AAGAAAGCGC CTTTCATGTA GCCTTTGATT GTTGGAGGAG AATTATTTAT CGCAAGTCCC CTTATGAACA TGATCCATTA GGAACCCTTG CTGGTATTGC TGCTATTGGG GAGGATGATC TAAAAAACTT ATCAACTAAA TTGTCTATAA GAGAACAGGT ATTAGTAATC TCTGGTAGCT TCCCTAAGAA AATTGAATCA GATATTAAAA GCTTGTTCTC GTTTACTTCC TCTAGCCTGA AAGATAGTGA CGTGAATATT ACTTCAGAAG GAAGTAAAGA ATTTAAGGTT GACAATCATA AGCAAAGGCT GATACTTCGA CACCAAGAGA CAAGCCAGGT AATATTAATT TTAGGGCAAA AGACTATTAG CCACTCTCAT CAAGATGACT TGGTCTTAAG ACTACTGAGT TGTCATTTGG GTTCTGGTAT GTCCAGTCTA CTATTTAAAA AGTTTAGAGA GCAATATGCA TTGGCATACG AAACTGGGGT TTACCATCCA ATTAGAGAAT ATGAAGCACC TTTTGCAATT CATGTGGCAA CTACTCAAGA GAAGGCATTG CATTCTTTAC GTCTCCTCAA AAAATGCTGG GAAATACAAT TGGAACAAAA GATATCAGAA GAAGAGTTAT TCTTAGCCAG AGCAAAGTTT AAAGGTAATG TTGCCCATAA CTTGCAGACA GTTAGTCAAA GAGCCGAAAG GAAAGCCCAG CTGTTAAGTT TTGGTATGAG CGATAACTAT GACAATGAAT GCTTCAAAAG AGTTGATACA ATTTCTGCAG AAGAGATACA AACCACTGCA ATAAGATATC TCAGTAATCC ATTGTTAAGC TTATGCGGAC CAAAGAAAAC ACTTGACATA TTGGCAAGTC ATTGGTGTTA G
|
Protein sequence | MQKLRIVLDP RDTAGIMSAK LWIKEGSRAD PKNKQGLHYL LGSLLSRGCG PYNRIEIADL IEGCGAALRC DTFEDGILLS LKCTERDQSK LLPLFVWMVT APHIASDQMS LERELSIQLL QRQKESAFHV AFDCWRRIIY RKSPYEHDPL GTLAGIAAIG EDDLKNLSTK LSIREQVLVI SGSFPKKIES DIKSLFSFTS SSLKDSDVNI TSEGSKEFKV DNHKQRLILR HQETSQVILI LGQKTISHSH QDDLVLRLLS CHLGSGMSSL LFKKFREQYA LAYETGVYHP IREYEAPFAI HVATTQEKAL HSLRLLKKCW EIQLEQKISE EELFLARAKF KGNVAHNLQT VSQRAERKAQ LLSFGMSDNY DNECFKRVDT ISAEEIQTTA IRYLSNPLLS LCGPKKTLDI LASHWC
|
| |
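The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon (the gene sequence above ends in TAG), which is not translated.

```python
# Values copied from the record above.
START_BP = 910844
END_BP = 912094
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the untranslated stop codon


def gc_percent(sequence):
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Passing the Gene sequence field to `gc_percent` (the blocks of ten bases can be pasted as-is, since whitespace is stripped) gives a value to compare with the GC content field.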