Gene Information |
Locus tag | P9301_02981 |
Symbol | |
ID | 4912513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 272771 |
End bp | 274039 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640159866 |
Product | hemolysin-like protein |
Protein accession | YP_001090522 |
Protein GI | 126695636 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
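The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic follows; the helper names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS; the final codon is assumed to be a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(272771, 274039)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1269 422
```

Both results match the Gene Length and Protein Length rows of the record.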
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA CTCTACTTTT ATTTCTTTTA TTTCTACCAG CTTTTTTCGC AGCGAGTGAA CTCTCTTTTT TATTAATAAG GCCAAGTAAA GTTTTAAGGT TAATAGAAGA AAAAAAGAAA GGGGCATTTT CAATTTTAAA AATTCAAAAA CGTTTTAGAT CTTCACTAAT TGCTTCTCAA TTTGGAGTAA CAATTTCATT AATTGCAATT GGATGGCTCA GCAATAACCT GGCTAATGAT TATTGGAAAA GTAATATTTT ATCAAATAGA TTTTATGATC TTCTATTATT TTTATTTGTT GTTTTAGTTG TTACTCTTGT TTCTGGACTC ATTCCAAAAG CTTTAGTAAT TAACAATCCA GAATCTGCTG CATTAAGGTT AACTACAATA TTCGATGCCG TGAGAAAAGC TATGAATCCT ATAGTGAAAA TAATAGAATT CTTTGCTAGC GCCTGTTTAG GCTTGTTCAA TTTAAATAAC AAATGGGATT CTTTAAACTC TGGTTTATCT GCTGGAGAAT TAGAAACTCT TATAGAAACA GATAACGTAA CAGGTTTAAA ACCAGATGAG AAGAATATTC TTGAGGGAGT CTTTGCTTTA AAAGATACAC AGGTTAAAGA AGTTATGATT CCAAGATCTG AAATGGTAAC TTTGCCAAAA AATATAACCT TTTCAGAACT AATGAAACAA GTAGATAAAA CTCGACATGC TCGCTTCTTT GTCATTGGTG AGTCTTTAGA TGATGTATTA GGTGTATTAG ATTTACGTTA TCTAGCTAAG CCAATTTCAA AAGGTGAAAT GGAAGCAGAT ACATTATTAG AGCCATTCCT TTTACCAGTA ACAAAAATAA TAGAAACATG TTCACTAGCA GAAATATTTC CAATAGTTAG AGACTACAAT CCGTTCTTAC TAGTAGTTGA TGAACATGGT GGAACAGAAG GACTTATAAC TGCAGCTGAT CTAAATGGCG AAATAGTTGG AGAGGAAATG CTCAATAATA GAATTTATTC AGATATGAGA ATGTTAGATA ATTTCTCTAA AAAATGGTCA ATAGCTGGAA AATCAGAAAT TGTTGAAATC AATAAAAAGA TAGGATGTTC AATTCCAGAA GGTACTGATT ATCATACTCT TGCTGGATTT ATGTTAGAAA AATTTCAAAT GGTTCCAAAA ATTGGCGACG TTTTAGATTT TAATAACATT AAATTCGAAG TTATTTCTAT GTCAGGTCCA AAAATTGATC GTGTTAAAAT AATTCTTCCC AAAAGCTAA
|
Protein sequence | MKITLLLFLL FLPAFFAASE LSFLLIRPSK VLRLIEEKKK GAFSILKIQK RFRSSLIASQ FGVTISLIAI GWLSNNLAND YWKSNILSNR FYDLLLFLFV VLVVTLVSGL IPKALVINNP ESAALRLTTI FDAVRKAMNP IVKIIEFFAS ACLGLFNLNN KWDSLNSGLS AGELETLIET DNVTGLKPDE KNILEGVFAL KDTQVKEVMI PRSEMVTLPK NITFSELMKQ VDKTRHARFF VIGESLDDVL GVLDLRYLAK PISKGEMEAD TLLEPFLLPV TKIIETCSLA EIFPIVRDYN PFLLVVDEHG GTEGLITAAD LNGEIVGEEM LNNRIYSDMR MLDNFSKKWS IAGKSEIVEI NKKIGCSIPE GTDYHTLAGF MLEKFQMVPK IGDVLDFNNI KFEVISMSGP KIDRVKIILP KS
|
| |
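The gene sequence is displayed in space-separated 10-base blocks. Although the gene lies on the minus strand, the sequence appears to be given in coding orientation, since it begins with ATG and ends with TAA. The Python sketch below strips the block spacing, computes GC content, and translates codons. It is an illustration under stated assumptions: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base slowest, third base fastest, base order TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a stop codon is rendered as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last (partial) blocks of the gene sequence in the record.
first_blocks = "ATGAAAATAA CTCTACTTTT ATTTCTTTTA"
print(translate(first_blocks))  # MKITLLLFLL
print(translate("AAAAGCTAA"))   # KS*
```

The two translations reproduce the first block (MKITLLLFLL) and the final two residues (KS) of the protein sequence shown above. Passing the full gene sequence to `gc_content` gives the fraction to compare against the GC content row.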