Gene P9301_02981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02981 
Symbol 
ID4912513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp272771 
End bp274039 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content31% 
IMG OID640159866 
Producthemolysin-like protein 
Protein accessionYP_001090522 
Protein GI126695636 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA CTCTACTTTT ATTTCTTTTA TTTCTACCAG CTTTTTTCGC AGCGAGTGAA 
CTCTCTTTTT TATTAATAAG GCCAAGTAAA GTTTTAAGGT TAATAGAAGA AAAAAAGAAA
GGGGCATTTT CAATTTTAAA AATTCAAAAA CGTTTTAGAT CTTCACTAAT TGCTTCTCAA
TTTGGAGTAA CAATTTCATT AATTGCAATT GGATGGCTCA GCAATAACCT GGCTAATGAT
TATTGGAAAA GTAATATTTT ATCAAATAGA TTTTATGATC TTCTATTATT TTTATTTGTT
GTTTTAGTTG TTACTCTTGT TTCTGGACTC ATTCCAAAAG CTTTAGTAAT TAACAATCCA
GAATCTGCTG CATTAAGGTT AACTACAATA TTCGATGCCG TGAGAAAAGC TATGAATCCT
ATAGTGAAAA TAATAGAATT CTTTGCTAGC GCCTGTTTAG GCTTGTTCAA TTTAAATAAC
AAATGGGATT CTTTAAACTC TGGTTTATCT GCTGGAGAAT TAGAAACTCT TATAGAAACA
GATAACGTAA CAGGTTTAAA ACCAGATGAG AAGAATATTC TTGAGGGAGT CTTTGCTTTA
AAAGATACAC AGGTTAAAGA AGTTATGATT CCAAGATCTG AAATGGTAAC TTTGCCAAAA
AATATAACCT TTTCAGAACT AATGAAACAA GTAGATAAAA CTCGACATGC TCGCTTCTTT
GTCATTGGTG AGTCTTTAGA TGATGTATTA GGTGTATTAG ATTTACGTTA TCTAGCTAAG
CCAATTTCAA AAGGTGAAAT GGAAGCAGAT ACATTATTAG AGCCATTCCT TTTACCAGTA
ACAAAAATAA TAGAAACATG TTCACTAGCA GAAATATTTC CAATAGTTAG AGACTACAAT
CCGTTCTTAC TAGTAGTTGA TGAACATGGT GGAACAGAAG GACTTATAAC TGCAGCTGAT
CTAAATGGCG AAATAGTTGG AGAGGAAATG CTCAATAATA GAATTTATTC AGATATGAGA
ATGTTAGATA ATTTCTCTAA AAAATGGTCA ATAGCTGGAA AATCAGAAAT TGTTGAAATC
AATAAAAAGA TAGGATGTTC AATTCCAGAA GGTACTGATT ATCATACTCT TGCTGGATTT
ATGTTAGAAA AATTTCAAAT GGTTCCAAAA ATTGGCGACG TTTTAGATTT TAATAACATT
AAATTCGAAG TTATTTCTAT GTCAGGTCCA AAAATTGATC GTGTTAAAAT AATTCTTCCC
AAAAGCTAA
 
Protein sequence
MKITLLLFLL FLPAFFAASE LSFLLIRPSK VLRLIEEKKK GAFSILKIQK RFRSSLIASQ 
FGVTISLIAI GWLSNNLAND YWKSNILSNR FYDLLLFLFV VLVVTLVSGL IPKALVINNP
ESAALRLTTI FDAVRKAMNP IVKIIEFFAS ACLGLFNLNN KWDSLNSGLS AGELETLIET
DNVTGLKPDE KNILEGVFAL KDTQVKEVMI PRSEMVTLPK NITFSELMKQ VDKTRHARFF
VIGESLDDVL GVLDLRYLAK PISKGEMEAD TLLEPFLLPV TKIIETCSLA EIFPIVRDYN
PFLLVVDEHG GTEGLITAAD LNGEIVGEEM LNNRIYSDMR MLDNFSKKWS IAGKSEIVEI
NKKIGCSIPE GTDYHTLAGF MLEKFQMVPK IGDVLDFNNI KFEVISMSGP KIDRVKIILP
KS