Gene P9301_08051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_08051 
Symbol 
ID4911660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp693308 
End bp694525 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content28% 
IMG OID640160387 
Productinsulinase family protein 
Protein accessionYP_001091029 
Protein GI126696143 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.126573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGAAA GATATTTTTT AAATATTAAA AAAAGGAATT TTTCAACTGC TTCAATTTGG 
ATTAAAGGAG GGAGTGATGC GGATAGTGTT GGCAAAAAAG GTATAAACAA GATCCTTAGT
TCATTACTTA CCAGAGGATG TGAGGGTTTT AATAATTTCA CTCTTTCAGA GTATATTGAG
TCCTACGGAG CAGAATTAAA TCAAGAAGTA TTTGAAGATG GTATTTCAAT AAGCATTAAA
TCCCTAAATG AACATTTCAG CAAATTATTC CCCTTATTAG ATTTAATAAT TAATAAACCA
ACACTCTTAG AAAGAGAATT TGAAAAAGTA AAAAAATCCT CTATTGATTT TTTAAAAAAG
GATAAAGAGA ATCCATTTAA TATCTGTTTT GAAAAATGGA GAAGAATTGT TTACTCAAAT
CATCCTTATG CCTTTAATAC AAATGGCAAT GAAAATGATG TCTCAAAGAT TACATATGAA
GATGTTTTGC TGGAATTTAA AAATTTCAAA AGTCGAGATA AGTATTTGAT TTCAAATAAT
TCAGAAATAG ATGGAGTAAG TATAGAAAAA TTAGACAAAA AACCCTTAGT AGAAAAATTT
AGACCTCTAA ATCATGATTT AAGTCCAAAC AATCGATTTG ATTTCAATAA TAATAATTCA
AATCAAACAA TAATAATGTT TGGCAACCAA ACTTGCTCTC GTAAAAGTAG TGAATATTTG
CCTCTTAAGG TTTTGGAGTC GTATCTATCT TATGGAATGA GCGCTGCTTT ATTTAAACTT
TTTAGGGAAA AAAATGGGAT CACTTACGAT TTAGGTGTTT ATTATCCAGT TAGGAGAAGG
AATGCTCCAT TTTTAGTATA TTTATCAGTA TCAAATAAAA AAGCCCTTTT TGCTTTTGAA
CTTTTATCAA CTTTATGGAA AGATTTACTT TTAAATCCTT TGATTGATAA TGAAATACTA
TTGGCTAAAG AAAAACTAAA AGGTTCTTTT CTATTGGGAA ATCAATCACT AGATGAAATT
TTACAGCGAA AGATACAGTT AATTAGTTAT GGTGTTACCC CAATTTCTGA GAGTGATTTA
AATTCTAAAA TAGACGAAAT ATCTTCATTA GATATTCTTA AATTAACAAA CAAGTATTTT
TCAAAACCTT TTCTGAGTAT TTCTGGTAGT AAGAATATAT GTTTAGAAAT TATTAAAAGT
TGGAAGCAGA ACTTTTGA
 
Protein sequence
MLERYFLNIK KRNFSTASIW IKGGSDADSV GKKGINKILS SLLTRGCEGF NNFTLSEYIE 
SYGAELNQEV FEDGISISIK SLNEHFSKLF PLLDLIINKP TLLEREFEKV KKSSIDFLKK
DKENPFNICF EKWRRIVYSN HPYAFNTNGN ENDVSKITYE DVLLEFKNFK SRDKYLISNN
SEIDGVSIEK LDKKPLVEKF RPLNHDLSPN NRFDFNNNNS NQTIIMFGNQ TCSRKSSEYL
PLKVLESYLS YGMSAALFKL FREKNGITYD LGVYYPVRRR NAPFLVYLSV SNKKALFAFE
LLSTLWKDLL LNPLIDNEIL LAKEKLKGSF LLGNQSLDEI LQRKIQLISY GVTPISESDL
NSKIDEISSL DILKLTNKYF SKPFLSISGS KNICLEIIKS WKQNF