Gene A9601_08061 details


Gene Information

Locus tag: A9601_08061
Symbol:
ID: 4717512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 693631
End bp: 694848
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 26%
IMG OID: 640078520
Product: insulinase family protein
Protein accession: YP_001009199
Protein GI: 123968341
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
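The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the CDS ends in exactly one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(693631, 694848)
print(length)                              # 1218
print(expected_protein_length_aa(length))  # 405
```

Both results agree with the listed Gene Length (1218 bp) and Protein Length (405 aa).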


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.209165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAGA GATATTTTTT AAATAATAAA AAAAGAAATT TTTCAATTGC TTCAATTTGG 
ATTAAAGGGG GGAGTGATAT GGATAGTACT GGCAAAAAAG GTATTAACAA GATCCTTTGT
TCATTACTTA CCAGAGGATG TGAAGGTTTT AACAATTTAA CTCTCTCTGA ATATATTGAG
TCCTATGGAG CAGAATTAAA TCAAGAAATA TTTGAAGATG GAATTTCAAT AAGTATTAAA
TCCCTAAATG AACATTTCAG CAAATTATTC CCTTTATTAG AGTTAATAAT TAATAAGCCA
ATCCTTTCGG AAACTGAATT TAAAAAAGTA AAAAAATCTT CTATTGATCA CATTAAAAAA
GATAAAGAGA ATCCATTCAA TATCTGTTTT GAAAAATGGA GAAAAATTGT TTATTCAAAT
CATCCTTATG CCTTTAACAC AATAGGCAAT GCTAGTGATG TCTCAAAGAT TACCTATGAA
GATATTTTAC TTGAGTTTAA AAATTTAAAA AAAAGAGAAA AGTATTTAAT TTCAAATAAT
CCTGAAATAA ATGGAGAAAA TTATGGAACA CTTGAAAAAA AAATCTTAAA AGAAAAATCA
GATCCTTTAA ATCACAATTT AAAAACTACA AATAGATTTG ATTACATTAG TAATGATACA
AATCAAACAA TAATAATGAT GGGTGACCAA ACTTGCTCGC GAAGAAGTAG TGAATATTTT
CCTCTTAAGG TTTTGGAGTC ATATTTATCT TATGGAATGA GCGCTGCTTT ATTTAAACTT
TTTAGAGAAA AACATGGTAT CACTTACGAT TTAGGTGTTT ATTATCCTAT CAGGAGTGGA
AATGCCCCAT TTTTAATTTA TTTATCCGTA TCTAATGATC AAGCACTTTT TGCTTTTGAA
CTTTTATCAA CACTATGGAA AAATTTACTT TTAAATCCGT TGACTGATGC TGAAATATTT
TTAGCAAAAG AAAAACTAAA AGGTTCTTTT TTATTAGGAA ATCAATCACT AGATGAAATT
TTACACAGAA AGATACAGTT AGTTAGTTAT GGTATTTCAC CAATTTCAGA GAACGAATTA
AATTCAAAAA TAGAGGAAAT TTCTTCGTTA GATATTTTGA CATTAACTAA TAAGTATTTT
TCAAAACCTT TTCTGTGTAT TTCTGGAAAT AAAAATATAT GTTTAGAAAT TTCTAATAGG
TGGAAGAAAA ACTTTTAG
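
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. Below is a minimal sketch of how the GC content field could be recomputed from such a block; the helper names are hypothetical, and only the first displayed row is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_count(seq: str) -> int:
    """Number of G or C bases in an already cleaned sequence."""
    return sum(1 for base in seq if base in "GC")


def gc_percent(seq: str) -> float:
    """GC content as a percentage of the sequence length."""
    return 100.0 * gc_count(seq) / len(seq)


# First displayed row of the gene sequence (60 bases).
first_row = "ATGTTAAAGA GATATTTTTT AAATAATAAA AAAAGAAATT TTTCAATTGC TTCAATTTGG"
seq = clean_sequence(first_row)
print(len(seq), gc_count(seq))  # 60 10
print(round(gc_percent(seq), 1))
```

Passing the full 1218-base sequence to `gc_percent` gives a value that can be compared with the GC content listed in the Gene Information section.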
 
Protein sequence
MLKRYFLNNK KRNFSIASIW IKGGSDMDST GKKGINKILC SLLTRGCEGF NNLTLSEYIE 
SYGAELNQEI FEDGISISIK SLNEHFSKLF PLLELIINKP ILSETEFKKV KKSSIDHIKK
DKENPFNICF EKWRKIVYSN HPYAFNTIGN ASDVSKITYE DILLEFKNLK KREKYLISNN
PEINGENYGT LEKKILKEKS DPLNHNLKTT NRFDYISNDT NQTIIMMGDQ TCSRRSSEYF
PLKVLESYLS YGMSAALFKL FREKHGITYD LGVYYPIRSG NAPFLIYLSV SNDQALFAFE
LLSTLWKNLL LNPLTDAEIF LAKEKLKGSF LLGNQSLDEI LHRKIQLVSY GISPISENEL
NSKIEEISSL DILTLTNKYF SKPFLCISGN KNICLEISNR WKKNF
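
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; the `translate` helper is hypothetical. It does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid string follows the
# standard assignments, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last displayed rows of the gene sequence.
print(translate("ATGTTAAAGA GATATTTTTT AAATAATAAA AAAAGAAATT TTTCAATTGC TTCAATTTGG"))
# MLKRYFLNNKKRNFSIASIW
print(translate("TGGAAGAAAA ACTTTTAG"))
# WKKNF
```

The two outputs match the first 20 and last 5 residues of the protein sequence above.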