Gene P9211_00771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00771 
Symboldap2 
ID5730227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp81970 
End bp83907 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content37% 
IMG OID641284420 
Productesterase/lipase/thioesterase family protein 
Protein accessionYP_001549962 
Protein GI159902618 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.248963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.78651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCAC CAAAGAATCA AGTCAGTTCA AAATTCTTAG ATGCAGAGGT TGTTTTGGGT 
GAATTCCCCA AAATCAAATC CCCTAGAATT CTTGGTGATT GGGTCTTCTG GCTAGAACAG
CGTCCTTATG AGAATGGGAG AACCACACTC CTCACTCGTC CTTGGGGGAG GTTTGATTGC
CTTCCACAAG AGTTAACACC TTTTCCAGTA AATCTTAGAA CTCGTTTGCA TGGTTATGGA
GGTTCGCCAC TTGCCTTGGT TAAGCAAGCT GATTGCTTTG TAATGACATG GATTGATGAC
CAGCAAGGGG GTTTATGGCA TCAAAAATGG ATTATTGCTG ATCAAACAAA ACCAACAATT
TTAAAATCTC TGTCTAGCCC AATTTGTTTA TCTTTGAAGG ACAAGTATTG TCTTGCTGAT
GGTTTGATTG ACTTACAATT TAATAGATGG ATTGGTGTGA TGGAGGAAGA TAATAAAGAT
TATTTGGTTT CATTTGCGCT CGATAAAGAA TTAAAAGCAC CAATGATTCT TCATCAAGCA
ATTGATTTTC TTGGGTATCC AACATTAAGT ATAAAGTCTG ATCAATTGGC ATGGGTTGAG
TGGCAAAAGC CATACATGCC ATGGGATCAA AGCCAAATCT TTCACTCTTT TATTAATGAC
ATAGGTAAAC TTAGCTCAGT TTCGATGTTG TCTGGATCAG ATAAATCTTC CCAAAAAAGT
TCTGCTTTTC AGCCTCAATG GTTGCCTAAT GGTCAATTAA TTGTAGCTGA AGATAGTAGT
GGATGGTGGA ATCTTAAGAT TGCAGGGCCA GATTTTTCTT CTAATTTAAC TAATCAATTT
AGTAATCTTT GGCATATAAA AGCTGAAGCT GCCTGTCCCC AGTGGATTCA TGGGATGTCT
ACCATTGCTT CTTCTGGGAA AAAAGAAATT GTTGCTCTTA GTTGTCAAGA AGGTAGTTGG
TCCATGAGTG TTGTAAACAA GAGTGGTTCA GTCACAAAGT TGCAACTACC TTTTGAACAT
TTTGAAGATG TATCTTCTGA GGAAGGAAAG GCAGTTGCAA TAGCAGCTAA TTCTTTCCTA
GATTCTGGTC TGCTTGAAGT GAATTTAAAA AATGGTAGTT GGATTCATAA TTCCTTTAGA
GAGTCAATAG TTAAACCACA AGAAATCAGT ATTGCTGAAT CATTTTGGTT TAAGGGTTTT
GGAGGTGAGA TGAGTCATGC TTGGTATTAC CCCCCGATTC AGGGTCGATT GAACTATTCA
CCTCTTTTAG TGAAAGCTCA TAGTGGTCCT ACTTCTATGG CAAAAAGAGG TTTGAATTTA
GAAATTCAGT TCTGGACTTC TCGAGGATGG GGTGTTTTAG ATGTTAATTA CGCAGGATCA
ACAGGCTTTG GTCGAGCTTA TAGAGATCGC TTAAAACATT CTTGGGGAGA GGCAGATGTT
TTTGATTGCT CTCAAGCTGC CATGGAATTA ATTAATAATG GGAAAGCCGA TAAAAATTTA
GTTGCTATTG AAGGATCTAG CGCAGGAGGT TTCACGAGTT TATGTTGCCT ATGCTTTAGA
AATATTTTTA GGGTCGCTTC TTGTAAATAC CCAGTAATTG ATCTTCTTGA TATGGCAAAC
TCAACCCATC GCTTTGAAGA GTATTACTTA GATTTCCTGA TAGGTAAATT TAACAATAAC
AAGCATTTGT ATATGAGCAG ATCTCCTATC AATAATTTAG ATAAGATTAC TTGCCCTGTA
ATCTTATTTC AAGGATTAAA AGATAAGGTT GTTTCTCCTG AGAAAACTAA AGATTTGTTT
ACAGCTTTGA AAAATAAGAA AATACCTACT GAATTACATG TTTTTGATAA TGAAGGTCAT
GGCTTTAATC ATCGGTCTAC AAAAATTAAA GTTTTGCGAG AAACAGAATC ATTTTTTAGA
GAGCATTTAG GTATCTAA
 
Protein sequence
MVSPKNQVSS KFLDAEVVLG EFPKIKSPRI LGDWVFWLEQ RPYENGRTTL LTRPWGRFDC 
LPQELTPFPV NLRTRLHGYG GSPLALVKQA DCFVMTWIDD QQGGLWHQKW IIADQTKPTI
LKSLSSPICL SLKDKYCLAD GLIDLQFNRW IGVMEEDNKD YLVSFALDKE LKAPMILHQA
IDFLGYPTLS IKSDQLAWVE WQKPYMPWDQ SQIFHSFIND IGKLSSVSML SGSDKSSQKS
SAFQPQWLPN GQLIVAEDSS GWWNLKIAGP DFSSNLTNQF SNLWHIKAEA ACPQWIHGMS
TIASSGKKEI VALSCQEGSW SMSVVNKSGS VTKLQLPFEH FEDVSSEEGK AVAIAANSFL
DSGLLEVNLK NGSWIHNSFR ESIVKPQEIS IAESFWFKGF GGEMSHAWYY PPIQGRLNYS
PLLVKAHSGP TSMAKRGLNL EIQFWTSRGW GVLDVNYAGS TGFGRAYRDR LKHSWGEADV
FDCSQAAMEL INNGKADKNL VAIEGSSAGG FTSLCCLCFR NIFRVASCKY PVIDLLDMAN
STHRFEEYYL DFLIGKFNNN KHLYMSRSPI NNLDKITCPV ILFQGLKDKV VSPEKTKDLF
TALKNKKIPT ELHVFDNEGH GFNHRSTKIK VLRETESFFR EHLGI