Gene P9303_18391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18391 
Symbol 
ID4775921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1598749 
End bp1600392 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content56% 
IMG OID640087348 
Productacyl esterase 
Protein accessionYP_001017846 
Protein GI124023539 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGTTG AAGGTCGAAC GAGCTCTGGA TCCGTGAACT GGCACGATGC CTGGCTGACA 
CTCTCTGACG GGGTCAAGCT CGTTGCCAGG TTATGGGTCC CGAAGGGTGA GGGACCCTGG
CCTGCTCTCG TGATGCGTCA GCCCTACGGA CGTGCGCTCG CCTCAACGGT GACTTACATC
CATCCTGGTT GGTGGGCAAG TCACGGCTAT CTGGTCGTGG TCCAGGACGT ACGTGGTCAA
GGGGATTCCG AAGGCCACTT CAATGGCTTC CTGCAAGAAG CTTCTGATAC CAGTCAGACG
CATGCATGGG TTCGGGAGTT GCCAGAATGC AATGGCCGTC TTGGAACCTA TGGGTTTTCC
TATCAGGGCC TAACACAGCT GCTTGCCGAA CCCGGGACGC CGCCACCGGA CTGTCTGGCA
CCAGCGATGG CAGGAGTTGA TGAGCGCAAC CATTGGAGTT GTGAGGGAGG TGCTCACTGG
TGGCATCTTG GCTTGGCCTG GGGGCTGCAA CTTGCAGCAC TACAAGCTCG TCGCTGTGGC
AACTGGGAAG CATGGAGAGA GCTTCGCCGC AGTTTGGAAG ACGACAGCTA CCTGTATGAG
GGTCCGGCAC TTCTGAAACG CCACGATCCC GATGGAATGA GCTTGAGATG GCTACAACAA
GCGAGCCAAA ACGATCAAGG CTGGGTTGTA CACAAGCCCT TGGATTCCTG GCTGCGTCAA
CCGATGCTGC TTCTGGGTGG CTGGTGGGAC CCCCATTTGA ATGGCTTGCT TGATCTCTAT
CAACGATCAA GCCAAGTAGG TGGTAGTCCA GAACTTCACA TCGGTCCAGC GACTCACCTG
CAGTGGTGGC CTGATGCACA GCAACTTCAG CTGGAGTTCT TTGATCGCCA TCTGCAATCT
TCGAAAGCCT TAACGAATTC AAGACCCCAT GGGCGGATCT GGAATATCAC GTCTTGTTCT
TGGCAGAGAT TTGTAAGCCC CACCCAGACC ACAACATCAG CCCATGCCGG CTGGAGTCTT
GTCAGTGGAG GGATGGCCTG CTTGGACCCC TCAGAAGGCG CCCTGCATCA GAACAAGGAA
GGTGGCGGCG TGGTTTATGT GGTCCATGAC CCTTGGCGAC CGGTTCAAGC AGTGGGAGGA
CATCTCAGCC CAAAACCAGG AGTTGCTGAG CGCAGCGCCG TGGACCAGCG CGCCGATGTG
GCTACCTTCA CAAGCACTGC TTTGCAGGAA CCTCTCCAAC TCAATGGGAT CCCATTACTG
CAGCTGACCG TGCAGTCAGA TCAACCGGGA TTTGACCTTT GCGTTGCCTT CTCCATTGTT
AATCGCAGCC ACAGCGAGGT GAAGCAGCTC TCAACAGGTT TTCTGCGTGT GCAAGGAGAG
CAGGCCCTGC GCATGCTGCC GCGCAAGGTG AAACTTCAAC CAATATTTGC AGACCTGCAG
CGAGGAGAAC ATCTGCGCCT ATCTCTCGCA GGCGCTGCCT GGCCGGCCAT TGGTGTCAAC
CCAGGCCACG ATCGTCATCC CTGTGGCCCT CCAGGACCCC ATTGCCAAGT GGTGACCATG
ACACTGCAGC TCAATGGATC CAAGTTGAGG CTTTTGCCAT GGAACTCCGG CAAAATAGAT
TTCGATTTGC CCCAAGAGTT TTGA
 
Protein sequence
MCVEGRTSSG SVNWHDAWLT LSDGVKLVAR LWVPKGEGPW PALVMRQPYG RALASTVTYI 
HPGWWASHGY LVVVQDVRGQ GDSEGHFNGF LQEASDTSQT HAWVRELPEC NGRLGTYGFS
YQGLTQLLAE PGTPPPDCLA PAMAGVDERN HWSCEGGAHW WHLGLAWGLQ LAALQARRCG
NWEAWRELRR SLEDDSYLYE GPALLKRHDP DGMSLRWLQQ ASQNDQGWVV HKPLDSWLRQ
PMLLLGGWWD PHLNGLLDLY QRSSQVGGSP ELHIGPATHL QWWPDAQQLQ LEFFDRHLQS
SKALTNSRPH GRIWNITSCS WQRFVSPTQT TTSAHAGWSL VSGGMACLDP SEGALHQNKE
GGGVVYVVHD PWRPVQAVGG HLSPKPGVAE RSAVDQRADV ATFTSTALQE PLQLNGIPLL
QLTVQSDQPG FDLCVAFSIV NRSHSEVKQL STGFLRVQGE QALRMLPRKV KLQPIFADLQ
RGEHLRLSLA GAAWPAIGVN PGHDRHPCGP PGPHCQVVTM TLQLNGSKLR LLPWNSGKID
FDLPQEF