Gene A9601_00811 details


Gene Information

Locus tag: A9601_00811
Symbol: dap2
ID: 4716764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 83680
End bp: 85605
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 27%
IMG OID: 640077779
Product: esterase/lipase/thioesterase family protein
Protein accession: YP_001008476
Protein GI: 123967618
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
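
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, which is an assumption here rather than something the record states. A minimal Python sketch of that arithmetic follows; the function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 83680
END_BP = 85605


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues in a CDS: codon count minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1926 641
```

Under that assumption, 85605 - 83680 + 1 gives 1926 bp, and 1926 / 3 = 642 codons, one of which is the stop codon, leaving 641 aa.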


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATG ATGATCAGTT AAAAGTTAGG CAAACTGTAT CTAAAAAGAA ATCTTTTAAA 
GAATTAACTA TTATTAGGGA TATCATTTTT TGGATTGATC TTGTTGGTGA AGGTCAAAAT
GAAAATGCCA TTTTTGCAAG ACCATTTAAT AAAAAAGAGG CGTTTCCTCA GAAATTAACA
AGTAAAAAAT ATAATATTAA AAATAACTTT CATGGATATG GTGGTAAATC TTATAAATGT
ATATATTTAA AAAATAATTT TTATTTGATA TGGATTGATC AGATTACCAA CGCAGTTTGG
TTTCAAATTT TTAAAGAGGT TGCATCAAAT TATAGAAGTC AAAAAAGATA TCTCGATTCA
GTTCAAGAAC CAAGACAACT ATCTAAATCA ATTGATGGAA ATTTTGATTC TTCGTTTGTT
ATTTCTCAAA AAAATTTTTT ATATGGTATT TGTGAAATAA ATAATAGAGA TTACTTATTT
TCTTTAAACC TAAAAAAAAC TAAACAAGAT ATTTACCGAA TAAAAAAATT TAAAAATTTC
GCTGGAGAAT TATCTTGTAA TTCTTCTGTT AGCTTACTTT CTTGGGTCGA GTGGGGTTCT
CCATATATGC CTTGGGAGAA AAATGATCTT TTTTTTGCTC AAATTGACTT AGATGGAGAG
ATAACAAAAA TAAAAAAATT CTCAGATAAG CTGATTAATG CCAAAAAAAA CGTTTCTTTT
TTTCATCCTT ATTGGATAAG TGAAACTCTT TTAGTATGTT CTGAAGATAG TTCTGGATGG
TGGAACTTAT TGTTTTTAGA TGCTAGTAAA ATTGAGAATA TTTTTATTAA AAAAAGAGTA
GAGAGAAATT TTGTTGAATA TGGAGTACCT CAGTGGGTCT CAGGAATAAC ATTTTTTTCA
GGGGATATAA AAGATTTATT TTGTTTAGCA AAAAAAGAAA ATAATTTAGT AGTTGAACAA
TATAAAGATC TTCAATGCGT TAAAGAATTT TCTACTCCTT TTACCTCAAT AAGTGATTTC
AGTGTTTTTG AGAAGAAAGT AGTTTTGAAA GGTCATGGAT CTGATTTTCT TGGAAATTTA
CTTGAAATTG ATTTTAAAAA GGAAGTTTTA TCAAATGTTT TTGAGGAAAT AAATGCTGAA
TATATAAAAG TTTGTTCAAA ACCTGAAACA TTTTGGTTTA AAGGTTTTGA AGCTCAATCT
ACTCATTCTT TTCTTTATAG GCCGCTCGTA GAAAATTTTA GAAAGCCACC GCTCCTTGTT
AGAGCACATA GCGGACCAAC TTCATGTTTT GATGGATCAT ATAATTCTGA GGTTCAATAT
TGGACTTCGA AGGGATTTTT TGTTGCTGAA GTTAATTATG GAGGATCATC AGGATTTGGC
AAAGCATATA TAGAGAGGTT GAATTGTAAA TGGGGTATTG TTGATTCTTA TGATTGCAAA
GCACTAGCTC TTGAATTGAT TAAATCAAAT CAAGTTGATA GTGAAAAAGT AGTAATTTTT
GGGAATAGTG CTGGTGGGTT AACTGCCCTG AATTGTTTAT TATATGGGTC TATTTTTACA
GCAGCAATTT GTAAATATCC TGTTATTGAT TTGAAGGATA TGCATTACAA CACTCATAGG
TTTGAAAAAG ATTATTTAAA TTCTTTGGTA GGAATTTATG CACAAAATCA TGATGATTAT
ATAAATAGAT CACCGATAAA TCATATTAAC AAAATAAAAA AACCTATCTT ATTGTTTCAT
GGAAAAAAAG ATAAAGTAAT TTCTTATAAA CAAACTTTTA AAATCCAGGA AATTTTGATT
CAGAATAATA AATATTCAGA AGTTATTTTT TTTGATAATG AGGGGCACGG TTTTAGAAAT
ATTGAAAATA AAGAAATAGT AATGCAAAAA TCTATGGAAT TTTTAAAAAA TGCTTTGAAT
ATTTAA
 
Protein sequence
MSNDDQLKVR QTVSKKKSFK ELTIIRDIIF WIDLVGEGQN ENAIFARPFN KKEAFPQKLT 
SKKYNIKNNF HGYGGKSYKC IYLKNNFYLI WIDQITNAVW FQIFKEVASN YRSQKRYLDS
VQEPRQLSKS IDGNFDSSFV ISQKNFLYGI CEINNRDYLF SLNLKKTKQD IYRIKKFKNF
AGELSCNSSV SLLSWVEWGS PYMPWEKNDL FFAQIDLDGE ITKIKKFSDK LINAKKNVSF
FHPYWISETL LVCSEDSSGW WNLLFLDASK IENIFIKKRV ERNFVEYGVP QWVSGITFFS
GDIKDLFCLA KKENNLVVEQ YKDLQCVKEF STPFTSISDF SVFEKKVVLK GHGSDFLGNL
LEIDFKKEVL SNVFEEINAE YIKVCSKPET FWFKGFEAQS THSFLYRPLV ENFRKPPLLV
RAHSGPTSCF DGSYNSEVQY WTSKGFFVAE VNYGGSSGFG KAYIERLNCK WGIVDSYDCK
ALALELIKSN QVDSEKVVIF GNSAGGLTAL NCLLYGSIFT AAICKYPVID LKDMHYNTHR
FEKDYLNSLV GIYAQNHDDY INRSPINHIN KIKKPILLFH GKKDKVISYK QTFKIQEILI
QNNKYSEVIF FDNEGHGFRN IENKEIVMQK SMEFLKNALN I
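
The relationship between the gene sequence, the protein sequence and the GC content field can be checked with a short script. The Python sketch below is illustrative only: clean, gc_content and translate are hypothetical helper names, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 differs in which codons may act as start codons; that does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table built in the conventional TCAG order. Translation table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAGTAATG ATGATCAGTT AAAAGTTAGG"
print(translate(first_blocks))  # MSNDDQLKVR
```

Pasting the full gene sequence into translate should reproduce the protein sequence above, and gc_content on the same input can be compared with the reported GC content field.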