Gene P9303_03061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03061 
Symboldap2 
ID4778861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp321324 
End bp323282 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content56% 
IMG OID640085808 
Productesterase/lipase/thioesterase family protein 
Protein accessionYP_001016324 
Protein GI124022017 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGCA CCAGCCATCA CCAGCGAAAC CCGAAGCAAA CCACTCAACC TCTTTCTGCC 
AGACGCGCTC TCGGACAAAC ACCCACCCTC AAAGAACCGC GCCTGATTGA TGACTGGGTG
CTTTGGCTGG AGCAACGTCC TCAGGAACAT GGTCGCACCA CTGCCCTTAT CCGTCCCTGG
GGTCAATCTG ACCACCCCCC CCAAGAGCTG ACACCTGCCC CAGCCAACCT GCGTAGTCGT
ATTCACGATT ACGGAGGAGG GGTTCTAGCA ACAGCCTGCC AAGACAACCA GCTACTGATG
GCCTGGATCG ATGATGCCGA TGGCTGCCTC TGGTTCCAGC GCTGGCAGGG CCTCAACCAG
GCCACAAAAG GTAAGAAGGC ATTATCTCCG CTCAAGCCGC CGCTTCGCCT CTCAAAACCA
AACGATGCCC AACTTGCTGA TGGTCTGATT GACCTCCCAC GACAGCGCTG GCTCGGAATC
ATGGAAGCAG ACAAGCGGGA CTGGCTGGTG ACCTTCTCCC TCAACCATGA GAACCAGGCT
GCCACGGTGT TGCATCGCCC TGCTGATTTT GCTGGTTACG CGATCCTCAG CCCGAATGGA
GATCAACTGG CCTGGGTGGA ATGGCAACAA CCGGCCATGC CCTGGGAGGC AAGCCAACTC
TGGTGGGCCA GCCTCGACCC TGCGGGTTTG ATCCAAAGCT CGGCCTGTCT AGCTGGTAGC
AAACCACTTG ATCACAAACA AACGTCCGTT TTCCAGCCCC TTTGGCTACC CAATGGAGAG
CTGGTTGTCA GCGAAGACAG CAGCGGCTGG TGGAATCTGA TGGTGGCAAA GCTGACGACT
GACCCCACTG TCCAACCCAC TTGGCGACGC CCCTGGCCAC TTTCAGCCGA AACCGGCATG
CCGCAGTGGG TTTATGGCAT GAGCAGCAGC GCATGGGATG GAGAACAAAT TCTGACCGCC
GTCTGTGAAC AAGGTTCTTG GAGGCTGAGC CGCTTGGCCG ATGATGGACA GATCAGCACC
ATCAACCAAC CTTTTGATGA TCTAAATGGT CTGCAGGCAC AGGAAGGTCG AGCCGTAGCC
ATCGCTAGCA ATGCCACCAC GAGCCCTGGG CTACTAGAGC TCAACCTCAA CTGTGGCAGC
TGGAAGCACA CCCCAGCCAA TGAGCCTTTA CTGAATGCTG ATGCAATCAG CGTTGCGGAA
CCTATCTGGT TTGAAGGCTG CCATGGCCAG GCAACCCATG CCTGGTATTA CCCGCCAATC
AATGGCAGCA AAGGCCCTGC GCCACTACTT GTCAAAAGCC ATAGCGGTCC TACCAGCATG
GCCAACCACG GTCTAAGCCT CAGCATTCAG TTCTGGACAT GCAGAGGCTG GGGAGTGGTG
GATGTGAACT ATGGCGGCTC CACTGGATTT GGCCGTGCAT ACCGCGAACG CCTACGGGGA
GGCTGGGGTG AGACAGACGT AACGGATTGC GCACAAGCAG CACTTGCACT AGTGAAATGC
AACAAGGCAA ACCCAACACA AATCGCCATT GAAGGAGGCA GTGCCGGTGG ATTTACCACC
CTGGCCTGCC TTTGTTTCAC AGATGTCTTT CGCGCTGCTG CCTGCCGTTA TGCAGTGAGT
GATCTCACCG CCATGGCAGA AGACACCCAT CGATTTGAAG CGCGATACCT CGATCACCTA
GTAGGCCGTT GGCCCGACCA AAGACAACTT TACGAAAACC GCTCACCTCT CCTGCATGCC
AACAAGATCC AATGCCCAGT GATCTTCTTT CAGGGACTTC AAGACAAAGT GGTTCCTCCA
GATCAAACAG AACGGATGGC CAATGCCTTA AAAGAAAACG GCATACCAGT TGAACTACAC
ATTTTTGAGC AGGAAGGCCA CGGCTTTCGC GACAGTGCTG TCAAGATCAA AGTCTTAGAA
GCAACTGAGC AATTCTTCCG CCGCCACCTA AAGCTCTAG
 
Protein sequence
MESTSHHQRN PKQTTQPLSA RRALGQTPTL KEPRLIDDWV LWLEQRPQEH GRTTALIRPW 
GQSDHPPQEL TPAPANLRSR IHDYGGGVLA TACQDNQLLM AWIDDADGCL WFQRWQGLNQ
ATKGKKALSP LKPPLRLSKP NDAQLADGLI DLPRQRWLGI MEADKRDWLV TFSLNHENQA
ATVLHRPADF AGYAILSPNG DQLAWVEWQQ PAMPWEASQL WWASLDPAGL IQSSACLAGS
KPLDHKQTSV FQPLWLPNGE LVVSEDSSGW WNLMVAKLTT DPTVQPTWRR PWPLSAETGM
PQWVYGMSSS AWDGEQILTA VCEQGSWRLS RLADDGQIST INQPFDDLNG LQAQEGRAVA
IASNATTSPG LLELNLNCGS WKHTPANEPL LNADAISVAE PIWFEGCHGQ ATHAWYYPPI
NGSKGPAPLL VKSHSGPTSM ANHGLSLSIQ FWTCRGWGVV DVNYGGSTGF GRAYRERLRG
GWGETDVTDC AQAALALVKC NKANPTQIAI EGGSAGGFTT LACLCFTDVF RAAACRYAVS
DLTAMAEDTH RFEARYLDHL VGRWPDQRQL YENRSPLLHA NKIQCPVIFF QGLQDKVVPP
DQTERMANAL KENGIPVELH IFEQEGHGFR DSAVKIKVLE ATEQFFRRHL KL