Gene P9303_18901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18901 
SymbolubiD 
ID4777727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1651838 
End bp1653403 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content55% 
IMG OID640087399 
Product3-polyprenyl-4-hydroxybenzoate decarboxylase 
Protein accessionYP_001017897 
Protein GI124023590 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.358341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTGC TTCGTCCCGG ACCTGCGTCC CAAGACATGC GGGACTTCCT AGCCCTACTT 
GAGCAACGTG GCCAACTAAG GCGGATCAGT GCCCCCGTAG ATCCAGATTT AGAACTCGCC
GCTATTACTG ATCGGGTGCT GGGCTTGGGT GGGCCGGCCC TGTTGTTCGA AAACGTGATT
GGTTCATCGA TGCCAGTGGC AGTGAATCTG ATGGGCACCT TGGAGCGAGT GGTGTGGAGC
ATGGGTCTGG ACAACGCTCA GCAACTTGAG GATCTGGGAA CTCGTTTGGC TCTGCTGCAG
CAACCAAGGC CGCCAAAGGG TCTGCAAGAA ACCAGACAGT TTGCAAGTGT GTTTTGGGAT
CTGATCAAAG CGCGACCAGA TCTGGATCTC ACACCTCCCT GCCATCAGCA GGTGCTGCGT
GGAGATGCGT TGAATCTCGA CAACCTGCCT TTGATCCGTC CCTGGCCAGG CGATGCCAGT
GGTGTGATCA CCCTGGGCCT TGTGATTACC AAAGACCCGG AGACGGGAGT GCCGAATGTG
GGGGTCTATC GCCTACAACG CCAATCGCCA AAGACCATGA CGGTGCATTG GCTGAGCGTT
CGCGGCGGAG CCAGGCACCT ACGCAAAGCT GCTGCCATGG GCCAGAAGCT TGAGGTAGCC
ATTGCTATTG GTGTACATCC TCTGCTGATC ATGGCCGCAG CAACTCCGAT TCCTGTGCAA
CTCAGTGAAT GGCTTTTTGC TGGTCTTTAT GCAGGAGAGG GTGTACGTCT CACTGGCTGC
AAAACACTGG ATCTGAAAGT CCCTAGCCAC AGTGAAGTGG TGTTGGAAGG AACCATTACA
CCAGGTGAGG AATTAGAGGA TGGTCCATTT GGTGACCATA TGGGCTTTTA CGGAGGAGTG
GAGTCCTCAC CATTGGTGCG CTTCCACTGC GTGACTCAAC GCCGCGATCC GATTTTCCTC
ACGACGTTTA GCGGCCGCCC CCCCAAGGAA GAAGCGATGC TTGCCATCGC CTTGAACCGC
ATCTACACGC CTATCTTGCG TCAACAGGTG CCGGAGATTG TTGACTTCTT TCTACCGATG
GAGGCCCTCA GTTACAAGCT GGCGGTGATT GCAATCGATA AGTCTTATCC AGGACAAGCC
AAGCGTGCCG CGATGGCTTT CTGGAGCGCA TTGCCCCAAT TCACCTATAC AAAATTCGTT
GTCGTGGTAG ATGCAAACAT CAACGTGAGA GATCCACGCC AAGTCGTATG GGCTATCGCT
GCACAGGTCG ATCCGCAAAG GGATCTCTTT GTACTTGAGA ACACCCCCTT CGACAGCCTT
GATTTCGCTA GCGAACATCT AGGGCTTGGA GGCAGGATGG CCATCGACGC CACCACTAAG
ATTGGGCCTG AGAAGCGCCA TGAATGGGGC GAGCCACTCA GCCGGGATGC AGATCTAGAA
AGCAGGGTGG ATGCTCGATG GCAGGAGTTG GGGCTGGAGG ATTTGGGCAG CGAAGAACCT
GATCCAAGCC TGTTCGGTTA TGTGATGGAG AGCCTTTGTC GTCATGCGAT GGTCAAAAAG
ACCTGA
 
Protein sequence
MPLLRPGPAS QDMRDFLALL EQRGQLRRIS APVDPDLELA AITDRVLGLG GPALLFENVI 
GSSMPVAVNL MGTLERVVWS MGLDNAQQLE DLGTRLALLQ QPRPPKGLQE TRQFASVFWD
LIKARPDLDL TPPCHQQVLR GDALNLDNLP LIRPWPGDAS GVITLGLVIT KDPETGVPNV
GVYRLQRQSP KTMTVHWLSV RGGARHLRKA AAMGQKLEVA IAIGVHPLLI MAAATPIPVQ
LSEWLFAGLY AGEGVRLTGC KTLDLKVPSH SEVVLEGTIT PGEELEDGPF GDHMGFYGGV
ESSPLVRFHC VTQRRDPIFL TTFSGRPPKE EAMLAIALNR IYTPILRQQV PEIVDFFLPM
EALSYKLAVI AIDKSYPGQA KRAAMAFWSA LPQFTYTKFV VVVDANINVR DPRQVVWAIA
AQVDPQRDLF VLENTPFDSL DFASEHLGLG GRMAIDATTK IGPEKRHEWG EPLSRDADLE
SRVDARWQEL GLEDLGSEEP DPSLFGYVME SLCRHAMVKK T