Gene Emin_0169 details

Gene Information

Locus tag: Emin_0169
Symbol:
ID: 6264020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 183751
End bp: 184773
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 34%
IMG OID: 642610632
Product: squalene/phytoene synthase
Protein accession: YP_001875070
Protein GI: 187250588
COG category: [I] Lipid transport and metabolism
COG ID: [COG1562] Phytoene/squalene synthetase
TIGRFAM ID:
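
The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 183751
end_bp = 184773

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1023 340
```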


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000509815
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.0348824
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAAC AAGAACAAAA GCTGTTAAAT GATTTGTTGA AAAAAACGGC AAGAACGCTT 
GAGTTAAGCG CTAAAGTTTT ACCCTCAGGA TTTAGGGAAA CGTTTAGTAT CGCTTACCTT
GTATGCCGCT GCGCGGATAC TGTTGCCGAT ACTGATTTAA TAGATTTTGA AAGAAGGCTT
TTTTGGATAG AACGTTTTCC TGATATTATT AACAAAAATA AATCCGGGGA AATAGAAAAA
ATAATTAAAG AAGTTTCTTC AGATTCTTTA AAGCAAAACG AAAGATTTCT TTTGCAAAAA
ATACCGTTTG TAGCCAAAAT ATACGGCATG CTTAATAAAG AAGATAAGGA ACTTGTTTTT
GATATATTAA AAAAGGTGTG CGAAGGCATG TCTTTTGATT TAAAAACATT CAAGAAAGGC
GGTCTTACCT GTTTAAAAAC TAAGGAAGAA CTTGAATATT ACTGTGATAC CATGGGCGGC
GCGCCCGGTG TTTTTTGGAG CAAGCTTATT TTAAAATATA CGCCTGTGGC TTTAGATAAA
GACTCTTTTA TTAATATGGG GCGCAATGTC GGCCGGGCTT TACAAATAGT AAATGTTTTG
AGAGATATTA AAGAAGACCT AAATAACGGC AGGTGTTATT TTCCCGAAGA TGAATTAAGA
ACTGCCGGTG TTAAGCGGGA AGATTTGAAA AATAAAATAC TATCCGAAGA ATTGCTGGAT
GTTTTAAAAA AATGGATTGT TTGGGGCAGG GATAACATAG GTTCCGGCAG TGCTTTTTAT
AAAGCCATAC CCGTAAAACA ATGGCAAATA AGAATATCCG TAGCGTGGCC TATGCTGTGG
AGCCTTGATA GTTTTATCTT GCTTCTTAAA GCACGAAATA CTTTTGGAAA TGAAAAAGCA
AGAATATCCA AATTTAAAAT TTATATTACT ATTTTGTTAA GCCTCGGGTA TATTATTTCA
AACAACTTTT TTGATTTTAT GTTTTTCAGA CGGGTTAAAA AAATAGATAG CTTGATTAAA
TAA
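
The gene sequence above is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line of the sequence is used as input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied as shown above.
first_line = "ATGAATGAAC AAGAACAAAA GCTGTTAAAT GATTTGTTGA AAAAAACGGC AAGAACGCTT"
seq = clean_sequence(first_line)

print(len(seq))
print(f"{gc_fraction(seq):.1%}")
```

Pasting the full 1023 bp sequence into `clean_sequence` in the same way would allow a comparison with the GC content field.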
 
Protein sequence
MNEQEQKLLN DLLKKTARTL ELSAKVLPSG FRETFSIAYL VCRCADTVAD TDLIDFERRL 
FWIERFPDII NKNKSGEIEK IIKEVSSDSL KQNERFLLQK IPFVAKIYGM LNKEDKELVF
DILKKVCEGM SFDLKTFKKG GLTCLKTKEE LEYYCDTMGG APGVFWSKLI LKYTPVALDK
DSFINMGRNV GRALQIVNVL RDIKEDLNNG RCYFPEDELR TAGVKREDLK NKILSEELLD
VLKKWIVWGR DNIGSGSAFY KAIPVKQWQI RISVAWPMLW SLDSFILLLK ARNTFGNEKA
RISKFKIYIT ILLSLGYIIS NNFFDFMFFR RVKKIDSLIK
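
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds a codon table from the amino-acid string in NCBI base order (T, C, A, G), relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The `translate` function is a hypothetical helper, it stops at the first stop codon, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon order follows the NCBI convention: first base varies slowest,
# third base fastest, each over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGAATGAAC AAGAACAAAA GCTGTTAAAT"))  # prints: MNEQEQKLLN

# Last 18 bases of the gene sequence, including the TAA stop codon.
print(translate("GATAG CTTGATTAAA TAA"))  # prints: DSLIK
```

Both fragments agree with the start and end of the protein sequence listed above.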