Gene Emin_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1522 
Symbol 
ID6263570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1613492 
End bp1614946 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content47% 
IMG OID642612009 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_001876406 
Protein GI187251924 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000415981 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.83591e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATACGG GTATAGTTAC ACAGGTTATC GGCCCTGTTA TTGATATTGA GTTTAAAGAC 
GGCGCGTTGC CTAAGATTAA TAACGCCGTG GAAATCAAAT TCGGCGAACA AAAAATCGTG
GCTGAAGTTG CCCAGCAGCT TGGAGACAAT ACTGTAAGAG CAGTGGCTCT TTCCCCGACA
GACGGTCTTG CTCGCGGCGT AGAAGCCGTT GACACAGAGG ACGTATTGAG AGTCCCCGTC
GGTGAAGGCT GCAGAGGCAG ACTTATGAAC GTATTGGGCG CCCCCATAGA TTACGCGGGT
GAAATAAAAA CCGATAAAAA GATGCCTATT CACCGCGAGC CGCCTACTCT TGAAGAACAG
AAAACCACGC CCGAAATTTT TGAAACTGGT ATTAAAGTAG TTGACCTTTT GGCCCCTTAC
ATGAAAGGCG GCAAAGTAGG TTTATTCGGC GGCGCCGGCG TAGGAAAAAC AGTTCTTATT
ATGGAGCTTA TTAACAACGT TGCCCGCGAG CACAGCGGCA GCTCAGTGTT TGGCGGCGTG
GGGGAAAGAA GCCGTGAAGG CAACGACCTG TGGTTAGACA TGAAAGGAGC GGAACTTGCC
GACGGCAGCA CCGTTTTAGA TAAAACAGTT TTAGTTTTCG GACAGATGAA CGAACCCCCG
GGCGCGAGAG CGAAAGTAGC TTTAACAGCC TTAACACAGG CCGAATACTT CAGAGATGAA
AAAGGACAGG ACGTGCTGTT GTTTTTAGAT AATATTTTCC GCTATGTTTT GGCCAACTCC
GAAGTTTCCG CCCTTCTCGG GCGTATGCCT TCGGCCGTAG GTTACCAGCC CACTCTTAAT
ACGGAAATCG GACAGTTGCA GGAACGTATT ACATCAACAA ACAAGGGTTC TATTACCTCA
ATTCAAGCCG TTTACGTGCC CGCTGACGAC TTGACTGACC CTGGCGTAGC CTCCACATTT
ACCCACTTGG ATGCCACTAC CGTTTTGTCC CGCTCTTTAG TTGAGCTAGG CATTTATCCC
GCTGTTGATC CTTTGGAATC AACTTCCAGA ATTTTAGACC CCAGAGTATT GGGTGAAGAA
CATTACCAAG TGGCGCAAGG CGTACGCAAA ATTTTACAAA GATATAAAGA TTTGCAAGAT
TTAATCGCTA TTTTAGGTAT TGACGAACTT GGCGACGAAG ATAAAAAGAT TGTAGCTAGA
GCAAGAAGAA TACAGCGCTT TTTATCGCAG CCTTTCTTCG TGGGCGAAAA GTTTACCGGC
AGGCCGGGCA AGTATGTTAA ACTTGAGGAC ACCATTAAAG CTTTTAAAGG TTTAATAAAC
GGCGATTATG ACAATATTCC CGAACAGGCA TTCTTTATGT GCGGCGGTAT AGAGGACGTT
TTGGCTAAAG TAAGCAAAGG CGAAAACGAA GAGAAATCTG AGCCCGCCAA ACCCGCTAAA
GAAGAAAAAA GATAA
 
Protein sequence
MNTGIVTQVI GPVIDIEFKD GALPKINNAV EIKFGEQKIV AEVAQQLGDN TVRAVALSPT 
DGLARGVEAV DTEDVLRVPV GEGCRGRLMN VLGAPIDYAG EIKTDKKMPI HREPPTLEEQ
KTTPEIFETG IKVVDLLAPY MKGGKVGLFG GAGVGKTVLI MELINNVARE HSGSSVFGGV
GERSREGNDL WLDMKGAELA DGSTVLDKTV LVFGQMNEPP GARAKVALTA LTQAEYFRDE
KGQDVLLFLD NIFRYVLANS EVSALLGRMP SAVGYQPTLN TEIGQLQERI TSTNKGSITS
IQAVYVPADD LTDPGVASTF THLDATTVLS RSLVELGIYP AVDPLESTSR ILDPRVLGEE
HYQVAQGVRK ILQRYKDLQD LIAILGIDEL GDEDKKIVAR ARRIQRFLSQ PFFVGEKFTG
RPGKYVKLED TIKAFKGLIN GDYDNIPEQA FFMCGGIEDV LAKVSKGENE EKSEPAKPAK
EEKR