Gene Elen_1043 details


Gene Information

Locus tag: Elen_1043
Symbol:
ID: 8415333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1258927
End bp: 1260495
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 64%
IMG OID: 645024006
Product: ATP synthase F1, alpha subunit
Protein accession: YP_003181403
Protein GI: 257790797
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
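
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated on this page, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1258927
end_bp = 1260495
gene_length_bp = 1569
protein_length_aa = 522

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 1569 bp is 523 codons, one of which is the stop codon.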


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.890996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.113061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGAAA TCACCGCACA GTCTATTGAC GAGGCACTGC GCAAGCAGCT CGATGCGCTC 
AATACGAGCG TCGAATCTCG CGAAGTCGGA ACCGTCATCC AGATCGGCGA CGGTATCGCG
CGCGTCGACG GCCTCAAGGA CGCCATGGCG GGCGAGCTCC TCGAGTTCGT GGGTTCCAAC
GGCCAGACCG TCTACGGTAT GGCCCAGAAC CTCGAAGAGG ACGAAGTCGG CGCCGTGCTG
CTTGGTGACG TCACCGCAAT CAAGGAAAAC GACCAGGTGA AGACCACCGG TCGTATCGTG
GAGATCCCCT CGGGCAAAGA GATGCTCGGC CGCGTGGTGA ACCCGCTCGG CATGCCGATC
GACGGCAAGG GCCCCATCAA GGCCGAAGGC ATGCGCCCGG TGGAGTTCAA GGCTCCCGGC
GTCATCCAGC GCCAGCCCGT CGAGGAGCCG ATGCAGACCG GCATCCTGGC CATCGACTCG
ATGATCCCCA TCGGCCGCGG TCAGCGCGAG CTGATCATCG GCGACCGCCA GACCGGCAAG
ACGTCCATTG CGGTTGATGC CATCATCAAC CAAAAGGGCA AGGACATGAT CTGCATCTAC
GTCGCCATCG GCCAGAAGGC GTCCACGGTC GCCGGCCTGG TGGAGACGCT GGAGAAGCAC
GGCGCGATGG AGTACACGAT CATCGTAAAC GCCTCCGCGT CCGACTCCGC TCCGCTGCAG
TACATCGCCC CCATGGCGGG CGCCGCCATC GGCGAGTACT TCATGTACAA CGGCGAGAAC
GGCCAGCCGG CCACGGCCGA CAACCCGGGC CGCCACGTGC TGTGCATCTA CGACGACCTG
TCCAAGCAGG CCGTGGCGTA CCGCCAGATG TCGCTGACGC TGCGTCGCCC GCCGGGACGC
GAAGCGTACC CGGGCGACAT CTTCTACCTG CACTCCCGCT TGCTGGAGCG CGCGGTCAAG
ATGTCCGACG AGTACGGCGC GGGCTCGCTC ACGGCGCTGC CCATGATCGA GACGCAGGCC
GGCGACGTGT CCGCCTACAT CCCGACCAAC GTCATCTCCA TCACGGACGG CCAGATCTTC
CTGTCCACCG ACCTGTTCTT CCAGGGCCAG CGCCCTGCGG TCAACGTCGG CATCTCGGTG
TCGCGCGTCG GCGGCTCCGC GCAGGTCAAG GCCATGAAGC AGGTTGCCGG CACGCTGCGC
CTCGATCTCG CGTCCTATCG CGAGCTGCAG GCGTTCACGC AGTTCGGCAG CGACTTGGAC
AAGTCGACCC AGGACCAGCT GAACCGCGGC GCCCACATGA CCGAGCTGCT GAAGCAGGGT
CGTTACGTGC CGATGCCCGT CATGGACCAG GCTATGTCCA TCTACGCAGG TGCTCATGGC
TACCTCGACG ACATCCTGGT GTCCGACGTC GTCCGCTTCC GCGGCGAGTT CCTCGACTTC
ATCCACGCTT CCAAGCCGGA GATCGTCGAG GCCCTCGAGA AGGCGCAGAA GTTTACCGAT
GAAATCGAAA CCGACCTGAA CGCCGCTATT GAAGCTTTCA AGCTGCAGTT CTCGCCCTCG
GCTTCTTAA
 
Protein sequence
MTEITAQSID EALRKQLDAL NTSVESREVG TVIQIGDGIA RVDGLKDAMA GELLEFVGSN 
GQTVYGMAQN LEEDEVGAVL LGDVTAIKEN DQVKTTGRIV EIPSGKEMLG RVVNPLGMPI
DGKGPIKAEG MRPVEFKAPG VIQRQPVEEP MQTGILAIDS MIPIGRGQRE LIIGDRQTGK
TSIAVDAIIN QKGKDMICIY VAIGQKASTV AGLVETLEKH GAMEYTIIVN ASASDSAPLQ
YIAPMAGAAI GEYFMYNGEN GQPATADNPG RHVLCIYDDL SKQAVAYRQM SLTLRRPPGR
EAYPGDIFYL HSRLLERAVK MSDEYGAGSL TALPMIETQA GDVSAYIPTN VISITDGQIF
LSTDLFFQGQ RPAVNVGISV SRVGGSAQVK AMKQVAGTLR LDLASYRELQ AFTQFGSDLD
KSTQDQLNRG AHMTELLKQG RYVPMPVMDQ AMSIYAGAHG YLDDILVSDV VRFRGEFLDF
IHASKPEIVE ALEKAQKFTD EIETDLNAAI EAFKLQFSPS AS
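
The gene sequence begins with GTG, while the protein sequence begins with M. This follows from translation table 11, in which GTG may act as a start codon and an initiating codon is read as methionine. The sketch below illustrates this on the first and last rows of the gene sequence. It is not the database's own pipeline: the `translate` helper and the `START_CODONS` subset are assumptions made for this example.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these assignments with the standard code; it differs in
# which codons are permitted as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the table 11 start codons, enough for this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    A start codon in the first position is read as methionine.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = "GTGACTGAAA TCACCGCACA GTCTATTGAC GAGGCACTGC GCAAGCAGCT CGATGCGCTC"
last_row = "GCTTCTTAA"

print(translate(first_row))  # MTEITAQSIDEALRKQLDAL
print(translate(last_row))   # AS
```

The two outputs match the first 20 and the last 2 residues of the protein sequence listed above.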