Gene Elen_0511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0511 
Symbol 
ID8414795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp657053 
End bp660061 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content56% 
IMG OID645023482 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_003180885 
Protein GI257790279 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.3924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAGA CGACCGTGAC AAGGCGCGCG TTCGCTCAGC TTGCCGCCGC GACAGGCGCA 
ATAGCCGCTA TGGGTGTTGG AACTCGTCCT GCGGTGGCGC TTACCGATGG CAATTCGGGG
ACGGCGGGCG AGCGGGGGAT CAAGAAGATA CGCTCGTGCT GCCGCGGCTG CGGAAAGGTC
GAATGCGGTG TGTGGGTGTA CATCCAAGAT GGCAAAGTGG TTCGAACCGA AGGTGACGAA
ACCTGCTTCA ACACTATGGG CAATCATTGC AGCAAGGGGC AAGCGTCCAT TCAGGCTGCG
TATCATCCTG ATCGTATCAA GTTCCCAATG AAGCGCACGA ACCCGAAGGG CAGCGAAGAT
CCAGGTTGGG TGAGGATCAG CTGGGACGAG GCCTACCGGA CCATTGCGGA CAATATTATG
CAGTTGCGTG AGAAGTACGG TCCTGAAAGC TTGTTCACGT GGTGCGGTAC GGGCCGACAG
TGGTGCATGC AGTCCGATGC TGGCATGGCG CTTGAGCTGT TCGGTACGCC GAACATCATC
GCGGCGTACC AAGTGTGCAA GGGTCCGCGC CATTTTTCGT CGCGTCTCGA CAACGTTCAG
GCGTGGTCTT GGAGTGAGGT GATAAACCAT TCGACGAAAT ACGTCCAATG GGCTACCGAT
CCCTCCGTTT CGAATTACGA CGATTCGTCT CGTTTGGTAA TTGATGTGGC GCGCGAAGCT
GAGGCTTTTA TCGTGGTTGA CCCTCGCCTG TCGAATCTTG GCCGCACTGC GAAGTATTGG
TTAAACTTGC GACCTGGCAC CGATAATGCC ATGGCGCTAG GATGGTGCCA TATCATATTG
AAGCACGATC TGGTGGATTG GCAGTTCGTG AAACGCTGGT CCAATGCCTC GTTCATCGTC
GTGCCCGATA TGGACCCTTC GGGCTACACC GAGGCGGTTC AAAACACGAA AAGCCCCTAC
GAATATCGTA CGCGTTTGCT GACCGAAGCC GACATAGATC CTTCTATGGT TGACTGGGAG
ATTGAAGGCG AGGGGAATCC AAAACGTTAT CTTGTGTACG ATCAGATCAA TCGGCGCTGG
ACATATTGGC AAGCCGATCC CGAAGACGCG CATTGGGAAG GCGAGACCTG GACAAAACAA
ACCTCTGGAT TCACGCAAGA CGTTTCGCGT CTTCGCGACG ACGAATCGAA GGTGGCTGGC
TGGATCGCGG ATCTGTCGGA ATTCGATCCG CGTATCGATC CGGCGCTCAC CGGGGAATTC
GAAGTAAGGC TTAAGGACGG AAGTACGCAT ATCGGCCGTC CTGCATGGGA TTTGTGGGCT
GAGTACCTGC AACAATTCAC TCCCGAACGA GTGTCGGAGA TCACGGGCGT GGATGCCCAG
CTTATCGAAG ACGCCGTGGT TGAATGGGCA ACGCGCGATG ACCCGCGCAT ACCCAACGGC
GGTATAAATT ACGGTCTGGG CGTGGAGCAT GCGGGCAACT CAACGCAGAA TTGCAGGGCC
ATTATGGCGG CTTGCGCAAT GGTAGGCGCC ATCGACACGC CGGGCGGCCA GCGCGGTGCG
ACGAACGGAT GGACCGAACA GTCGGGTCCG TGCGCCATGC TTCCGTCGAT GGCGGCATTC
GCGTTCATGC CCACGCCCGA CTTGTCGCTC AAGATGGCCG GAAATGAGAA GATGCCGCTT
CTATACTGGT ACGGTGTGTG GTGCGATGCC AATGCCGCCA TGGAATGCGC GCACAAGGAG
CCGGATGCGC CCTATGAGAT TCATGGTGGC ATGATCGGCT CGGGCGACCA TATGAACATG
GGCAACGCAA CGTACAATTG GGAAGCGCTC AACATGCTCG ACTTTCTGTT CGAGGCGAAT
CTGTGGCATT CCCCTACTTC AGGTGCGGCC GACATCCTCC TTCCGGTCTG CCATTGGACC
GAGATCAACG CGTATCGCAT TGCCCAAGGG GCATCCGGTG GGTTTGGCCT ATGCGTGAAA
GCTGTTGATC CTCCTGGCGA ATGCAAAAGC GATCCGTTGT ACTTCATGGA GCTGTCGAAG
TATTTCGGCG TCCCAGCGTT TGACGGCGAC GATCCGTGGC TCGAGAACAA ACCTGATGCC
GATCTCGAAA TCGAAAACCT CACGATTCAG TGCTGTGTCC AAGGATGCGC GCCATACAAC
AATTGGAACG AGTTGGAGGC TGCATTCCAA GAGCACGGTT GGTGGAACAT GAAACGCGAG
ATCCCGGAAG ACTGGGGCAC GTACCGTCGA TACGAGGTGG GGCAGGCGTA TCGGTTGGCT
CCCCATCAGC AACCGGCTCA GCTGAACATC AATAAACCGG GGTTCCCCAC ACCTACGATG
AAACATGAGT TTTGGTGCAC TTCCATCGAA TCGTTCTTCC CGGAGGGGGC GGATGGCCCT
GAGCTTGCAC CCGGGTTCAC TTCCGAAGCG CTGCCTTATT ATGCAGAGCC CGCACACGGC
CCTGTGGTCG ATGCGGAAAC GTACAAGGAG TATCCCATTA CCTGCATCAC TGGACGACGC
ATACCGGTGT ACTTCCACTC CGAGCACCGA CAGCTGCCCT GGTGTCGCGA GCTTTGGCCG
GTGCCTCGCA TGGAGATCAA TCCCGATACG GCTGCTGAAC TTGGACTCGA ACAAGGAGAT
TGGGCCTGGA TCGAGAGCCC TTGGGGCAAG GTGCGACAGA CGGTCGATCT CTATTACGGC
ATTAAGCCGA ACATGATCAA CGCCGAGCAT CAATGGTGGT ATCCGGAGTT GGCTCAAGCG
GACAAAGGAT ATGAGTTGTC ATGCATCAAT TGCATTACCG ATCGGAAGAC TCAGGATAAA
TACAATGGAT CGTCGAATGT GCGCACCTAT CCAGTGAAGG TGTACAAAGC CACGCCTGAG
AATTCCCCCT TTGGGAATCC TATCCCTTGC GGAAACGATG GGACTGAGAT CATTCATGAC
TCTTCTGATC CTCGCCTCAA GCTATGGGAA ATCGGCGCTG CTGGCATCCA TCCGGATCAC
TTCGAGTAG
 
Protein sequence
MGKTTVTRRA FAQLAAATGA IAAMGVGTRP AVALTDGNSG TAGERGIKKI RSCCRGCGKV 
ECGVWVYIQD GKVVRTEGDE TCFNTMGNHC SKGQASIQAA YHPDRIKFPM KRTNPKGSED
PGWVRISWDE AYRTIADNIM QLREKYGPES LFTWCGTGRQ WCMQSDAGMA LELFGTPNII
AAYQVCKGPR HFSSRLDNVQ AWSWSEVINH STKYVQWATD PSVSNYDDSS RLVIDVAREA
EAFIVVDPRL SNLGRTAKYW LNLRPGTDNA MALGWCHIIL KHDLVDWQFV KRWSNASFIV
VPDMDPSGYT EAVQNTKSPY EYRTRLLTEA DIDPSMVDWE IEGEGNPKRY LVYDQINRRW
TYWQADPEDA HWEGETWTKQ TSGFTQDVSR LRDDESKVAG WIADLSEFDP RIDPALTGEF
EVRLKDGSTH IGRPAWDLWA EYLQQFTPER VSEITGVDAQ LIEDAVVEWA TRDDPRIPNG
GINYGLGVEH AGNSTQNCRA IMAACAMVGA IDTPGGQRGA TNGWTEQSGP CAMLPSMAAF
AFMPTPDLSL KMAGNEKMPL LYWYGVWCDA NAAMECAHKE PDAPYEIHGG MIGSGDHMNM
GNATYNWEAL NMLDFLFEAN LWHSPTSGAA DILLPVCHWT EINAYRIAQG ASGGFGLCVK
AVDPPGECKS DPLYFMELSK YFGVPAFDGD DPWLENKPDA DLEIENLTIQ CCVQGCAPYN
NWNELEAAFQ EHGWWNMKRE IPEDWGTYRR YEVGQAYRLA PHQQPAQLNI NKPGFPTPTM
KHEFWCTSIE SFFPEGADGP ELAPGFTSEA LPYYAEPAHG PVVDAETYKE YPITCITGRR
IPVYFHSEHR QLPWCRELWP VPRMEINPDT AAELGLEQGD WAWIESPWGK VRQTVDLYYG
IKPNMINAEH QWWYPELAQA DKGYELSCIN CITDRKTQDK YNGSSNVRTY PVKVYKATPE
NSPFGNPIPC GNDGTEIIHD SSDPRLKLWE IGAAGIHPDH FE