Gene Elen_0507 details


Gene Information

Locus tag: Elen_0507
Symbol:
ID: 8414791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 651274
End bp: 654132
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 63%
IMG OID: 645023478
Product: molydopterin dinucleotide-binding region
Protein accession: YP_003180881
Protein GI: 257790275
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
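
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(651274, 654132)
print(length_bp)                  # 2859
print(protein_length(length_bp))  # 952
```

Both values agree with the Gene Length and Protein Length fields of this record.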


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.485596
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.607063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAACA CCACGCAAAG GCGGTCGTTC GGCACCATGA GCCGTCGCAG CTTCATGAGG 
CTCGCAGGGG TGACGAGCGC TGCGCTTGCG CTGACGTCGG CAACGGCGCC GGCGGCGCTT
GCCGAAGAGC ACGACAGCGG AACGGTTGCG TCGTCGGACG GCGTGCAGCG TATTCGCACG
ATGTGCCGCG GGTGCGGCAA GATGGAATGC GGCGTGTGGG TGACGGTGGA GAACGGCCGC
GCCATCAAGA TCGAGGGGGA CGAGAGCTCG TTCGCGTCGT CGGGGAACAG CTGCAGCAAG
TCGCAGGCTT CGCTGCAGGC GTGCTACCAT CCCGATCGGC TCGCGTATCC CATGAAACGC
ACGAATCCCA AGGGCGACGA CGATCCGGGT TGGGTGCGCA TCAGCTGGGA CGAGGCTCTC
GCCGAAGCCG GCACGAAGCT CAACGAGATC AAGGAGCAGC GCGGCGGGAA CTCCATGTTC
TCGATGTGCG GCACCAGCCG CATCTACTGC ATGGCGAGCG CGCTCGGCAT GCAGGGCATT
CTGAACACGG CGAACACCCA TCAGGCGTAC CAGATTTGCA AGGGTCCCCG CCATGTGGCC
ACCGGTATGG TGTCGGCTCG TGCGTACAGC TGGATGGCCA CGGTCGACCG GCCGAGCGTG
TTCGTGCAAT GGGGCGGCGC TTCGGAGCTG TCCAACTACG ATGACTCGTG TCGCACGACG
GTCGATGCTG CGGTCAAGGC GGACAAGCAC ATCATCGTCG ATCCGCGTCA GACGAACCTC
GGCAAAGAAG CCGACATATG GAATCCGCTG CGTCCCGGCA CCGACGGCGC GGTGGGGCTC
GGCTGGCTCA ACGTGATCAT GGAGAACAAC CTGTACGACG AGCTTTGGGT GAAGCGGTGG
ACGAACGGCC CGTTCCTCGT GTGCGAGGAT ATCGAGCCTT CCGGTTGGCA GCAGATGGGT
GCCGGCGGTC CAGAAGAAAT CAAAACGCGC CTGCTCAAGG AATCCGATGT GCAGGAAGAC
GGCAGTCCGA AACGGTTCAT GGTCTACGAT CAGTTGAACC AGCGGCTTAC GTATTTCGAT
GCGGATACGG GATACTGGGA GGGCGAGCAG CCGCGTACGC TGACGGGGAA GGAGGCGCGG
CAAAAGCACC TCGCACCCGG CGTGACGCAA GGTTGGGTGC CCGATCCTAC CGGGTTCGAT
CCGGAGATCG ATCCTCAGAT ATTCGGGCAG GTGGAAGTCA CCCTGAAAGA CGCTTCGACC
TCGGTCTGCA AAACGGTGTG GCAAACGTTC TCGGACTACG TAGCCGATTT CACGCCCGAG
AAAGTGGAGG AGATCACCAG CGTTTCGGCT GATGCTTTGC GCGAGGCGGC CATCACCTAT
GCGACGCCTA TCGATCCATC GACCGGATAC GGCAACGGCG GCATCCAGTA CATGCTGGCC
ATCGAGCATG CGTGCAACTC GGTCCAGAAC AGCCGTATCT GCGATCTCAT CGTGGGGATC
ACGGGGAACT TCGACACCCC CGGCGGCAAT CGCGGCGCAA CGGCCGCGAC TTTCGACGAA
GAGTTCGCCA TGATGGGCAG CGGTCTGCCC ATGGCGTCGG CTGACTTGTG GGACAAGGTG
CTGGGCGTGG AGGACATTCC TTTGCTCAAG CATCACGGCA TCTGGGCCGA TTCGACGGCC
ATTTGGGATG CGTGCAACAA CGAAGGGGCG CCGTACCCGT TGTACGGCGG CGTCTGCCAG
TCGGGCGATG TGATGAACAT GTCCAACGCT CTTTGGGGTT GGGAAGGCTT GAAGAAGCTG
GACTTCCTGC TGGACATCGA CCTGTGGCAT ACGCCCACGT CGCAGCTGGC GGATATCCTG
CTGCCTGCGC GCCATTGGCT CGAGGTGGAT TGCCCTCGCC GCTCCCAAGG GTCTGGCGGC
ATGGAGGGAA GCCATTGCAA GTGCGTGGAG CCGCTCGGCG AGAGCTGGTT CGACGTGGAC
ATCATCATCC AGCTGTGCAA GGCCATGGGC ATTCCCTGGA GCGCCGACCC CGACGATCCG
TGGCCGGACT CCATCAAGGA GCTTGATGCG GCATGCGAGC CGATGGGTCT TACCTGGGAG
GAGTGGAAGC AGGAGTTTCA GAAGACCGGT TTCCGCGACT GCAAGAAGGA ATACCCCGAC
GACTGGGGCA CTTACCGACG CTACGAAACG GGCCACTGCC GCTCCGATGG CAAGCCGGGC
TTGCAGACGC CCACGCTCAA GCAGGAGATA TGGTCGACCA TCATCGAGAC GTACCATCCT
GACGGCCGGT ACAACCTTCC CACGTATTCC GAGCCCCCGG AGAGTCCCGT TGCGCAGCCT
GAGCGGGCTC AGGAGTACCC CTACATCATG ACGACGGGCC GTCGCATTCC CGTGTACTTC
CACTCCGAGC ACCGGCAGCT GCCGTGGTGC CGCGAGCTGT GGCCGGTGCC GCGCGTAGAG
ATCAACCCGA AGGATGCGCT TGAGCTCGGC ATAGAGCAGG GCGATTGGGT GTGGATCGAG
ACAGAGCGCG GCAAGGTGCG ACAAGTGGCG GATCTCTATC ACGGCATCCG CCCGGGGACC
ATCAACTGCG AGCATCAGTG GTGGCTGCCC GAGTTCCACG GCGCGACGAA GGGTTTCGAC
CTCATCAGCA TCAACTGCCT GGTGAACAAG GACATGCGCG ATCCTCTGTG CGGATCTTCG
TACGCGCGCG CTTACAACGT GAAGGTGTAC AAAGCCACGC CCGAGAACTC GCCGTTCGGC
AATCCCGTGC CGTGCGACGT CGACGGGACC GAGATGATCA CGTCGCCCGA TGACCCGCGT
TTGAAGGAAT GGCTGCCGAA CTACGAGGGG AGGGACTAG
 
Protein sequence
MANTTQRRSF GTMSRRSFMR LAGVTSAALA LTSATAPAAL AEEHDSGTVA SSDGVQRIRT 
MCRGCGKMEC GVWVTVENGR AIKIEGDESS FASSGNSCSK SQASLQACYH PDRLAYPMKR
TNPKGDDDPG WVRISWDEAL AEAGTKLNEI KEQRGGNSMF SMCGTSRIYC MASALGMQGI
LNTANTHQAY QICKGPRHVA TGMVSARAYS WMATVDRPSV FVQWGGASEL SNYDDSCRTT
VDAAVKADKH IIVDPRQTNL GKEADIWNPL RPGTDGAVGL GWLNVIMENN LYDELWVKRW
TNGPFLVCED IEPSGWQQMG AGGPEEIKTR LLKESDVQED GSPKRFMVYD QLNQRLTYFD
ADTGYWEGEQ PRTLTGKEAR QKHLAPGVTQ GWVPDPTGFD PEIDPQIFGQ VEVTLKDAST
SVCKTVWQTF SDYVADFTPE KVEEITSVSA DALREAAITY ATPIDPSTGY GNGGIQYMLA
IEHACNSVQN SRICDLIVGI TGNFDTPGGN RGATAATFDE EFAMMGSGLP MASADLWDKV
LGVEDIPLLK HHGIWADSTA IWDACNNEGA PYPLYGGVCQ SGDVMNMSNA LWGWEGLKKL
DFLLDIDLWH TPTSQLADIL LPARHWLEVD CPRRSQGSGG MEGSHCKCVE PLGESWFDVD
IIIQLCKAMG IPWSADPDDP WPDSIKELDA ACEPMGLTWE EWKQEFQKTG FRDCKKEYPD
DWGTYRRYET GHCRSDGKPG LQTPTLKQEI WSTIIETYHP DGRYNLPTYS EPPESPVAQP
ERAQEYPYIM TTGRRIPVYF HSEHRQLPWC RELWPVPRVE INPKDALELG IEQGDWVWIE
TERGKVRQVA DLYHGIRPGT INCEHQWWLP EFHGATKGFD LISINCLVNK DMRDPLCGSS
YARAYNVKVY KATPENSPFG NPVPCDVDGT EMITSPDDPR LKEWLPNYEG RD
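
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the usual codon assignments (translation table 11 assigns the same amino acids to codons as the standard table and differs in the permitted start codons). The `translate` helper and `CODON_TABLE` name are illustrative only; whitespace between the 10-base groups is ignored.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove spaces and line breaks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGCGAACA CCACGCAAAG GCGGTCGTTC"))  # MANTTQRRSF
```

The output matches the first ten residues of the protein sequence listed above.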