Gene Elen_0497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0497 
Symbol 
ID8414781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp635502 
End bp638810 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content64% 
IMG OID645023468 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_003180871 
Protein GI257790265 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.455569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGC TCACGCTCAC GCGCCGCAGC TTCATGAAGG CGATGGCGGT CACCGGTGCC 
GCTGCGTCGC TCACGGCGGT CGCCGAGCCG ATGCGGGCGC TTGCCGAGGG CGTCGATGCC
GATGCGGGGG AAGTCAAGCG CGTTCGCTCG TGCTGCCGCG CTTGCGGCAA GGTGGAGTGC
GGCGTTTGGG TAACCGTGCG CGACGGCAAG GTCGTCAAGG TGGAAGGCGA CGAGTCCGCG
CCGCAAAGCC GCGGGCACTG CTGCTCGAAG TCGCAGTCAT CGATGCAGGC GCTATATCAT
CCCGACCGCC TTCGTTTCCC GGTGAAGCGC ACGAACCCTA AGGGCGAGGA CGATCCCGGT
TGGGTGCGCA TCACGCTGGA CGAGGCGTTC GAGATATGCG GCGAGAAGCT CAAGGAAGTG
AAGGAGAAGT ACGGCGGGGA GTCCATCTTC GTGATGTGCG GCACGTCGCG CGTGTGGTCG
CTCGGGCCTT ACCAGGGCAT GAAGCAGCTG TTCGGCACGC CGAACGCGCA TTTGGCCTAC
CAGGTGTGCA AAGGCCCGCG CCACTTCGGC GGCATCATGA CCGACGAGAT GGGGTCCCCG
TGGATGGAGG TGGAGGCCGA GCCCAGCGTG TACGTGCAGT GGGGCACGGC GTGCGAGTAC
TCCAACTACG ACTCCACGAA CCGCACGGTC ACCGACGTCG CGCATCGTGC GTCTAAGCAT
ATCGTGGTGG ATCCGCGCGT CACGCCTTTG GGCAAGGAAG CCGACATCTG GCTGCCGCTT
CGTCCCGGCA CCGACGGCGC GCTGGCGCTG TCGTGGCTCA ACTGGATCAT CGAAAACGAA
GCTTACGATG ACACGATGGT GCGTCGCTGG TCGAACGCGT CGTTCCTCTA CGTGGACGAC
AAGCCCGAGC TGACGCAAGG CTGGTTGGTG GAGGGCAACG GCGGCATCAA CATGAAGACG
AAGCTGCTGA CCGAGGCCGA CCTCAAGGAA GATGGCAAGT ACCAGCGCTT CATGGTGTGG
GACGAGGCGA ACGAGCGTCT CACGTACTGG GATGCCGAGC TTGGCATGTG GGAGGGCGAG
GAGCACCGCA TCCCCACCAC CGGCACATGG ATCGAGCATC CGTACAAGCC GCTTGTTGCC
GATGCATGGC TGCCCGATCA GTCGACGTTC GCCGATCCGG CTACCGAGCC CGACCGCTTC
CCCGACGGTT TCGAGGAATG CAATCCCAAG GGGTTGCCGA AGCGCCCATC TCTGCTTCCT
GGTGAGGTAG AGGTAACGCT CAAAGACGGA ACCGTGCACA AGGCTCGCTC TGTGTGGGAT
GCGTTCCATC AGATGACCAG CGAGTACACC ATGGAGAAGG CAGAGGAGAT CACCGGCGTG
CCGGCGGCGA AGAGCGAGGA GGCTGTACGC GCCTGGACGA CGCGCGTCAA TCCGCTGCAT
GGCAACGGCG GCATCCACTT CCAGCTGGCC ACCGACCAGA ACGGCAACTC CATCCAGAAC
GTGCGTGCGC TGCAGATCCT GTCGTGCATC ACCGGCAACT CCGACGAGCC GGCCGGCAAC
CGCGGCTCGT CGAAGGCGCA GTTCGACGGC AATCCAGGGC GTTCGAACAT GCAGGCCGGC
GCGCCGTACG GCGACGAGGC CAAGATGTGG GACGGCCGCG ACGTCGACCT CGAAGGGCTC
GCCGCGAAGA TGCAGGATTT CGTGCAGTAC CTCATCGACA ACGATTCGCC CCTGGCCGAG
CGCTACGGCA ACAAGGTGCC CAGCCATGAG GAAGCCCTCG TTATCGCCAA GCGCATGGGC
GGCGCATTCC GCACGTCGCA GGCGTGGCCG AACCCCAAGA CCGTGTTCGA GCGACAAGCC
AACGAGGTTG ACGCCGAGCG ATTCCCCCTC AACCGCTACT GGGCGCGCTG GGCGGACGCG
AACTCCATCT GGGACGGCTG CCTGGACAAC GACGTGCCGT ACCAGTTGCA TGCGGGCGTG
TGCCAGTCGG GCGACTTCAT GAATATGGGC AACATCGACG TGGCGTGGGA GGCCATGACG
AAGCTGGACT TCTTCACCGA CATCAACCTG TGGTTCTGCC CGAACAACGG CAACGCCGAC
GTCATCTTTC CGTGCTACCA CTGGTTGGAA GTTGACACGA CGCGCGTCAG CCAGGGCGCC
GGCGGGTTCT TCGGCTGCGG GTGCGCCGCT GTCGAGGCGC CGGGCGAGTG CATCTACGAT
CCCGACTGGA ACGTGGGCAT GTACAAGGCC ATGGGCGTGC CATGGAACAC GAAGGACGAG
TCGGCCGAGC CGTCCGAGAT CCTCAACTTC GGCGAGAAGA CCGACTACCG TTGGCCGAAC
CGCAGCCGCG TGCTCAAGGA CAACGTGGAC GCGTGGAAGA CAGCCGAGTT CCCCGACGGT
CCCACGTGGG AGCAAGCGAA GGAGCACTTC CAGAAGAACG GCTGGATGGA TTGCCGCACG
TGGCATCCCG AGCGTTGGGG CACGTATCGC CGGCACGAGA TGGGCTGGCG TCGCCAGCAG
GGCGGCTTCA ACCTGTTCCC GTTGGTGGAC GACCATCCGG GCTTCATGAC GCCGTCGGGC
CTCATTGAAA TCTGGTCGCT TGTGTGCGAG GCGTACTTAG GCGACGAGGA CAAGTTCCCC
GTGTACCGCG AACCCACGAA CTCGCCGGTG ACCACGCCCG AGTACTTCGA CGCCGCGAAG
GTTGCCGACA TCGATGAGTC GAAGCTGCGC AATATGGATT ATCCCGCCAG CCTGCAGGAG
CACGCCGATG CGACGTTCCT CATGACCACG GGCGCGCGCC AGCCGGTGTA CTTCCACAGC
GAGCATCGCC AGCTGCCGTG GTGCCGCGAG CTGTGGCCCG CGCCGCGCCT GGAGATGAAC
CCGAACGACG CTGCGCGTCT GGGGCTCGAG CAGGGCGACT GGGTGTGGAT CGAGAGCCCG
TGGGGCAAGG TGCGCGAGGT GCTGGATCTG TACTACGGCA TCAGCAAGGG CGTGGTGAAC
GCGAACCACG CGTGGTGGTT CCCCGAGATG GACACGGCCA GCCACGGCTA CGAGCTGGTG
AACATCAACT GCGTGATGGA TCCTTACGGC CAGGATGTGG TGTGCGGTGC AGCAACGATG
CGCTCGGTTC CCGTGCTGGT TTATAAGGCT ACGGCGGAAA ACTCGCCGTT CGGCAACCCG
GTGCCGTGCG ATCCGCAGGG TAACCCGTGC ATCTGCGATG CGGGCGATCC GCGGTTGAAG
GAATGGATGG GAACGGGATT GCGATCTCGC GTTGAAGGCG AAGAGGCGTG GACCGGAGAG
GGGGCGTAG
 
Protein sequence
MTQLTLTRRS FMKAMAVTGA AASLTAVAEP MRALAEGVDA DAGEVKRVRS CCRACGKVEC 
GVWVTVRDGK VVKVEGDESA PQSRGHCCSK SQSSMQALYH PDRLRFPVKR TNPKGEDDPG
WVRITLDEAF EICGEKLKEV KEKYGGESIF VMCGTSRVWS LGPYQGMKQL FGTPNAHLAY
QVCKGPRHFG GIMTDEMGSP WMEVEAEPSV YVQWGTACEY SNYDSTNRTV TDVAHRASKH
IVVDPRVTPL GKEADIWLPL RPGTDGALAL SWLNWIIENE AYDDTMVRRW SNASFLYVDD
KPELTQGWLV EGNGGINMKT KLLTEADLKE DGKYQRFMVW DEANERLTYW DAELGMWEGE
EHRIPTTGTW IEHPYKPLVA DAWLPDQSTF ADPATEPDRF PDGFEECNPK GLPKRPSLLP
GEVEVTLKDG TVHKARSVWD AFHQMTSEYT MEKAEEITGV PAAKSEEAVR AWTTRVNPLH
GNGGIHFQLA TDQNGNSIQN VRALQILSCI TGNSDEPAGN RGSSKAQFDG NPGRSNMQAG
APYGDEAKMW DGRDVDLEGL AAKMQDFVQY LIDNDSPLAE RYGNKVPSHE EALVIAKRMG
GAFRTSQAWP NPKTVFERQA NEVDAERFPL NRYWARWADA NSIWDGCLDN DVPYQLHAGV
CQSGDFMNMG NIDVAWEAMT KLDFFTDINL WFCPNNGNAD VIFPCYHWLE VDTTRVSQGA
GGFFGCGCAA VEAPGECIYD PDWNVGMYKA MGVPWNTKDE SAEPSEILNF GEKTDYRWPN
RSRVLKDNVD AWKTAEFPDG PTWEQAKEHF QKNGWMDCRT WHPERWGTYR RHEMGWRRQQ
GGFNLFPLVD DHPGFMTPSG LIEIWSLVCE AYLGDEDKFP VYREPTNSPV TTPEYFDAAK
VADIDESKLR NMDYPASLQE HADATFLMTT GARQPVYFHS EHRQLPWCRE LWPAPRLEMN
PNDAARLGLE QGDWVWIESP WGKVREVLDL YYGISKGVVN ANHAWWFPEM DTASHGYELV
NINCVMDPYG QDVVCGAATM RSVPVLVYKA TAENSPFGNP VPCDPQGNPC ICDAGDPRLK
EWMGTGLRSR VEGEEAWTGE GA