Gene Elen_0501 details


Gene Information

Locus tag: Elen_0501
Symbol:
ID: 8414785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 642823
End bp: 645921
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 66%
IMG OID: 645023472
Product: molydopterin dinucleotide-binding region
Protein accession: YP_003180875
Protein GI: 257790269
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
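
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the unspliced CDS ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(642823, 645921)
print(length, protein_length_aa(length))  # prints: 3099 1032
```

Both values match the Gene Length and Protein Length fields of this record.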


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.638156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAGC TCAATCTGAC GCGCCGTGCG TTCACGAAGC TCACGGCCGT GACGGGTGCG 
GCGTTGGCCT GCGCGGCAGC GGTCGCCCCG AACGCGGCGC TGGCCGAAGA TGCCGGAGCG
GCCGCGCGCG GCGACGACGT GAAACGCGTG CGCACGTGCT GCCGCGGCTG CGGAAAGATG
GAGTGCGGCG TGTGGGTGAC CGTGCAAAAC GGCCGCGCTA TCAAGGTGGA GGGCGACCAG
TCGTCATTCC AGTCGAGCGG CAACTGCTGC GGCAAGTCGC AGTCGTCCAT CCAGGCGGCG
TACCACCCCG ACCGCATCTA CCATCCCATG AAGCGCACGA ATCCCAAGGG CGAGGATCCC
GGCTGGCAGC GCATCACGTG GGACGAGGCG ATGGAGATCG TCGGCACCAA GTTCAACGAG
CTGATGGACC GCTACGGCGG ACAGTGCATC TTCAACATGG CGGGCACGTC GCGCCAGTGG
GTGTACGGGC CCTACGCGTT CTACAAGTGG CTGTTCGACA CGCCGAACGC GCACGTGGCG
TCGGAAATCT GCAAGGGGCC GCGCCGTTTG ATGGGGTGGA TCAGCTCGGT CGACGGCGCG
CCGTGGATGG CGCTGCGCGA CGGGCCGCGC GTGTACGTGC AGTGGGGGAC CGCGCCCGAA
AACTCGAACT ACGATGACAG CTGCCGCAAC CTCGTGGACA AGATGACCGA GGCCGACGTG
CACATCTGCA TCGATCCGCG CCTGTCCGGC TCGGGCAAGG AGGCCGACTA CTGGCTGAAC
CTGCGTCCCG GCTCCGACGG CGCGCTGGCG CTGTGCTGGC AGCACCTCGT CATCAAGAAC
GACCTGGTGG ACTGGGAGTT CGTGAAGCGC TGGACGAACT CGTCGTTCCT CGTGGTGGAG
GACCGGGAGC CCACGGGCGG CCGCTACATC GACCTGTCCA CGCCGCTCAA CAACGCCGGC
ATCCCCGCCG ACGTCGTGGG CACGAAGCTC AAGACGCGCC TGCTGAAGGA AAGCGATGTG
GTGGAGGGCG GCAGCCCGCG CAAGTTCTAC GCGTGGAACA AACTGGCCAA CGACGGCGCC
GGCGGCTTGG TGATGTGGGA CGTCGACACC ACGCAATGGG AGGGCTGCAA CCACGTGGCC
CCCACGCGCG ACCAGATGGA GGTCGTGTAC AAGGGCACGT CGCAGGAGGG CTATTTGCCG
CCGCTGTCCT ATCACGAGCT GGAGGAAGCC GGCATCGACC TGGATATGCG TGGGACGCAT
GAGGTGGAGC TGCTCGACGG TTCGAAGCAC ACGGCGAAGC CGGTGTGGGC GTACCTCGAG
GAATCGGTGG CGGACTGCAC GCCCGCGTGG TGCTCCGAGA TCACGGGCCT CGATCCTGCG
CTCATCGAAG AGGCGTGCCT CGTGTGGGCC ACGCGTCCCG AGGGGCAGGA TTACGGCAAC
GGCGGTATCC ACCTGAACCT CGCGCCCGAC CAGATCGGCA ACTGCACGCA GACGGTGCGC
GCGGTGCTGC ACCTCATCTA CATGACCGGC AACTTCGACA CGCCCGCCGG CAACCGCGGC
CTCACGCGCT CGCCCATCGA CGAGCAGGCC ACGGCCGCGC CCGGCTCGAA CATGCCGCAG
GAAGTGAAGG CCCAGCTGAT CGCCTTGGGC GAGATCCCCG TGGAAGGCGT CACGCCCGAT
CCCCTCAACG TGCCCGACCG CTACGACACG CTGTCGAACA TGGTGGGCGC CGACGAGTTC
CCCATCACGG CGTACTACAA CGAGTGGGCC GACGCGACGC GCATCTGGGA CGCCTGCCTC
ACCGGCGAGC CGTACCCCGT GCGCGGCGGC ATCAACGAGT CCGGCTCGTT CATGAACATG
TCGAACGCGA ACCTGGCCTG GGAGGCGTTG CAGTCGCTCG ATTTCTGGGT GGACATCAAC
ATGTTCCACC ATCCCGGCAC CGAGATGGCC GACATCCTGC TGCCGTGCCA GCATTGGCTG
GAGATCAACA ACATCCGCGT GTCGCAGGGC GCGTCCGGCG GTATCGGCGC CACCATCCGC
GCGGTCGAGC CGCCCAGCGA CACGAAGTTC GACTACGACA TCAACCGCCT GCTGTTCGAC
GCCGTGGGCG GCCCGAACGG AACCTGGACC AACATCGCGG GCGACGCGCC CGGCGGCTAC
CACGTGGACG AGCGCTTGGA GGACTGGTTC CAGAACAACT CGAAGACCAA TCCCAAGGTG
AAATGGCAGC ATTGGGACGA CTTCGTGGAG GACTTTCAGG AGAACGGCTG GATCAACGCC
AAGGAGATCG AGCCCGACCG CTGGGGCACG TACCGCCGCT TCGAGACCGG CTGGATGCGC
ATCGGCAAGG ACGCGTGCAC CGGCTCCACG TTCAGCGCGG CGTTCGACGA CGCCGGCAAT
CCGGTGAACA ACTTCGGCTG CCCCACGCCG ACGGGCCTCG TGGAATTTTG GCCGCTCGTG
TTCGAGACGT ACTGCGTGGA CAAGGCGAAC GAGTTCAACC CCGGCAAGTT CGACCTGGTG
CACGAGATGA TGCCGCACTA CGACGAGCCG AAATCGGGCC CCAAGGGCGA CGTGGACATG
AACGAGTATC CCATTATCCT CACCACCGGC CGCCGCATAC CCGTGTACTT CCATTCCGAG
CACCGGCAGC TGCCGTGGTG TCGCGAGCTG TGGCCGGCAC CGCGTTTGGA GGTGAACCCC
GAAGACGCTG CCGAGCTGGG GCTCGAGCAG GGCGATTGGG CCTGGATCGA GACGGAGTGG
GGCAAGGTGC GCCAGTGCGT CGATTTGTAC TACGGCATCG CGAAGGGGTG GGCGAACGCC
GAGCACGCCT GGTGGTTCCC CGAGCTGCCC GCGCCGACGC ACGGGTTCGA GCTGTCGAAC
ATCGAGTGCA TCTGGAACCC CTACGGCCAG GATCCGTTCA TCGGGTCGTC CCACATGCGC
GGCGTGCCCG TGAAGATATA CAAGGCCACG CCCGAGAACT GCCCCGACGG CAAGGTCATC
CCCTGCGCGC CCGAGGACGG CACCGAGATC ATCTACGATG CCTCCGACCC GCGCCTGAAG
GAATGGCTGC CGAACTACAC CATCCGAGAG GAGGCGTAA
 
Protein sequence
MGKLNLTRRA FTKLTAVTGA ALACAAAVAP NAALAEDAGA AARGDDVKRV RTCCRGCGKM 
ECGVWVTVQN GRAIKVEGDQ SSFQSSGNCC GKSQSSIQAA YHPDRIYHPM KRTNPKGEDP
GWQRITWDEA MEIVGTKFNE LMDRYGGQCI FNMAGTSRQW VYGPYAFYKW LFDTPNAHVA
SEICKGPRRL MGWISSVDGA PWMALRDGPR VYVQWGTAPE NSNYDDSCRN LVDKMTEADV
HICIDPRLSG SGKEADYWLN LRPGSDGALA LCWQHLVIKN DLVDWEFVKR WTNSSFLVVE
DREPTGGRYI DLSTPLNNAG IPADVVGTKL KTRLLKESDV VEGGSPRKFY AWNKLANDGA
GGLVMWDVDT TQWEGCNHVA PTRDQMEVVY KGTSQEGYLP PLSYHELEEA GIDLDMRGTH
EVELLDGSKH TAKPVWAYLE ESVADCTPAW CSEITGLDPA LIEEACLVWA TRPEGQDYGN
GGIHLNLAPD QIGNCTQTVR AVLHLIYMTG NFDTPAGNRG LTRSPIDEQA TAAPGSNMPQ
EVKAQLIALG EIPVEGVTPD PLNVPDRYDT LSNMVGADEF PITAYYNEWA DATRIWDACL
TGEPYPVRGG INESGSFMNM SNANLAWEAL QSLDFWVDIN MFHHPGTEMA DILLPCQHWL
EINNIRVSQG ASGGIGATIR AVEPPSDTKF DYDINRLLFD AVGGPNGTWT NIAGDAPGGY
HVDERLEDWF QNNSKTNPKV KWQHWDDFVE DFQENGWINA KEIEPDRWGT YRRFETGWMR
IGKDACTGST FSAAFDDAGN PVNNFGCPTP TGLVEFWPLV FETYCVDKAN EFNPGKFDLV
HEMMPHYDEP KSGPKGDVDM NEYPIILTTG RRIPVYFHSE HRQLPWCREL WPAPRLEVNP
EDAAELGLEQ GDWAWIETEW GKVRQCVDLY YGIAKGWANA EHAWWFPELP APTHGFELSN
IECIWNPYGQ DPFIGSSHMR GVPVKIYKAT PENCPDGKVI PCAPEDGTEI IYDASDPRLK
EWLPNYTIRE EA
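
As a consistency check, the protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal, standard-library-only illustration and not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in the permitted start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 bases and the final line of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon-to-residue assignments shared by the standard code and table 11,
# listed in the conventional TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_30_bases = "ATGGGAAAGC TCAATCTGAC GCGCCGTGCG"
last_line = "GAATGGCTGC CGAACTACAC CATCCGAGAG GAGGCGTAA"
print(translate(first_30_bases))  # prints: MGKLNLTRRA
print(translate(last_line))       # prints: EWLPNYTIREEA
```

The two outputs match the first ten and last twelve residues of the protein sequence above. The final gene line is in frame because the 3060 bases before it are a multiple of three.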