Gene Information |
Locus tag | Elen_0501 |
Symbol | |
ID | 8414785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 642823 |
End bp | 645921 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023472 |
Product | molybdopterin dinucleotide-binding region |
Protein accession | YP_003180875 |
Protein GI | 257790269 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
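
The coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of the arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(642823, 645921)
print(length)                  # 3099
print(protein_length(length))  # 1032
```

Both values agree with the "Gene Length" and "Protein Length" rows of the record.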
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.638156 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAGC TCAATCTGAC GCGCCGTGCG TTCACGAAGC TCACGGCCGT GACGGGTGCG GCGTTGGCCT GCGCGGCAGC GGTCGCCCCG AACGCGGCGC TGGCCGAAGA TGCCGGAGCG GCCGCGCGCG GCGACGACGT GAAACGCGTG CGCACGTGCT GCCGCGGCTG CGGAAAGATG GAGTGCGGCG TGTGGGTGAC CGTGCAAAAC GGCCGCGCTA TCAAGGTGGA GGGCGACCAG TCGTCATTCC AGTCGAGCGG CAACTGCTGC GGCAAGTCGC AGTCGTCCAT CCAGGCGGCG TACCACCCCG ACCGCATCTA CCATCCCATG AAGCGCACGA ATCCCAAGGG CGAGGATCCC GGCTGGCAGC GCATCACGTG GGACGAGGCG ATGGAGATCG TCGGCACCAA GTTCAACGAG CTGATGGACC GCTACGGCGG ACAGTGCATC TTCAACATGG CGGGCACGTC GCGCCAGTGG GTGTACGGGC CCTACGCGTT CTACAAGTGG CTGTTCGACA CGCCGAACGC GCACGTGGCG TCGGAAATCT GCAAGGGGCC GCGCCGTTTG ATGGGGTGGA TCAGCTCGGT CGACGGCGCG CCGTGGATGG CGCTGCGCGA CGGGCCGCGC GTGTACGTGC AGTGGGGGAC CGCGCCCGAA AACTCGAACT ACGATGACAG CTGCCGCAAC CTCGTGGACA AGATGACCGA GGCCGACGTG CACATCTGCA TCGATCCGCG CCTGTCCGGC TCGGGCAAGG AGGCCGACTA CTGGCTGAAC CTGCGTCCCG GCTCCGACGG CGCGCTGGCG CTGTGCTGGC AGCACCTCGT CATCAAGAAC GACCTGGTGG ACTGGGAGTT CGTGAAGCGC TGGACGAACT CGTCGTTCCT CGTGGTGGAG GACCGGGAGC CCACGGGCGG CCGCTACATC GACCTGTCCA CGCCGCTCAA CAACGCCGGC ATCCCCGCCG ACGTCGTGGG CACGAAGCTC AAGACGCGCC TGCTGAAGGA AAGCGATGTG GTGGAGGGCG GCAGCCCGCG CAAGTTCTAC GCGTGGAACA AACTGGCCAA CGACGGCGCC GGCGGCTTGG TGATGTGGGA CGTCGACACC ACGCAATGGG AGGGCTGCAA CCACGTGGCC CCCACGCGCG ACCAGATGGA GGTCGTGTAC AAGGGCACGT CGCAGGAGGG CTATTTGCCG CCGCTGTCCT ATCACGAGCT GGAGGAAGCC GGCATCGACC TGGATATGCG TGGGACGCAT GAGGTGGAGC TGCTCGACGG TTCGAAGCAC ACGGCGAAGC CGGTGTGGGC GTACCTCGAG GAATCGGTGG CGGACTGCAC GCCCGCGTGG TGCTCCGAGA TCACGGGCCT CGATCCTGCG CTCATCGAAG AGGCGTGCCT CGTGTGGGCC ACGCGTCCCG AGGGGCAGGA TTACGGCAAC GGCGGTATCC ACCTGAACCT CGCGCCCGAC CAGATCGGCA ACTGCACGCA GACGGTGCGC GCGGTGCTGC ACCTCATCTA CATGACCGGC AACTTCGACA CGCCCGCCGG CAACCGCGGC CTCACGCGCT CGCCCATCGA CGAGCAGGCC ACGGCCGCGC CCGGCTCGAA CATGCCGCAG GAAGTGAAGG CCCAGCTGAT CGCCTTGGGC GAGATCCCCG TGGAAGGCGT CACGCCCGAT CCCCTCAACG TGCCCGACCG CTACGACACG CTGTCGAACA TGGTGGGCGC CGACGAGTTC CCCATCACGG CGTACTACAA CGAGTGGGCC GACGCGACGC GCATCTGGGA CGCCTGCCTC 
ACCGGCGAGC CGTACCCCGT GCGCGGCGGC ATCAACGAGT CCGGCTCGTT CATGAACATG TCGAACGCGA ACCTGGCCTG GGAGGCGTTG CAGTCGCTCG ATTTCTGGGT GGACATCAAC ATGTTCCACC ATCCCGGCAC CGAGATGGCC GACATCCTGC TGCCGTGCCA GCATTGGCTG GAGATCAACA ACATCCGCGT GTCGCAGGGC GCGTCCGGCG GTATCGGCGC CACCATCCGC GCGGTCGAGC CGCCCAGCGA CACGAAGTTC GACTACGACA TCAACCGCCT GCTGTTCGAC GCCGTGGGCG GCCCGAACGG AACCTGGACC AACATCGCGG GCGACGCGCC CGGCGGCTAC CACGTGGACG AGCGCTTGGA GGACTGGTTC CAGAACAACT CGAAGACCAA TCCCAAGGTG AAATGGCAGC ATTGGGACGA CTTCGTGGAG GACTTTCAGG AGAACGGCTG GATCAACGCC AAGGAGATCG AGCCCGACCG CTGGGGCACG TACCGCCGCT TCGAGACCGG CTGGATGCGC ATCGGCAAGG ACGCGTGCAC CGGCTCCACG TTCAGCGCGG CGTTCGACGA CGCCGGCAAT CCGGTGAACA ACTTCGGCTG CCCCACGCCG ACGGGCCTCG TGGAATTTTG GCCGCTCGTG TTCGAGACGT ACTGCGTGGA CAAGGCGAAC GAGTTCAACC CCGGCAAGTT CGACCTGGTG CACGAGATGA TGCCGCACTA CGACGAGCCG AAATCGGGCC CCAAGGGCGA CGTGGACATG AACGAGTATC CCATTATCCT CACCACCGGC CGCCGCATAC CCGTGTACTT CCATTCCGAG CACCGGCAGC TGCCGTGGTG TCGCGAGCTG TGGCCGGCAC CGCGTTTGGA GGTGAACCCC GAAGACGCTG CCGAGCTGGG GCTCGAGCAG GGCGATTGGG CCTGGATCGA GACGGAGTGG GGCAAGGTGC GCCAGTGCGT CGATTTGTAC TACGGCATCG CGAAGGGGTG GGCGAACGCC GAGCACGCCT GGTGGTTCCC CGAGCTGCCC GCGCCGACGC ACGGGTTCGA GCTGTCGAAC ATCGAGTGCA TCTGGAACCC CTACGGCCAG GATCCGTTCA TCGGGTCGTC CCACATGCGC GGCGTGCCCG TGAAGATATA CAAGGCCACG CCCGAGAACT GCCCCGACGG CAAGGTCATC CCCTGCGCGC CCGAGGACGG CACCGAGATC ATCTACGATG CCTCCGACCC GCGCCTGAAG GAATGGCTGC CGAACTACAC CATCCGAGAG GAGGCGTAA
|
Protein sequence | MGKLNLTRRA FTKLTAVTGA ALACAAAVAP NAALAEDAGA AARGDDVKRV RTCCRGCGKM ECGVWVTVQN GRAIKVEGDQ SSFQSSGNCC GKSQSSIQAA YHPDRIYHPM KRTNPKGEDP GWQRITWDEA MEIVGTKFNE LMDRYGGQCI FNMAGTSRQW VYGPYAFYKW LFDTPNAHVA SEICKGPRRL MGWISSVDGA PWMALRDGPR VYVQWGTAPE NSNYDDSCRN LVDKMTEADV HICIDPRLSG SGKEADYWLN LRPGSDGALA LCWQHLVIKN DLVDWEFVKR WTNSSFLVVE DREPTGGRYI DLSTPLNNAG IPADVVGTKL KTRLLKESDV VEGGSPRKFY AWNKLANDGA GGLVMWDVDT TQWEGCNHVA PTRDQMEVVY KGTSQEGYLP PLSYHELEEA GIDLDMRGTH EVELLDGSKH TAKPVWAYLE ESVADCTPAW CSEITGLDPA LIEEACLVWA TRPEGQDYGN GGIHLNLAPD QIGNCTQTVR AVLHLIYMTG NFDTPAGNRG LTRSPIDEQA TAAPGSNMPQ EVKAQLIALG EIPVEGVTPD PLNVPDRYDT LSNMVGADEF PITAYYNEWA DATRIWDACL TGEPYPVRGG INESGSFMNM SNANLAWEAL QSLDFWVDIN MFHHPGTEMA DILLPCQHWL EINNIRVSQG ASGGIGATIR AVEPPSDTKF DYDINRLLFD AVGGPNGTWT NIAGDAPGGY HVDERLEDWF QNNSKTNPKV KWQHWDDFVE DFQENGWINA KEIEPDRWGT YRRFETGWMR IGKDACTGST FSAAFDDAGN PVNNFGCPTP TGLVEFWPLV FETYCVDKAN EFNPGKFDLV HEMMPHYDEP KSGPKGDVDM NEYPIILTTG RRIPVYFHSE HRQLPWCREL WPAPRLEVNP EDAAELGLEQ GDWAWIETEW GKVRQCVDLY YGIAKGWANA EHAWWFPELP APTHGFELSN IECIWNPYGQ DPFIGSSHMR GVPVKIYKAT PENCPDGKVI PCAPEDGTEI IYDASDPRLK EWLPNYTIRE EA
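
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it builds a codon table using the amino-acid assignments of NCBI translation table 11 (which match the standard code; the tables differ only in permitted start codons), and the function name `translate` is hypothetical. It translates only the first three 10-base blocks of the gene sequence shown above.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
first_blocks = "ATGGGAAAGC TCAATCTGAC GCGCCGTGCG"
print(translate(first_blocks))  # MGKLNLTRRA
```

The output matches the first ten residues of the protein sequence listed in the record.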