Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0511 |
Symbol | |
ID | 8414795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 657053 |
End bp | 660061 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 645023482 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_003180885 |
Protein GI | 257790279 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.3924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAGA CGACCGTGAC AAGGCGCGCG TTCGCTCAGC TTGCCGCCGC GACAGGCGCA ATAGCCGCTA TGGGTGTTGG AACTCGTCCT GCGGTGGCGC TTACCGATGG CAATTCGGGG ACGGCGGGCG AGCGGGGGAT CAAGAAGATA CGCTCGTGCT GCCGCGGCTG CGGAAAGGTC GAATGCGGTG TGTGGGTGTA CATCCAAGAT GGCAAAGTGG TTCGAACCGA AGGTGACGAA ACCTGCTTCA ACACTATGGG CAATCATTGC AGCAAGGGGC AAGCGTCCAT TCAGGCTGCG TATCATCCTG ATCGTATCAA GTTCCCAATG AAGCGCACGA ACCCGAAGGG CAGCGAAGAT CCAGGTTGGG TGAGGATCAG CTGGGACGAG GCCTACCGGA CCATTGCGGA CAATATTATG CAGTTGCGTG AGAAGTACGG TCCTGAAAGC TTGTTCACGT GGTGCGGTAC GGGCCGACAG TGGTGCATGC AGTCCGATGC TGGCATGGCG CTTGAGCTGT TCGGTACGCC GAACATCATC GCGGCGTACC AAGTGTGCAA GGGTCCGCGC CATTTTTCGT CGCGTCTCGA CAACGTTCAG GCGTGGTCTT GGAGTGAGGT GATAAACCAT TCGACGAAAT ACGTCCAATG GGCTACCGAT CCCTCCGTTT CGAATTACGA CGATTCGTCT CGTTTGGTAA TTGATGTGGC GCGCGAAGCT GAGGCTTTTA TCGTGGTTGA CCCTCGCCTG TCGAATCTTG GCCGCACTGC GAAGTATTGG TTAAACTTGC GACCTGGCAC CGATAATGCC ATGGCGCTAG GATGGTGCCA TATCATATTG AAGCACGATC TGGTGGATTG GCAGTTCGTG AAACGCTGGT CCAATGCCTC GTTCATCGTC GTGCCCGATA TGGACCCTTC GGGCTACACC GAGGCGGTTC AAAACACGAA AAGCCCCTAC GAATATCGTA CGCGTTTGCT GACCGAAGCC GACATAGATC CTTCTATGGT TGACTGGGAG ATTGAAGGCG AGGGGAATCC AAAACGTTAT CTTGTGTACG ATCAGATCAA TCGGCGCTGG ACATATTGGC AAGCCGATCC CGAAGACGCG CATTGGGAAG GCGAGACCTG GACAAAACAA ACCTCTGGAT TCACGCAAGA CGTTTCGCGT CTTCGCGACG ACGAATCGAA GGTGGCTGGC TGGATCGCGG ATCTGTCGGA ATTCGATCCG CGTATCGATC CGGCGCTCAC CGGGGAATTC GAAGTAAGGC TTAAGGACGG AAGTACGCAT ATCGGCCGTC CTGCATGGGA TTTGTGGGCT GAGTACCTGC AACAATTCAC TCCCGAACGA GTGTCGGAGA TCACGGGCGT GGATGCCCAG CTTATCGAAG ACGCCGTGGT TGAATGGGCA ACGCGCGATG ACCCGCGCAT ACCCAACGGC GGTATAAATT ACGGTCTGGG CGTGGAGCAT GCGGGCAACT CAACGCAGAA TTGCAGGGCC ATTATGGCGG CTTGCGCAAT GGTAGGCGCC ATCGACACGC CGGGCGGCCA GCGCGGTGCG ACGAACGGAT GGACCGAACA GTCGGGTCCG TGCGCCATGC TTCCGTCGAT GGCGGCATTC GCGTTCATGC CCACGCCCGA CTTGTCGCTC AAGATGGCCG GAAATGAGAA GATGCCGCTT CTATACTGGT ACGGTGTGTG GTGCGATGCC AATGCCGCCA TGGAATGCGC GCACAAGGAG CCGGATGCGC CCTATGAGAT TCATGGTGGC ATGATCGGCT CGGGCGACCA TATGAACATG GGCAACGCAA CGTACAATTG GGAAGCGCTC AACATGCTCG ACTTTCTGTT CGAGGCGAAT CTGTGGCATT CCCCTACTTC AGGTGCGGCC GACATCCTCC TTCCGGTCTG CCATTGGACC GAGATCAACG CGTATCGCAT TGCCCAAGGG GCATCCGGTG GGTTTGGCCT ATGCGTGAAA GCTGTTGATC CTCCTGGCGA ATGCAAAAGC GATCCGTTGT ACTTCATGGA GCTGTCGAAG TATTTCGGCG TCCCAGCGTT TGACGGCGAC GATCCGTGGC TCGAGAACAA ACCTGATGCC GATCTCGAAA TCGAAAACCT CACGATTCAG TGCTGTGTCC AAGGATGCGC GCCATACAAC AATTGGAACG AGTTGGAGGC TGCATTCCAA GAGCACGGTT GGTGGAACAT GAAACGCGAG ATCCCGGAAG ACTGGGGCAC GTACCGTCGA TACGAGGTGG GGCAGGCGTA TCGGTTGGCT CCCCATCAGC AACCGGCTCA GCTGAACATC AATAAACCGG GGTTCCCCAC ACCTACGATG AAACATGAGT TTTGGTGCAC TTCCATCGAA TCGTTCTTCC CGGAGGGGGC GGATGGCCCT GAGCTTGCAC CCGGGTTCAC TTCCGAAGCG CTGCCTTATT ATGCAGAGCC CGCACACGGC CCTGTGGTCG ATGCGGAAAC GTACAAGGAG TATCCCATTA CCTGCATCAC TGGACGACGC ATACCGGTGT ACTTCCACTC CGAGCACCGA CAGCTGCCCT GGTGTCGCGA GCTTTGGCCG GTGCCTCGCA TGGAGATCAA TCCCGATACG GCTGCTGAAC TTGGACTCGA ACAAGGAGAT TGGGCCTGGA TCGAGAGCCC TTGGGGCAAG GTGCGACAGA CGGTCGATCT CTATTACGGC ATTAAGCCGA ACATGATCAA CGCCGAGCAT CAATGGTGGT ATCCGGAGTT GGCTCAAGCG GACAAAGGAT ATGAGTTGTC ATGCATCAAT TGCATTACCG ATCGGAAGAC TCAGGATAAA TACAATGGAT CGTCGAATGT GCGCACCTAT CCAGTGAAGG TGTACAAAGC CACGCCTGAG AATTCCCCCT TTGGGAATCC TATCCCTTGC GGAAACGATG GGACTGAGAT CATTCATGAC TCTTCTGATC CTCGCCTCAA GCTATGGGAA ATCGGCGCTG CTGGCATCCA TCCGGATCAC TTCGAGTAG
|
Protein sequence | MGKTTVTRRA FAQLAAATGA IAAMGVGTRP AVALTDGNSG TAGERGIKKI RSCCRGCGKV ECGVWVYIQD GKVVRTEGDE TCFNTMGNHC SKGQASIQAA YHPDRIKFPM KRTNPKGSED PGWVRISWDE AYRTIADNIM QLREKYGPES LFTWCGTGRQ WCMQSDAGMA LELFGTPNII AAYQVCKGPR HFSSRLDNVQ AWSWSEVINH STKYVQWATD PSVSNYDDSS RLVIDVAREA EAFIVVDPRL SNLGRTAKYW LNLRPGTDNA MALGWCHIIL KHDLVDWQFV KRWSNASFIV VPDMDPSGYT EAVQNTKSPY EYRTRLLTEA DIDPSMVDWE IEGEGNPKRY LVYDQINRRW TYWQADPEDA HWEGETWTKQ TSGFTQDVSR LRDDESKVAG WIADLSEFDP RIDPALTGEF EVRLKDGSTH IGRPAWDLWA EYLQQFTPER VSEITGVDAQ LIEDAVVEWA TRDDPRIPNG GINYGLGVEH AGNSTQNCRA IMAACAMVGA IDTPGGQRGA TNGWTEQSGP CAMLPSMAAF AFMPTPDLSL KMAGNEKMPL LYWYGVWCDA NAAMECAHKE PDAPYEIHGG MIGSGDHMNM GNATYNWEAL NMLDFLFEAN LWHSPTSGAA DILLPVCHWT EINAYRIAQG ASGGFGLCVK AVDPPGECKS DPLYFMELSK YFGVPAFDGD DPWLENKPDA DLEIENLTIQ CCVQGCAPYN NWNELEAAFQ EHGWWNMKRE IPEDWGTYRR YEVGQAYRLA PHQQPAQLNI NKPGFPTPTM KHEFWCTSIE SFFPEGADGP ELAPGFTSEA LPYYAEPAHG PVVDAETYKE YPITCITGRR IPVYFHSEHR QLPWCRELWP VPRMEINPDT AAELGLEQGD WAWIESPWGK VRQTVDLYYG IKPNMINAEH QWWYPELAQA DKGYELSCIN CITDRKTQDK YNGSSNVRTY PVKVYKATPE NSPFGNPIPC GNDGTEIIHD SSDPRLKLWE IGAAGIHPDH FE
|
| |