Gene Elen_1006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1006 
Symbol 
ID8415296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1216662 
End bp1218947 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content68% 
IMG OID645023970 
Productmolybdopterin oxidoreductase 
Protein accessionYP_003181367 
Protein GI257790761 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.249803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGA TTAACGACAC CACGATGCAG GGCAACATCA GCCGCCGCTC GTTCGTGAAG 
GGCGCGTCCG CCCTGGCCGC CGGGGGCGCG CTGGCCGGCG CGTTCGGCTT CGACATCGCG
CACGCCGAGG GCACGGTGGA CCCCGACGCC CCCGTCGAGA AGCGCTACAC GTACTGCGAC
ATGTGCAACC AGGTGCCCAA GTGCGGCATG ACCGCCTACG TGCAGGACGG CAAGATCGTG
CGCGTCGAGT CGCGCACGCC CCATCCCACC ACGCCGCTGT GCGCGAAGGG CCTCGCCAGC
ATCCAGGAGC TCTACGATCC CAAGCGCCTG CAAACGCCGC TGCGCCGCAC GAACCCCAAG
GGCACGGGGC AGTCGCAGTG GGAGCCCATC ACCTGGGACG AAGCCTACGA CGCCATCGTC
AGCGAGTTCA ACCGCGTCAA GGAGGAGGAC GGCCCCGACG CCGTGATGTT CTACTGCGGC
GACCCGAAGG AGCCGCGCCC GCCCATCCAG CGCGTGGCCA CGCTGTTCGG CAGCTCCACG
TACGGCCTCG AAAGCTCGCT GTGCTCGACG GCCACGAACA TCACCTCCCA GCTCGTGTAC
GGCCGCGGGC AGAGCTCGTC GGGCTCCGAC CCCAGCGACG ACACGGGCAG CTGCATGATC
TGGAGCCTCA ACGCGGCCTG GTCGCAGCCG AACCGCCACG CGAAGCTCAT GGACCAGAAG
GAGCGCGGCT GCAAGTTCGT CATCGTCGAT CCGCGCATCA CGCCCACCGT CATGGGGCTC
GCCGACGTCC ACCTGCAGCT GCGGCCCGGC TCCGACGGCG CGCTGGCCCT CGGGTTCATC
AACATCCTCG TCCGCGACAA CCTTGTGGAC AAGCAGTTCG TCGACGAGTG GACGCACGGC
TACGAAGGGC TCGCCGACCT GGCCGCCCAG TACCCGCCCG AGAAGGTCGA GGAGATCACC
TGGGTGCCCG CGGCCAAGCT GGAAGAGGCC GTGCGCCTGC TGGCCGACAA CGCGCCCAGC
GCGCTGGTCA CCAGCTCGGC GGGGCTCGCC CACTCCTCCA ACGTGGGCCA CGCGCTGCGC
GCCGTGTTCA TGATCCCCGC CCTCATGGGC ATGATCGAGA AGAAGGGCGG CGTCATGTTC
GCCTCGGGCG GCCTGCCGCT CGACGTGAGC GCCTCCACTG CCAAGTTCCG CGCCGAGGAC
GTCTACACCG AGCAGAACTT CGCCGACAAG CGCGTGGACA AGGACGATTT CCCCGCCTGG
GTGTCGTTCA CCAAGCACTT CCAGACGGCG CGGCTGCCCG AGTACATCGA CGAGGGGAAG
ATCAAGGCCG CCATCCTCGT GGCCAGCAAC GTGATGATCT GGCCCGAATC GGACCGCTAC
CAGGAAGCCC TCGGCAAGCT GGAGTTCGTG GCCGCCGTGG ACTACTACGA GCGCCCCTGG
ACCCACGACT ACGTGGACAT CCTGCTGCCC GCCGCCATGT GCCACGAGCG CATGGCCCCC
TTCGCCGCCT ACGGCCGCAA GCTGTTCTTC CGCGAGCCGT GCGTCCAGCC GGCCGGGCAG
GCGCGCGAGG ATTGGAAGAT CATGCTCGAC CTGGGATGCA AGCTGGGCTT CGAGGAGCAG
TGCTTCGGCG GCGACGTGGA AGCGGCCCTC GACAACATCC TGCAGACGGC GGGCCTCGAC
GTCACGCTCG ACGACCTGCG CGCGAACCCC GAGGGCCTGG AGATCCCCGG ATCGCCGAAC
GAGGAGGGCA AGCACGCTGC GGGCAAGCTG CGCAAGGACG GGCAGCCCGG CTTCAACACG
CCGTCCGGCA AGCTGGAGTT CGACTCCGAG ATCCTCAAGG GCTTCGGCTA CGAGGGGCTT
CCCGTGTACG AGGAGCCGGT GCACACCCCC TACGCGCCCA CCGAGGAGGA CAAGCGCTAC
CCGCTCGTGC TGAACGCGGG GTCGCGCCTG CCCTACTACA CGCACTCCAA GCTGCGCGAG
ATCCCCTGGC TCAACCAGTT CATGCCCGAG CCCGTCGTGC GCCTGCACCC CCAGGACGCG
CGCGACCGCT CCATCGGCAC CGGCGACGTG GTGCGCGTGT TCAACCACCA GAACGAGATC
GAGATGAAGG CCGAGGTGAC GAACCTCGTG CACGCCGGCA TGGTCGACAT CTTCCACGGA
TGGCACCAGG CGAACGTGAA TCTGCTGACC ACGCGCGATT TCGATCCCAT CACCGGGTTC
CCGCCGTTCA GGTGCGGCCT GTGCGAGGTC GAGGCCACCG GCAAGGGACG CCTCGTCAGC
CAGTAG
 
Protein sequence
MTTINDTTMQ GNISRRSFVK GASALAAGGA LAGAFGFDIA HAEGTVDPDA PVEKRYTYCD 
MCNQVPKCGM TAYVQDGKIV RVESRTPHPT TPLCAKGLAS IQELYDPKRL QTPLRRTNPK
GTGQSQWEPI TWDEAYDAIV SEFNRVKEED GPDAVMFYCG DPKEPRPPIQ RVATLFGSST
YGLESSLCST ATNITSQLVY GRGQSSSGSD PSDDTGSCMI WSLNAAWSQP NRHAKLMDQK
ERGCKFVIVD PRITPTVMGL ADVHLQLRPG SDGALALGFI NILVRDNLVD KQFVDEWTHG
YEGLADLAAQ YPPEKVEEIT WVPAAKLEEA VRLLADNAPS ALVTSSAGLA HSSNVGHALR
AVFMIPALMG MIEKKGGVMF ASGGLPLDVS ASTAKFRAED VYTEQNFADK RVDKDDFPAW
VSFTKHFQTA RLPEYIDEGK IKAAILVASN VMIWPESDRY QEALGKLEFV AAVDYYERPW
THDYVDILLP AAMCHERMAP FAAYGRKLFF REPCVQPAGQ AREDWKIMLD LGCKLGFEEQ
CFGGDVEAAL DNILQTAGLD VTLDDLRANP EGLEIPGSPN EEGKHAAGKL RKDGQPGFNT
PSGKLEFDSE ILKGFGYEGL PVYEEPVHTP YAPTEEDKRY PLVLNAGSRL PYYTHSKLRE
IPWLNQFMPE PVVRLHPQDA RDRSIGTGDV VRVFNHQNEI EMKAEVTNLV HAGMVDIFHG
WHQANVNLLT TRDFDPITGF PPFRCGLCEV EATGKGRLVS Q