Gene Elen_1843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1843 
Symbol 
ID8416147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2167756 
End bp2169051 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content68% 
IMG OID645024813 
ProductElectron-transferring-flavoproteindehydrogenase 
Protein accessionYP_003182196 
Protein GI257791590 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.186881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATG ATCAGTTCGA CGCGATCGTC GTCGGCGCCG GGTGCGCCGG CGCGGTCGCA 
GCCTACCGGC TGGCCTCCGC GGGCCGTTCC GTTCTCGTAG TGGAACGCGG CAACTACGCC
GGCGCCAAGA ACATGACCGG CGGGCGCATC TACACGCATG CGCTCGCCCG CGTGTTCCCC
GACTTCGAGG GAAGCGCTCC CCTGCAACGG CGCATCACGC ACGAGAAGAT ATCGATGGCC
GCCCCCGATG CCCTGTTCAC CATGGACTTC ACGGCCGACG AGCTGCGCGA GTCGGGCAAG
GACTCCTACG CGGTGCTGCG CGGACCCTTC GACCAGTGGC TGGCCGAGCG GGCCGAGGAG
GCGGGAGCCG ACTTCATCTA CGGCATCGCC GTGGAGGACC TCATCGTGCG CGACGGGCGG
GTATGCGGCG TCGTCGCAGG CGGCGACGAG GTGGAAGCGC GCATCACCGT CTTGGCAGAC
GGAGCGAACT CCCTGCTCGC CCAGAAGGCG GGACTCGCCG ACGACGTCCC CCTTCCCTGC
CAGATGGCCG TGGGCGCGAA GGAGACCATC GCGCTTCCGC CGTCGGTCAT CGAAGACCGG
TTCCTGTGCG CGCCGGGCGA AGGCGCAGCA TGGATGTTCG CCGGCGACTG CACGAAAGGG
CGCGTGGGAG GCGGGTTCCT GTACACGAAC GACGACTCGA TCTCGCTCGG CCTCGTGGCA
ACGCTGTCGG ATCTGGCCGC GGGCGACGTG CCCATCTACC AGATGCTGGA GGACTTCAAG
CAGCGCGCCG ACATCGCCCC GCTGGTCAAG GGCGGCGAGC TGGTGGAGTA CTCCGGGCAC
CTCGTGCCCG AAGGCGGCCT CGCCATGGTT CCGCGCCTGT ACGGCGACGG CGTGCTGGTG
TGCGGCGACG CGGCCATGCT GTGCATGAAC CTGGGGTACT CGGTGCGCGG CATGGACTTC
GCCGTCAGCT CGGGCGACCT CGCGGCCGAC GCCGTCATGC GCGCGCTCGA GGCCGACGAC
GTGTCGGCCG CGCAGCTGGC CGCCTACGAG CAGCTGCTCG ACGGCTGCTA CGTCCTGCAG
GACCTCAAGC GCTTCCGCCG CTTCCCGCGC TATATGGAAG AGACGACGCG CATCTTCAAC
GAGTACCCGG CCATGGCCCG CGACATCATG CTCAACCTGT TCAAGGTGGA CGGGCATCCT
CAGGCGCGCA TCAAAGACAA ACTTCTGGGC CCGCTCAAGC GCGTGGGGCT GTGGCGGGTG
GCGCAGGACG GCAGGAAGGG GATGAAGGCC TTATGA
 
Protein sequence
MSDDQFDAIV VGAGCAGAVA AYRLASAGRS VLVVERGNYA GAKNMTGGRI YTHALARVFP 
DFEGSAPLQR RITHEKISMA APDALFTMDF TADELRESGK DSYAVLRGPF DQWLAERAEE
AGADFIYGIA VEDLIVRDGR VCGVVAGGDE VEARITVLAD GANSLLAQKA GLADDVPLPC
QMAVGAKETI ALPPSVIEDR FLCAPGEGAA WMFAGDCTKG RVGGGFLYTN DDSISLGLVA
TLSDLAAGDV PIYQMLEDFK QRADIAPLVK GGELVEYSGH LVPEGGLAMV PRLYGDGVLV
CGDAAMLCMN LGYSVRGMDF AVSSGDLAAD AVMRALEADD VSAAQLAAYE QLLDGCYVLQ
DLKRFRRFPR YMEETTRIFN EYPAMARDIM LNLFKVDGHP QARIKDKLLG PLKRVGLWRV
AQDGRKGMKA L