Gene Elen_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1336 
Symbol 
ID8415634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1599427 
End bp1600860 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content67% 
IMG OID645024305 
Productprotein of unknown function DUF512 
Protein accessionYP_003181694 
Protein GI257791088 
COG category[C] Energy production and conversion 
COG ID[COG1625] Fe-S oxidoreductase, related to NifB/MoaA family 
TIGRFAM ID[TIGR03279] putative FeS-containing Cyanobacterial-specific oxidoreductase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.506965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGG TGTACCCGTC GGGCGATATC GATCGCGAGG CGGCGGCCAG GCAGCGCGGC 
GGCTGCGGCG CGCTGCGCGA GGCGCCGCGG GCTCTCGTGA TCGCCGTCGC GCCCGACAGC
CCGGCCGACG ATGCGGGGTT CGAGCCTGGA TGCTACGTGA CCACGGTGGA CGGCAGGCCC
GTGCGCGACC TCATCGACTG GCGCTGGCTT GCGGCCGACG ACGTCATGGA TCTGGGCTAC
GTCGATCTCG ACGGCGACGA GGGCGTCGTG GAACTGGAGC GCGAGGAGGG CGAGGACTGG
GGCTTCGAGT TCGAAGGCGT CGTGTTCGAC GGCGTGAGGC AATGCCGCAA CGCCTGCACG
TTCTGCTTCA TGCGCCAGCT GCCCGACGAT ATGCGCTCCT CGCTCACCTT ACGCGACGAC
GATTTTCGCC TGAGCTTTCT CGCGGGCACC TTCGTCACGT TCACCAACCT GAAGCCCGAA
GACGAACGGC GCATCGTCGA GCAGCGCATA TCGCCTCTGC GCCTGTCGCT GCACGTGGCG
GATCCGGAGG TGCGCCGACG CATGATCGGC AAGCACGCGC AGCACGGCAT CGACGTGCTG
GAGCGGCTGC TGGAAGCGGG CATCGAGTTC CACGCCCAGA TCGTGCTCGT GCCCGACCAG
AACGACGGCG CCGTGCTGGA AGACACGCTT GCCTGGGCGT ACGCGCGCCC CGGCATTCTC
GACGTGTGCA TCGTGCCGCT GGGATTCACG AAGCACCAGA GCGTCTTCGA CCGCAGCTTC
AACGACCCCG TGTCGTCGCG CGCGGTGATG GACCTCGTCA TTGCGTTCCA ACGGCGCGCT
CTGGCCGAGC GCGGCAGCAT GTGGGCCTTC CCGGCCGACG AGTTCTACCA TAACGCCTAC
GGCCCCGAGC TGTTGCGGAA CCTGCCGCCG TCGGAGCACT ACGGTGACTT CGGCATGTTC
GAGGACGGCG TGGGCATCAT CCGCTCGTTC GTTGACGACT GGAAAAGGGC AGAGCGCGCG
GGGCTCGTCG AGCGTTGCGC CGCGGCTCTG CGCGCCGCCG ACGCGCGCGT CCACTACGTG
GCCGGCTGCG CGACGCGCCA CTTCCTGGGC CCGCTCATCG ATGCGAGCCC GCTCAAGGGG
CTGCTGATCC CGCTGTTCGT GAAGAACGAG TTCTTCGGCG GCAACGTGGA CGTGACGGGG
CTTCTATGCG GCTGCGACAT GGCCGATGCC GTGCGTGCCG AGCGCGAGCG CGGCTTGCAG
CTGGCGCTCA TCCCGCGCGT CGTGTTCAAC GACGACGCGG TAACGCTCGA TGACATGAGT
TTGGAGGATA TGGAAAAGCG GGCGGGCGCG CGCATGTCCG TGGTATCCTG TAACGCATCG
GATTATCTCC TCGAGATCAT CCACCTGGTC GGACGAACCG ATCCGACCTC CTGA
 
Protein sequence
MPPVYPSGDI DREAAARQRG GCGALREAPR ALVIAVAPDS PADDAGFEPG CYVTTVDGRP 
VRDLIDWRWL AADDVMDLGY VDLDGDEGVV ELEREEGEDW GFEFEGVVFD GVRQCRNACT
FCFMRQLPDD MRSSLTLRDD DFRLSFLAGT FVTFTNLKPE DERRIVEQRI SPLRLSLHVA
DPEVRRRMIG KHAQHGIDVL ERLLEAGIEF HAQIVLVPDQ NDGAVLEDTL AWAYARPGIL
DVCIVPLGFT KHQSVFDRSF NDPVSSRAVM DLVIAFQRRA LAERGSMWAF PADEFYHNAY
GPELLRNLPP SEHYGDFGMF EDGVGIIRSF VDDWKRAERA GLVERCAAAL RAADARVHYV
AGCATRHFLG PLIDASPLKG LLIPLFVKNE FFGGNVDVTG LLCGCDMADA VRAERERGLQ
LALIPRVVFN DDAVTLDDMS LEDMEKRAGA RMSVVSCNAS DYLLEIIHLV GRTDPTS