Gene Elen_1856 details

Gene Information

Locus tag: Elen_1856
Symbol:
ID: 8416160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2182247
End bp: 2184307
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 70%
IMG OID: 645024826
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003182209
Protein GI: 257791603
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
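
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with one stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2182247, 2184307)
print(gene_len, protein_len)  # 2061 686
```

Under these assumptions the result agrees with the Gene Length (2061 bp) and Protein Length (686 aa) fields listed for Elen_1856.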


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCGG CGACGCCTTC GCGCGAGGCG CTGTGGAACA TCGAGGGGTC GTGGCTCGTG 
TACCCCTGCT TCCTGTTGGT GCTGGTCGTC GCCGCGTACT TCTTCTGGCG GCGCTACCGC
CTGTGGAAGA TCGGCCGACC GCTTGAGCGC GGCGATCGCC CGCTCGAGCG CTTGAAGGGC
GCGTTCGTGG ACGCGCTCTT GCAGGTCACC GTCGTGAAGG AGCGGGGCGT CGGCATCGCG
CACCTCGGGA TGTACGTGGG CATGGCCGTC ATGGTGGTGG CCACGGCGAG CTATGCGGTG
CAGGTGGACC TGGGGCTCGA CATCGCCAAG GGCGACTACT ACCTGTACGT GCTGGCGCTC
GGCACCGATA TCGCGGGGTT GGCGTTCTGC ATCGCGATGG TCGCCTGCAT CGTGCGGCGG
GCGGCCGGCA GGAACCCGTC GCTCGAGACG AAGCCGGCCG ACATCGTTGT GCTGGCATGG
CTGCTGGTCA TCGGCGTCAC GGGCTTCGTC GTGGAGGGGC TGCGCATCGT GGGTACGAAC
GATCCGTGGG CCGCGTGGTC GCCCATCGGC AATCTGTTCG CGCCGCTGTT CGCAGGCTTG
AGCGCCGCCC AGGTGTCGAC GGCGCACCAG GTTCTATGGT GGTTCCATAT GGCCATCGCC
TTCGGGATCC TGGCGTACTG GATGTACTCG AAGCTCGTGC ACGTGCTGCT GGTCCCGGCC
ACCGTGTACT GCCGTCCGCT TGAGCCGAAG GGGACGCTGT CCTACGTCGA CCTCGAGGAC
GAGGAGCTGG AAGAGTTCGG CGTGGGAAAG CTGGAGGACT TCACATGGAA GGACCTGCTG
GACGCCGAGG CGTGCGTGCG CTGCGGCCGC TGCGAAACGG TGTGCCCTGC GCACGGAAGC
GGCAAGCCGC TGTCGCCGAA GGACCTGATG CAGGCGCTCG ACGCCCATCT GGGGGAGCGC
GGGCCGCTCG TGCGCGCCGA GCGTCGGGCC GAAGCGGCGG GCGAAGCGTT CGAGCCGACC
GAGGAGCAGC GGGCCGTGCT GGACAAGGCG CTCGTGGGCG ACGTGGTCGC GCCCGAGGCG
CTGTGGTCGT GCACTACGTG CGGCGCGTGC ATGGAGGCGT GCCCGGCGCT GCTGGAGCAC
GTGCCGAAGG TGGTGGGCAT GCGCACCTAC CAGGTGTCGA TGGAAAGCGC GTTCCCGTCG
GAGGCTAAGG CGGCCTTCCG CAACCTCGAG ACGAACGGCA ACCCGTGGGG CTTGGGGTGG
CAGAGCCGCA TGGCATGGGC GGAGGGGCTC GACGTGCCCA CGCTGGCCGA CCGTCCGCAG
GCGGAGTACG TGTACTGGCC TGGCTGCTCG GGAGCGTACG ACGCGCGCAA CCGCAAGGTG
TCGCGCGCCC TCGTGGCGCT GCTGAGGCAT GCGGGCGTGG ACTTCGCCGT CATCGGCCCG
GAGGAGAAGT GCTGCGGCGA CGCGGCGCGG CGCATGGGCA ACGAGTTTCT GTACTACCAG
CTTGCCACCG AGAACATCGA GACGCTGAAC GCCTACGGCG CGAAGAAGAT CATCGTGCAG
TGCCCGCACT GCGCCCAGGC GCTGGAGCGC GATTACCCGC AGCTTGGCGG CCGGTTCGAA
GTGGTGCGGC ACGCGCAGCT GCTCGAGAGG CTCGTGGCCG AAGGGAGGCT GCCGGGCGCG
GAGCGGGCGG GCGCGCAGGC GGCGTTCGAG CGCGTCACGT ACCACGACTC GTGCTACCTG
GGACGCTACG CCGACGTGTA CGACGAGCCG CGTGCGGTGG TGAAGGCGTG CGGCGCCCAG
GTGGTGGAGA TGGAGCGGAC GCGCGAGAAG AGCTTCTGCT GCGGCGCGGG CGGCGGGCGC
ATGTGGCTCG AGGAGCGCGA GGGGCGGCGC ATGAACGTCC TGCGCGCCGA GCAGGCGCGC
GACACCGGTG CGGACGCCGT GGCCACCGCC TGCCCGTTCT GCCTGTCCAT GCTGGAAGAC
GGCCTGGCGT CCCAGGACGA TGCCCTGCCG GTACGGGACA TCGCCGAGCT GTTGTCCGAC
GCGCTGGCTC TGTCGCGGTG A
 
Protein sequence
MDAATPSREA LWNIEGSWLV YPCFLLVLVV AAYFFWRRYR LWKIGRPLER GDRPLERLKG 
AFVDALLQVT VVKERGVGIA HLGMYVGMAV MVVATASYAV QVDLGLDIAK GDYYLYVLAL
GTDIAGLAFC IAMVACIVRR AAGRNPSLET KPADIVVLAW LLVIGVTGFV VEGLRIVGTN
DPWAAWSPIG NLFAPLFAGL SAAQVSTAHQ VLWWFHMAIA FGILAYWMYS KLVHVLLVPA
TVYCRPLEPK GTLSYVDLED EELEEFGVGK LEDFTWKDLL DAEACVRCGR CETVCPAHGS
GKPLSPKDLM QALDAHLGER GPLVRAERRA EAAGEAFEPT EEQRAVLDKA LVGDVVAPEA
LWSCTTCGAC MEACPALLEH VPKVVGMRTY QVSMESAFPS EAKAAFRNLE TNGNPWGLGW
QSRMAWAEGL DVPTLADRPQ AEYVYWPGCS GAYDARNRKV SRALVALLRH AGVDFAVIGP
EEKCCGDAAR RMGNEFLYYQ LATENIETLN AYGAKKIIVQ CPHCAQALER DYPQLGGRFE
VVRHAQLLER LVAEGRLPGA ERAGAQAAFE RVTYHDSCYL GRYADVYDEP RAVVKACGAQ
VVEMERTREK SFCCGAGGGR MWLEEREGRR MNVLRAEQAR DTGADAVATA CPFCLSMLED
GLASQDDALP VRDIAELLSD ALALSR