Gene Elen_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0140 
Symbol 
ID8414424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp193969 
End bp196731 
Gene Length2763 bp 
Protein Length920 aa 
Translation table11 
GC content67% 
IMG OID645023120 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_003180523 
Protein GI257789917 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAAG CGCTGGACAT TCAAGTCAAA GGCATCGTGC AGGGCGTGGG CTTTCGCCCC 
TTCGTCTACC GTTTGGCGAA GAAATACCTG ATCGACGGAT GGGTGCTGAA CGCCACCGAC
GGCGTGTTCA TCCATGCCGA GGGCGAGACG AAGCTGCTCG ACGAGTTCGT CATCGAGCTG
TCCGAGAACC CACCGGCCGC CTCGCGCGTG GAGGAAGTGA CGCTGAAGGA AGTGCCGCTG
GAGGACTTCG ACTCGTTCGA GATCCGCTTC TCCGACGCAG GCGCCGTGGA GAAGACCACG
CTCGTGTCGC CCGACCTGGC CACCTGCGAC GATTGTGCCC GCGAGCTGTT CAACCCGAAC
GACCGCCGCT ACCGCTACCC GTTCATCAAC TGCACGAACT GCGGCCCGCG CTTCACTATC
ATCGAGAAGC TGCCCTACGA TCGCAAGAGC ACGTCGATGA AGGACTTCCC CATGTGCGAG
CGCTGCGCCC GCGAGTACGG CGACCCGTTG GACCGCCGCT TCCACGCGCA GCCCGACGCG
TGCTTCGAAT GCGGCCCCCA TATCAGCTGG CGCGAACACG AGGGCATGCC CGGAGCGCTC
GCCGGCGTTC ATGCCCCCGG CGTTTCCGCC GACGTGCCGG CCGCCCCTGA CGTCCCCGCT
GGCGCTTCCG CCTCCCATGC GGCGCAAACA AATGTTTCAC GTGAAACATT GTGCGCTTCC
GATGCCATTG ACGCGCCCCT GGGACCCATC GCCTGGGGAA CCACGCGCGA GGAGAGCGAT
GCCATACTTG CGCGCGCCGT GGAACTGCTG CTGGCGGGCA AGATCCTGGC CGTGAAGGGC
CTGGGCGGTT TCCATCTGGT GTGTGACGCG TCGAACCCCG AAGCGGTGGC GTTGCTGCGC
GCGCGCAAGC GCCGCGAGGG CAAGGCGTTC GCCGTCATGA TGGCCGATAT GGAAGCCGTG
CGTGCCTATT GCGAAGCGGG CGAAGCGGAG GAGGGCATCC TCACGGCCAC GCAGCGCCCC
ATCGTGCTGC TGCGCAAGCG CCCCGATGCT GTTTTTGCTC CCGGCTTGGC CGACGCCCTC
CCCGAACTCG GCGTCATGCT GCCCTACACG CCCCTCCAGC ATCTGCTGCT GCACGACTTC
GCGAAAGCGA GCGGTTGTGC CGAGCGCAGC TCCCAGCGCG AAGCCGAAGG ATCCCGCGCA
GCGACGGCCG TCCCTATGCT GGTGATGACC TCGGGCAACA TCCACGACGA GCCCATCGTC
ATCGACGACG AGGATGCCTA CGAGAAGCTG TTCGCCGTGG CCGACGCGTT CCTGGGCCAC
GACCGCGCCA TCCGCGCCCG CTACGACGAC TCGGTGGTGC GCGTTATCGA AGCGGGCAGC
GCCGGCGAAG CCGTCCAATT CATCCGCCGC GCCCGAGGCT ACGCGCCGCT CCCGTTGGCG
ATGCCCGCGA AGGCGCGCGA AGGAGAAGAT GTTTCACGTG AAACATCCGT TCGCGAACAG
GGTTGCTCCA TCTTCGCCAC CGGGCCCGAG CAGAAGAACA CGTTTGCGCT CACGCGCGAC
GCCGAGGCGT TCGTGTCGCA GCACATCGGC GACCTCGAGA ACGCCGAAAC CTACGACGCG
TGGCTCCAGG CGAAGGACCG CTACGAAACG CTGTTCGAGA TCGAACCCGA CCGCATCGCG
TGCGACCTGC ATCCCGAGTA CCTCACGTCG AAGTGGGCCC ACGAGCAGAG CCTCCCCGTC
ACCGAGGTGC AGCACCATCA TGCGCACGTC GTGTCGGTGA TGGCCGAGCA CGGCCTGGCC
GGCCCCGTGT GCGGCATCGC GTTCGACGGC ACGGGCTACG GCATGGACGG CGCCATCTGG
GGCGGGGAAG TGCTGCTCAC CAACCTGACT GCGTTCGAGC GTTTCGCGAA CTTCGCCTAC
GTGCCCATGC CGGGCGGCGC GGCGGCCGTA AAGCATCCGC TGCGCATGGC CTACGGCGTG
CTGTGGGCGT TCGACCTGCT CGACCATCCG GGGGCTGCGG CTGCGCTCGA GGCTCTGGGC
GAGCAGGCGG CCGTGTGCGA TCAGATGATC GACCGCGGCA TCAACACGCC CATGACCTCG
TCCGTCGGCC GGCTGTTCGA CGCGGCCAGC GCGCTGTTGG GCATCTGCGC CGAGCCTGCG
TACGAAGGCG AGCCCGCCAT CCTGCTGGAA GCCGCAATCG GGCGAACAGA TGTTTCACGT
GAAACATCTG CCGCCGACGC AACCGATGCG ACCCCTGCTG ACACCAACCG GGATCCTTCG
ACTCCGGCGC TTCGCGCCTC CGCTCAGAAT GACGATGCCG GGCTTAGCGC CTCCGCTCGG
GAAGCCGCGT CCCGTTACGC CATCGCGGTG GTGAAGAACA CCGCCACTGC CGCCAGCACG
GCGCAGGACA CCTCGGTCGT GCTGTTCGAT GCCGCGCCAG CGTTCGCCGC GCTGCTCGAC
GACCTCGCGG CCGACGTGCC TGTCGGCATC ATCGCGCGCT GCTTCCACGA CGCGTTCGTG
CAAGCCATCG TCACGGCGGC CGAGCTCGTG CGCAGCTTGT ACGGCATAAC CACGCTTGCC
CTGTCCGGCG GCGTGTTCAT GAACCGCTAC CTGCTGGAGC ACGCTCTGGC GGCGCTCGAG
CAGGCGGGTT TCACGGTGGC CGTCAACCGC GATCTGCCGC CCAACGACGG CTGCATCTCC
TTCGGCCAGG CTGTGGTAGC ATGGGCTTCG AGCAAAGAAG AAGGAGAACA GCCATGTGCT
TAG
 
Protein sequence
MKEALDIQVK GIVQGVGFRP FVYRLAKKYL IDGWVLNATD GVFIHAEGET KLLDEFVIEL 
SENPPAASRV EEVTLKEVPL EDFDSFEIRF SDAGAVEKTT LVSPDLATCD DCARELFNPN
DRRYRYPFIN CTNCGPRFTI IEKLPYDRKS TSMKDFPMCE RCAREYGDPL DRRFHAQPDA
CFECGPHISW REHEGMPGAL AGVHAPGVSA DVPAAPDVPA GASASHAAQT NVSRETLCAS
DAIDAPLGPI AWGTTREESD AILARAVELL LAGKILAVKG LGGFHLVCDA SNPEAVALLR
ARKRREGKAF AVMMADMEAV RAYCEAGEAE EGILTATQRP IVLLRKRPDA VFAPGLADAL
PELGVMLPYT PLQHLLLHDF AKASGCAERS SQREAEGSRA ATAVPMLVMT SGNIHDEPIV
IDDEDAYEKL FAVADAFLGH DRAIRARYDD SVVRVIEAGS AGEAVQFIRR ARGYAPLPLA
MPAKAREGED VSRETSVREQ GCSIFATGPE QKNTFALTRD AEAFVSQHIG DLENAETYDA
WLQAKDRYET LFEIEPDRIA CDLHPEYLTS KWAHEQSLPV TEVQHHHAHV VSVMAEHGLA
GPVCGIAFDG TGYGMDGAIW GGEVLLTNLT AFERFANFAY VPMPGGAAAV KHPLRMAYGV
LWAFDLLDHP GAAAALEALG EQAAVCDQMI DRGINTPMTS SVGRLFDAAS ALLGICAEPA
YEGEPAILLE AAIGRTDVSR ETSAADATDA TPADTNRDPS TPALRASAQN DDAGLSASAR
EAASRYAIAV VKNTATAAST AQDTSVVLFD AAPAFAALLD DLAADVPVGI IARCFHDAFV
QAIVTAAELV RSLYGITTLA LSGGVFMNRY LLEHALAALE QAGFTVAVNR DLPPNDGCIS
FGQAVVAWAS SKEEGEQPCA