Gene Information |
Locus tag | Elen_0140 |
Symbol | |
ID | 8414424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 193969 |
End bp | 196731 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023120 |
Product | (NiFe) hydrogenase maturation protein HypF |
Protein accession | YP_003180523 |
Protein GI | 257789917 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
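
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon, which the record does not state explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 193969
end_bp = 196731

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is read as consecutive 3 bp codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2763 0 920
```

Under these assumptions the computed values agree with the reported Gene Length (2763 bp) and Protein Length (920 aa).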
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAAG CGCTGGACAT TCAAGTCAAA GGCATCGTGC AGGGCGTGGG CTTTCGCCCC TTCGTCTACC GTTTGGCGAA GAAATACCTG ATCGACGGAT GGGTGCTGAA CGCCACCGAC GGCGTGTTCA TCCATGCCGA GGGCGAGACG AAGCTGCTCG ACGAGTTCGT CATCGAGCTG TCCGAGAACC CACCGGCCGC CTCGCGCGTG GAGGAAGTGA CGCTGAAGGA AGTGCCGCTG GAGGACTTCG ACTCGTTCGA GATCCGCTTC TCCGACGCAG GCGCCGTGGA GAAGACCACG CTCGTGTCGC CCGACCTGGC CACCTGCGAC GATTGTGCCC GCGAGCTGTT CAACCCGAAC GACCGCCGCT ACCGCTACCC GTTCATCAAC TGCACGAACT GCGGCCCGCG CTTCACTATC ATCGAGAAGC TGCCCTACGA TCGCAAGAGC ACGTCGATGA AGGACTTCCC CATGTGCGAG CGCTGCGCCC GCGAGTACGG CGACCCGTTG GACCGCCGCT TCCACGCGCA GCCCGACGCG TGCTTCGAAT GCGGCCCCCA TATCAGCTGG CGCGAACACG AGGGCATGCC CGGAGCGCTC GCCGGCGTTC ATGCCCCCGG CGTTTCCGCC GACGTGCCGG CCGCCCCTGA CGTCCCCGCT GGCGCTTCCG CCTCCCATGC GGCGCAAACA AATGTTTCAC GTGAAACATT GTGCGCTTCC GATGCCATTG ACGCGCCCCT GGGACCCATC GCCTGGGGAA CCACGCGCGA GGAGAGCGAT GCCATACTTG CGCGCGCCGT GGAACTGCTG CTGGCGGGCA AGATCCTGGC CGTGAAGGGC CTGGGCGGTT TCCATCTGGT GTGTGACGCG TCGAACCCCG AAGCGGTGGC GTTGCTGCGC GCGCGCAAGC GCCGCGAGGG CAAGGCGTTC GCCGTCATGA TGGCCGATAT GGAAGCCGTG CGTGCCTATT GCGAAGCGGG CGAAGCGGAG GAGGGCATCC TCACGGCCAC GCAGCGCCCC ATCGTGCTGC TGCGCAAGCG CCCCGATGCT GTTTTTGCTC CCGGCTTGGC CGACGCCCTC CCCGAACTCG GCGTCATGCT GCCCTACACG CCCCTCCAGC ATCTGCTGCT GCACGACTTC GCGAAAGCGA GCGGTTGTGC CGAGCGCAGC TCCCAGCGCG AAGCCGAAGG ATCCCGCGCA GCGACGGCCG TCCCTATGCT GGTGATGACC TCGGGCAACA TCCACGACGA GCCCATCGTC ATCGACGACG AGGATGCCTA CGAGAAGCTG TTCGCCGTGG CCGACGCGTT CCTGGGCCAC GACCGCGCCA TCCGCGCCCG CTACGACGAC TCGGTGGTGC GCGTTATCGA AGCGGGCAGC GCCGGCGAAG CCGTCCAATT CATCCGCCGC GCCCGAGGCT ACGCGCCGCT CCCGTTGGCG ATGCCCGCGA AGGCGCGCGA AGGAGAAGAT GTTTCACGTG AAACATCCGT TCGCGAACAG GGTTGCTCCA TCTTCGCCAC CGGGCCCGAG CAGAAGAACA CGTTTGCGCT CACGCGCGAC GCCGAGGCGT TCGTGTCGCA GCACATCGGC GACCTCGAGA ACGCCGAAAC CTACGACGCG TGGCTCCAGG CGAAGGACCG CTACGAAACG CTGTTCGAGA TCGAACCCGA CCGCATCGCG TGCGACCTGC ATCCCGAGTA CCTCACGTCG AAGTGGGCCC ACGAGCAGAG CCTCCCCGTC ACCGAGGTGC AGCACCATCA TGCGCACGTC GTGTCGGTGA TGGCCGAGCA CGGCCTGGCC 
GGCCCCGTGT GCGGCATCGC GTTCGACGGC ACGGGCTACG GCATGGACGG CGCCATCTGG GGCGGGGAAG TGCTGCTCAC CAACCTGACT GCGTTCGAGC GTTTCGCGAA CTTCGCCTAC GTGCCCATGC CGGGCGGCGC GGCGGCCGTA AAGCATCCGC TGCGCATGGC CTACGGCGTG CTGTGGGCGT TCGACCTGCT CGACCATCCG GGGGCTGCGG CTGCGCTCGA GGCTCTGGGC GAGCAGGCGG CCGTGTGCGA TCAGATGATC GACCGCGGCA TCAACACGCC CATGACCTCG TCCGTCGGCC GGCTGTTCGA CGCGGCCAGC GCGCTGTTGG GCATCTGCGC CGAGCCTGCG TACGAAGGCG AGCCCGCCAT CCTGCTGGAA GCCGCAATCG GGCGAACAGA TGTTTCACGT GAAACATCTG CCGCCGACGC AACCGATGCG ACCCCTGCTG ACACCAACCG GGATCCTTCG ACTCCGGCGC TTCGCGCCTC CGCTCAGAAT GACGATGCCG GGCTTAGCGC CTCCGCTCGG GAAGCCGCGT CCCGTTACGC CATCGCGGTG GTGAAGAACA CCGCCACTGC CGCCAGCACG GCGCAGGACA CCTCGGTCGT GCTGTTCGAT GCCGCGCCAG CGTTCGCCGC GCTGCTCGAC GACCTCGCGG CCGACGTGCC TGTCGGCATC ATCGCGCGCT GCTTCCACGA CGCGTTCGTG CAAGCCATCG TCACGGCGGC CGAGCTCGTG CGCAGCTTGT ACGGCATAAC CACGCTTGCC CTGTCCGGCG GCGTGTTCAT GAACCGCTAC CTGCTGGAGC ACGCTCTGGC GGCGCTCGAG CAGGCGGGTT TCACGGTGGC CGTCAACCGC GATCTGCCGC CCAACGACGG CTGCATCTCC TTCGGCCAGG CTGTGGTAGC ATGGGCTTCG AGCAAAGAAG AAGGAGAACA GCCATGTGCT TAG
Protein sequence | MKEALDIQVK GIVQGVGFRP FVYRLAKKYL IDGWVLNATD GVFIHAEGET KLLDEFVIEL SENPPAASRV EEVTLKEVPL EDFDSFEIRF SDAGAVEKTT LVSPDLATCD DCARELFNPN DRRYRYPFIN CTNCGPRFTI IEKLPYDRKS TSMKDFPMCE RCAREYGDPL DRRFHAQPDA CFECGPHISW REHEGMPGAL AGVHAPGVSA DVPAAPDVPA GASASHAAQT NVSRETLCAS DAIDAPLGPI AWGTTREESD AILARAVELL LAGKILAVKG LGGFHLVCDA SNPEAVALLR ARKRREGKAF AVMMADMEAV RAYCEAGEAE EGILTATQRP IVLLRKRPDA VFAPGLADAL PELGVMLPYT PLQHLLLHDF AKASGCAERS SQREAEGSRA ATAVPMLVMT SGNIHDEPIV IDDEDAYEKL FAVADAFLGH DRAIRARYDD SVVRVIEAGS AGEAVQFIRR ARGYAPLPLA MPAKAREGED VSRETSVREQ GCSIFATGPE QKNTFALTRD AEAFVSQHIG DLENAETYDA WLQAKDRYET LFEIEPDRIA CDLHPEYLTS KWAHEQSLPV TEVQHHHAHV VSVMAEHGLA GPVCGIAFDG TGYGMDGAIW GGEVLLTNLT AFERFANFAY VPMPGGAAAV KHPLRMAYGV LWAFDLLDHP GAAAALEALG EQAAVCDQMI DRGINTPMTS SVGRLFDAAS ALLGICAEPA YEGEPAILLE AAIGRTDVSR ETSAADATDA TPADTNRDPS TPALRASAQN DDAGLSASAR EAASRYAIAV VKNTATAAST AQDTSVVLFD AAPAFAALLD DLAADVPVGI IARCFHDAFV QAIVTAAELV RSLYGITTLA LSGGVFMNRY LLEHALAALE QAGFTVAVNR DLPPNDGCIS FGQAVVAWAS SKEEGEQPCA
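
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database uses: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the amino acid assignment string published by NCBI for translation table 11, which matches the standard code for every codon. The sequences are stored here as space-separated blocks of 10 characters, so whitespace is removed first.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# first position varying slowest and third position fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
head = "ATGAAGGAAG CGCTGGACAT TCAAGTCAAA"
print(translate(head))  # prints: MKEALDIQVK
```

The translated fragment matches the first block of the protein sequence. Applying `gc_percent` to the full gene sequence should give a value comparable to the reported GC content, although the short fragment above is not representative of the whole gene.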