Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1582 |
Symbol | |
ID | 8415881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1879769 |
End bp | 1881520 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024551 |
Product | thiamine pyrophosphate protein domain protein TPP-binding |
Protein accession | YP_003181939 |
Protein GI | 257791333 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.077306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTGA TGTCAGGCAA CGAGGCAATA GCCCAGGGGG CATGGGAGGC CGGCGCGCGC ATCGGCGTGG CCTATCCCGG CACGCCGTCG ACGGAGACGC TCGAGGCGTT CGCGAAGAAG GACGGCGTGT ACGCCGAATG GTGCGTCAAC GAGAAGGTGG CCGTCGAGGT GGGCATCGGG GCGTCGGTCG CCGGCGCGCG CGTGCTGTCC ACGATGAAGC ACGTCGGCGT GAACGTGGCG GCCGACCCGT TGTTCACTGC GGCATACGCG GGCGTGGGCG GCGGGCTCGT GGTGCTGGCT GCCGATGATC CGGGCATGTA CTCGTCCCAA AACGAACAGG ATTCCCACTG GTACGCACGC GCAGCCCACA TCCCCATGCT CGACCCTGCC GATTCCGCCG AGGCTTTGCG CTTCACGCGC GAGGCGTACG ACGTGTCCGA GCGCTTCGAC GTGCCCGTCT TCATCCGTTC CACAGTGCGC GTGTCGCACA CGAAAACGCC GGTGGAGCCC GGCGAGCGCA CCGAGATCGC GCTCAAGCCC TACGAAAGCG ATCCGGCGAA GTGGGTCATG ATGCCCGCGT TCGCCAAGCC CCGCCGCAAG GTGCAGCTGG CGCGCATCGA CGCGCTGCGC GCCTGGGCCG AGGAGTGCCC CTACAACGAG GTCGTGCGCA AGGGGAGCGC CGTGGGCGTG GTGTGCGCCG GCGCCGTCTA CCAGCACGTC GTGGAAGCGC TGCCCGACGC GTCCGTGTTC AAGCTGGGCC TCACCTGGCC GCTTCCGCAG CAGGCGCTGC GCGACTTCGC CGAGAGCGTG GACGCGCTCT ACGTGGTGGA GGAGGCTTCC GAGTACCTGG ACGAGGGCGT GCGGGCGCTG GGCATCGAGG TTGCCGCGTT CGAGAGCCCG CTGCCGCGCG ACGGCGAACT GACCCCCGGC CTCATCAGAG CGGCGTTCGG CTTCGAGGAG CCTGCTCACG AGCCGTTGCC CGCCGGCCTT CCGGGCCGCC CGCCGGCGTT GTGCGCCGGG TGCCCGCACC GTCTCGTGTT CAAGGAGCTT TCCCGCATGA AGGCCGTCGT CACCGGCGAC ATCGGCTGCT ACACTTTGGG CGCGCTGCCG CCGCTGTCCG CCATGGACAC CACCATCGAC ATGGGCGCGT CGGTTTCCAT GTCGCATGGC TTCGAGCTGG CTTGGGCGGG CACCGACCAC CGTCCCGTCG TGGGCGTCAT CGGCGACTCG ACGTTCGCGC ACTCGGGCCT GTCGGCGCTG ATCTCCACCG TGTACAACCA AGGACGCGGA ACCGTCTGCG TGCTGGACAA CCGCACCACG GCCATGACGG GTCGCCAGGG AAACCCCTTC AACGGCGAGA CGCTGCAAGG CCGCCTCTCG CGCGAGCTCG ACCTTGAGAG CGTCGTGCGC GCCATCGGCG TCGAGGACGT GCGCACCGTC GACCCGAACG ACGCCAAAGC CGTTCGCCGC GCGCTCAAGG AGGCCGTGGC CTCGGAGGAG CTTTCCGTGC TCGTGTTCCG CAGCCCCTGC GTGCTGATCG ACCGGCATCG CGAGCCCGCC TACGCGGTGA CCGACGCCTG CACGGCATGC GGCGTGTGCT CGACTTTAGG CTGTCCGGCC ATCGCGAAGG ATCCGGCGAA CGACCACGCG CTCATCGATG CCGCCCAGTG CATCGGATGC GGCCAGTGCG CCCAGTACTG CGCTTGGAAC GCCATCGCGC AACCCGCTGG GGAGAAAGGA GGCGTCGCAT GA
|
Protein sequence | MELMSGNEAI AQGAWEAGAR IGVAYPGTPS TETLEAFAKK DGVYAEWCVN EKVAVEVGIG ASVAGARVLS TMKHVGVNVA ADPLFTAAYA GVGGGLVVLA ADDPGMYSSQ NEQDSHWYAR AAHIPMLDPA DSAEALRFTR EAYDVSERFD VPVFIRSTVR VSHTKTPVEP GERTEIALKP YESDPAKWVM MPAFAKPRRK VQLARIDALR AWAEECPYNE VVRKGSAVGV VCAGAVYQHV VEALPDASVF KLGLTWPLPQ QALRDFAESV DALYVVEEAS EYLDEGVRAL GIEVAAFESP LPRDGELTPG LIRAAFGFEE PAHEPLPAGL PGRPPALCAG CPHRLVFKEL SRMKAVVTGD IGCYTLGALP PLSAMDTTID MGASVSMSHG FELAWAGTDH RPVVGVIGDS TFAHSGLSAL ISTVYNQGRG TVCVLDNRTT AMTGRQGNPF NGETLQGRLS RELDLESVVR AIGVEDVRTV DPNDAKAVRR ALKEAVASEE LSVLVFRSPC VLIDRHREPA YAVTDACTAC GVCSTLGCPA IAKDPANDHA LIDAAQCIGC GQCAQYCAWN AIAQPAGEKG GVA
|
| |