Gene Information |
Locus tag | Elen_2064 |
Symbol | |
ID | 8416380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2429696 |
End bp | 2431021 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025045 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003182416 |
Protein GI | 257791810 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
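The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2429696, 2431021)
print(length_bp)                  # 1326
print(protein_length(length_bp))  # 441
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.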
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.27669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0343903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAATG CCAACGAAGG GAGTTCTATG ACGCAGATCG ACGCGGCACG CGCCGGCACC ATCACCCGCG AAATGGCCAT CGTGGCCGAG AAGGAGGGGC GCGACCCCGA GTTCATCCGC GAGGGGGTTG CGGCCGGGCG CATCGCCATC CCCGCCAACA TCCACCATAC CAGCTTGTCG CCCGAAGGCG TGGGCGGCGG CCTGCGCACG AAGGTGAACG TGAACCTGGG CATTTCGGGC GACGTCGCCG ACGAGGCCGA GGAATGGAAG AAGGTGGACG TGGCGCTGGA GCTGGGCGCC GAGGCCATCA TGGACTTGTC GAACACCGGC AAGACGCACG CGTTCCGCAG CGCGCTCATC GAGAAGTCGC CGGCCATGAT CGGCACCGTG CCCATGTACG ACGCCATCGG CTATCTGGAG AAGCCGCTGA TCACCATCAC GGTGGAGGAC TTCCTCGACG TGGTCCGCGC GCATGCAGAG GACGGCGTGG ACTTCGTCAC CATCCACGCG GGCATGAATC GGCGCACCAT CGAGTCGTTT CGCGAGACGG GCCGCCTCAC GAACATCGTG AGCCGCGGCG GGTCGCTCAT CTTCGCGTGG ATGGAGGCCA CCGGCAACGA GAACCCCTTC TACGAGTTCT ACGACGAGGT GCTGGCCATC CTGCACGAGC ACGACGTGAC CATCAGCCTG GGCGATGCCA TGCGGCCCGG CTCCTCGTAC GACGCCACCG ACGCGGGGCA GATCGCCGAG CTCATCGAGA TCGGCAAGCT CACGAAGCGC GCATGGGACG CGGGCGTGCA GGTGATGGTG GAAGGCCCCG GGCATATGGC GCTCGATGAG ATCGCCGCCA ACATGAAGCT GGAAAAGCGG CTGTGCCACG ACGCGCCCTT CTACGTGCTG GGGCCGCTGG TCACCGACAT CGCGCCGGGC TACGACCATA TCACGGCCGC CATCGGGGGC GCGGTTGCCG CGGCTTCGGG CGCCGACTTC CTCTGCTACG TCACGCCGGC CGAGCATCTG CGCCTGCCCG ACGCCGCCGA CGTGCGCGAG GGCCTCGTGG CCACGAAGAT CGCCGCGCAT GCGGCCGACA TCGCGCGCGG GGTGCCCGGA GCGCGCGACC GCGACAACCG CATGAGCGAC GCTCGGCGCC GCGTGGACTG GGAGGGCATG TTCGCCGAGG CGCTCGATCC GGTCAAGGCG CGCCGCTACT TCGAGAGCGC TCCGCCCTCG ACCGACGGCA CCTGCACCAT GTGCGGCGAG ATGTGCGCCA TGCGAACCGT GAACACCATC ATGGACGGCC TGACGGTCGA TCTCGGGAAG GAGTAG
Protein sequence | MGNANEGSSM TQIDAARAGT ITREMAIVAE KEGRDPEFIR EGVAAGRIAI PANIHHTSLS PEGVGGGLRT KVNVNLGISG DVADEAEEWK KVDVALELGA EAIMDLSNTG KTHAFRSALI EKSPAMIGTV PMYDAIGYLE KPLITITVED FLDVVRAHAE DGVDFVTIHA GMNRRTIESF RETGRLTNIV SRGGSLIFAW MEATGNENPF YEFYDEVLAI LHEHDVTISL GDAMRPGSSY DATDAGQIAE LIEIGKLTKR AWDAGVQVMV EGPGHMALDE IAANMKLEKR LCHDAPFYVL GPLVTDIAPG YDHITAAIGG AVAAASGADF LCYVTPAEHL RLPDAADVRE GLVATKIAAH AADIARGVPG ARDRDNRMSD ARRRVDWEGM FAEALDPVKA RRYFESAPPS TDGTCTMCGE MCAMRTVNTI MDGLTVDLGK E
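The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming that layout, the Python below strips the spacing from a pasted gene sequence and runs a few basic sanity checks (GC fraction, ATG start, terminal stop codon). The helper names are hypothetical. The stop-codon set is the standard TAA/TAG/TGA set, which translation table 11 shares; checking only for ATG is a simplification, since table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3, seq starts with ATG, and ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input for illustration only; this is not the record's sequence.
toy = clean_sequence("ATGGCCGGC TAG")
print(len(toy), round(gc_fraction(toy), 2), looks_like_cds(toy))
```

To apply it to this record, paste the Gene sequence field into `clean_sequence`; the cleaned length should agree with the Gene Length field, and the GC fraction can be compared against the GC content field.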