Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1626 |
Symbol | |
ID | 8415925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1928171 |
End bp | 1929061 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024595 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_003181983 |
Protein GI | 257791377 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0389231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000523491 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGCAGC CTCGGTTCGG CCGGATGATC CCGGCCATGG TCACGCCCTT CGATGAGAAT CGCGAGCTCG ATCTTGACAA GGCTCAGGCG CTTGCCGCGC GCCTCGTCGA CGGCGGCAGC GACTCCCTCA TCATCAACGG CACGACGGGG GAGAGCCCGA CCGTGTTCTA CCCGCAGAAG ATGGAGCTGT TCCGCGCTGT GGTGGAGGCT GTGGGCAACC GCGTCCCCGT CATCGCGAAC GTGGGCGACA ACTGCACGGC CGACACGGTG GGCTTCGCCC GCGACGTGGC CGAGTTGGGT GTCGACGGCT TTATGTGCGT GGTGCCCTAC TACAACAAGC CTCCCCAAGA GGGCATGTAC CGTCACTTCC GCACCATCGC CGACGCGGTG GAGCTGCCCA TCATCCTGTA CAACATTCCG GGCCGCTGCG TGGTGAACAT GGAAGCCGAG ACCACGCTGC GCCTCGCCCA CGACTGCGAC AACGTCGTGG CCGTGAAGGA GGCGTCGGGC AAGATGGATC AGGTCGAGGC CATCGTCGCA GGCGCTCCGG ACGGCTTCGT CGTGTACTCC GGCGACGACT CCGCCACGCT CGACGTCATG AAGCGAGGCG GCGCCGGAGT CATCTCCACC ATCGGCAACG TGTCTCCCGC TCGCATGAAG GAGATCGTCG AGCTGGCGGC TGCAGGTGAC TGGGAGGCCG CCGAGGCGGC CAACGAGCGC TTGATGCCGC TCATGACGGG ACTGTTCGAA ACGTCGAACC CCATTCTCGT CAAGGAAGCG CTCAAGCTGC TGGGCTTCCC CGTGGGCGGC GTGCGCCTGC CGCTCGTGGA TGCCACGCCC GAGCAGTCCG AGCGCCTGGC CGCCACCATG CGCGAGGTGG GCGTGCTGTA G
|
Protein sequence | MSQPRFGRMI PAMVTPFDEN RELDLDKAQA LAARLVDGGS DSLIINGTTG ESPTVFYPQK MELFRAVVEA VGNRVPVIAN VGDNCTADTV GFARDVAELG VDGFMCVVPY YNKPPQEGMY RHFRTIADAV ELPIILYNIP GRCVVNMEAE TTLRLAHDCD NVVAVKEASG KMDQVEAIVA GAPDGFVVYS GDDSATLDVM KRGGAGVIST IGNVSPARMK EIVELAAAGD WEAAEAANER LMPLMTGLFE TSNPILVKEA LKLLGFPVGG VRLPLVDATP EQSERLAATM REVGVL
|
| |