Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2551 |
Symbol | |
ID | 8416875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2983336 |
End bp | 2984283 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025532 |
Product | Shikimate dehydrogenase substrate binding domain protein |
Protein accession | YP_003182895 |
Protein GI | 257792289 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA CGCAAGCCGA ACGGCCGGCC GAGAGCCTGT ACCTCTTGGG ACATCCCATT GCGCACTCTT CGTCGCCCGC GATGTACAAC GCCGTGTACG AGCGCTTAGG ATTGCCGTGG CGCTACGGTT TGGCTGATTG CGCGACCGAG GAAGAGGCGC GCTCGTTCGT GGAGGCGCGC GGCTTCCTGT CCATCAACAT CACCACGCCG TACAAGCCGC TCGCGTTCGA GGCCGCCACG GCGAAGGCTG CCACGGCGAA GCTCGCCCAG GGCGCGAACG TGCTGGTGAA GAAGGGTGAC GCGCTCATCG GCTTCAACAC CGACGGCCAG GGCTGCGTGG CGTACTTGGA GCGCACGGGC TTTTGCTTCG CAGGCAAGCG CGTGGCCGTG TGCGGCACGG GCCCCACGGC GCTGTCCATC CTGCACGCAT GCGCCATCGC CGGGGCGGAC GTGGCCATGC TGGTCGGGCG CGACAAGGAG CGCTCCCGCA AGGTGCTCGA AGGCTACGTC GAGCGGTTCG GCCTGTTGGC AAATGCCACG GTGGACCTGC CGGCAGCGCA GGCGCACCAT CGCAGCTTCC GCACGGCTTA CGAGCGCACC ACGTTCAAGT TCGGCAGCTA CACCACGTCC ACGAAGGCGC TGGCCGCCGC CGATCTCGTG GTGAACGCGA CGCCGCTCGG CATGAACGAG GGCGACGGCT CGCCGTTCGA CGTCGAGCTT CTGAGCGCGG GGCAGACCGT GTTCGACGCG GTATACGGCC ACGGCGAGAC GGCGCTCGTG CGCGCCGCGC GCGAAGCGGG ATGTACGGTG CACGACGGCG CCGGGATGCT GGTAGCGCAG GCGGTGGCTA CCGTGCACGC CGTGTGCGAC CTCGCCGAGG TCGACGTCGC CCTGTCTGAC GACGAGCTGT TCGCCTTGAT GGCGGAAGCG GCAGGGTTCG ACCTGTAG
|
Protein sequence | MTDTQAERPA ESLYLLGHPI AHSSSPAMYN AVYERLGLPW RYGLADCATE EEARSFVEAR GFLSINITTP YKPLAFEAAT AKAATAKLAQ GANVLVKKGD ALIGFNTDGQ GCVAYLERTG FCFAGKRVAV CGTGPTALSI LHACAIAGAD VAMLVGRDKE RSRKVLEGYV ERFGLLANAT VDLPAAQAHH RSFRTAYERT TFKFGSYTTS TKALAAADLV VNATPLGMNE GDGSPFDVEL LSAGQTVFDA VYGHGETALV RAAREAGCTV HDGAGMLVAQ AVATVHAVCD LAEVDVALSD DELFALMAEA AGFDL
|
| |