Gene Information |
Locus tag | Elen_2441 |
Symbol | |
ID | 8416765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2861584 |
End bp | 2862639 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645025423 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003182786 |
Protein GI | 257792180 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
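The coordinate and length fields in this record agree with one another, and a short sketch can show the arithmetic. It assumes 1-based inclusive coordinates and that the CDS length includes a single stop codon; neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above (Start bp, End bp).
length = gene_length_bp(2861584, 2862639)
print(length)                     # 1056
print(protein_length_aa(length))  # 351
```

Both results match the Gene Length and Protein Length fields listed for Elen_2441.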
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.719271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGCC CCAGTCTAGA ATCGCATGGG GGTATGGCAT CGTGCGCTTC CACGCTTTTA AACGGAGGCC TGGACAGAAG GTGCAATGTT CGCTATCTCG CGACGACCGA GGAGGGATGC AAAGCACGCA AGTTGGCCTG CGGACTGAGC ACCCTGTTTG TGTTCTCCAG AGAGGCGAAG GACTGCGATC TGGTTCATAT TCATTTTTCG TACGGCGTAA GCATGACCCG CAAAGCTCTA TTCGTCCGAC GTGCTAAGAA CATGGGTAAG AAAGTGATTC TTCATTCCCA TTCGAGCGCA ATGGAGCGGG CAATCCTTGA AGGCGATTCG GGATCGAAAA ACGAGATTAA GAAGTTCCTG TCGCTCGCGG ATGCTCTGAT CGTCCTTTCC TCGAAGTGGA AGGACCTGAT CTGCGACGAG CTGGACATCA AACGCTCGAT CGTCCACGTG ATCCCGAACG GCGTTCCATT GGGCGATCCG AGCGCAAAGC CCGATCACGA TGAAAGATCC TGCTGCAACA TCTTGTTCTT AGGCAGGCTG GAGGAGGAAA AGGGCGTCGG TACGCTGATA GAAGCCACGG GAGCCCTCGT TCGAAACGGC GCCGTAATCG AACTCGTGCT GGCCGGATCG GGAAGCGACG AGGAGACTCA GAGATACCAG CTTCTGGCTC GACGGGAAGG CGTAAATTGC AGCTTCGTGG GATGGGTGGA TTCGGAAAAG AAGAGGGATC TGCTGGTCGA GGCCGACGTG TTTGCCCTTC CTTCAAAGCG AGAGGTTCTT CCCATATCCC TTCTCGAAGC CATGGGAGCC GGCGTCGCTT CGGTCGCTTC CGATTGCGGA TCCGTCCCAG AAGTCATACA CCACGGACGA AACGGGATGC TTTGCGAACC GGGGGATCCC GAATCGCTCC GCACATGCCT GGCTTTGCTC GTGCAGGACC CTGTGTTGCG CAAGAGGCTC GCAATACAGG GATTCGAAAC CGTAAAAAAG GGCTACTCGG TCGAAAACTC AGTCGACGCG TTGCTTGGTT TGTACAAAGA GGTGCTTCAT GGATGA
|
Protein sequence | MLGPSLESHG GMASCASTLL NGGLDRRCNV RYLATTEEGC KARKLACGLS TLFVFSREAK DCDLVHIHFS YGVSMTRKAL FVRRAKNMGK KVILHSHSSA MERAILEGDS GSKNEIKKFL SLADALIVLS SKWKDLICDE LDIKRSIVHV IPNGVPLGDP SAKPDHDERS CCNILFLGRL EEEKGVGTLI EATGALVRNG AVIELVLAGS GSDEETQRYQ LLARREGVNC SFVGWVDSEK KRDLLVEADV FALPSKREVL PISLLEAMGA GVASVASDCG SVPEVIHHGR NGMLCEPGDP ESLRTCLALL VQDPVLRKRL AIQGFETVKK GYSVENSVDA LLGLYKEVLH G
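The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC fraction calculation; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full entry.

```python
def clean_sequence(field: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGCTCGGCC CCAGTCTAGA"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Passing the complete Gene sequence field through the same two functions would give values to compare against the Gene Length and GC content fields of the record.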