Gene Information |
Locus tag | Elen_2039 |
Symbol | |
ID | 8416350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2388748 |
End bp | 2389848 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645025016 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003182392 |
Protein GI | 257791786 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
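The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the reading under which the listed 1101 bp comes out) and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2388748
end_bp = 2389848

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1101
print(protein_length)  # 366
```

Both results match the Gene Length and Protein Length fields of the record.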
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.294118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.96458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGGC CGTCGGTCAG CGTTATCGTT CCCCTGTACA ATGCGGAAGC GTTTGCGGCG CAGTGCATTG ATAGCGTGTT GGCGCAAACG CTGCGCGAAT TCGAATTGAT CTGTGTCGAC GACGGTTCCG ACGACGGCAC ATGCGAGATC GTGGAGGAAC GTGCGCGCCG TGACGAGCGC GTGGTGCTCG TCAGGCAGGC TGCCAACGCA GGGCCCGGTG CGGCGCGCAA TGCCGGGTTA GACAGGGCGC GCGGCCGGTA CGTGTACTGT CTCGATGCGG ACGACTACCT TGAGCGCGAT ATGCTCGCCC GCTGCATCGA AGCCCTCGAC GGGACCGGTG CGGATATGGC GCTCGTGGCG TTTCGCACGT ACAACGAACA GGTGGGTAGG GTTTTTCCGG CCGAATGGGG CATGCGGCAC GAGGATACGT ACCCATCGTA TCCGAACGGT ACGTTTGCGT GGGAGACGGC TCCCGACTTG TTTTTCGAGA CCGTGCAGAA CGTGCCGTGG AACAAGGTGG TGCGCCGCGA GTTGCTGGAA GTGCGGAGGA TCCGCTTCCA GAACCTGCGT CTGACGGAGG ACCTCATGTA CTCGTTGCCC GCGGCAGTGG CAGCGTCGCG TGTTGTCCGG GTCGCCGAAC CGCTTGTGGT GCATCGGGAA TTCGCCGGCA CGAATGCTAT GGCCGACAAG GGGCGCTATC CCCTTGATTT CCTCGAAGCG TTCGCCGAAC TGCGTCGATG GCTGTGCGAT AACGGGGTGT ACGACTCCTT ACGCACAGCG TATCGAACCT GGTTGCTCGA TGCCGCGTAC TACAATCTAC CCACATACCG CGATTTCGAA GCGTTCGCCG TCACGTTCGA ACGACTGACC GCGGACGATC TGGGTGCGTA CGATCTGGCG GACTGCGATC CGGCGCAGGT GCGCGACCAT CGTCACCGTG CGTTGCTCGA AGCCCTCCAA ACGCTGTCGC GCGAGCGCTT TTTGCTGGCA TGCGCCAACA TCGAGGCGGC TGAGGTGCAG GAGCAAAAGT GCGGATTCCA GAATATTCAG ACATCGTTGC GCTGGCTTTT CAGCCGCGCG CGCGATCGGA TGCGAGGATA G
|
Protein sequence | MPGPSVSVIV PLYNAEAFAA QCIDSVLAQT LREFELICVD DGSDDGTCEI VEERARRDER VVLVRQAANA GPGAARNAGL DRARGRYVYC LDADDYLERD MLARCIEALD GTGADMALVA FRTYNEQVGR VFPAEWGMRH EDTYPSYPNG TFAWETAPDL FFETVQNVPW NKVVRRELLE VRRIRFQNLR LTEDLMYSLP AAVAASRVVR VAEPLVVHRE FAGTNAMADK GRYPLDFLEA FAELRRWLCD NGVYDSLRTA YRTWLLDAAY YNLPTYRDFE AFAVTFERLT ADDLGAYDLA DCDPAQVRDH RHRALLEALQ TLSRERFLLA CANIEAAEVQ EQKCGFQNIQ TSLRWLFSRA RDRMRG
|
| |
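The gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to check a listed nucleotide sequence against the listed protein sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 and last 21 nucleotides of the record are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; a stop codon is rendered as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three and last three blocks of the Gene sequence field.
first_30 = "ATGCCCGGGC CGTCGGTCAG CGTTATCGTT"
last_21 = "CGCGATCGGA TGCGAGGATA G"

print(translate(first_30))  # MPGPSVSVIV
print(translate(last_21))   # RDRMRG*
```

The two translations agree with the start (MPGPSVSVIV) and end (RDRMRG) of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_percent` would extend the same check to the whole record and to the GC content field.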