Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2037 |
Symbol | |
ID | 8416348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2386616 |
End bp | 2387725 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645025014 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003182390 |
Protein GI | 257791784 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.537056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAG TCCCATCGCA CCTTTCCGTC ATCGTTCCCA TATTCAACGC CGAGCCGTAT CTCGAACAAT GCCTCGACAG CGTTTTGGCG CAGACGCACC GCGAGCTCGA CATCATCTGC CTCAACGACG GCAGCACCGA CGGTTCGCTT GCCATCATGC AGGCATACGC CGATCGCGAT GAGCGCATCC GCGTCATCGA CAAGCAGAAC CAAGGCTACG GCGCAACCTG CAACCGCGGT CTTGAGGAGG CGCACGGCAC CTGGATATCC ATCGTCGAGC CCGACGACTG GATCGAGCCC GGCATGTACG CCGACATGCT CGGCTTCGCA GCAACGCTTG ACGGCCCGGT GGACATCGTG AAGACCCCGT ACTGGCGCAT CTGGATGCCC GACACTCCCG AGCAGCGCAA GCTTAACTGC AGCTACCGCA ACCGCATCAA GCCAAGCCGG CAGCCTTTTG CAATCGGCGA TGCGGCGCAT CTGCTGACCC ACCACCCGTC CATTTGGTCG GCTATCTATC GCAAGGAGTT TCTCGATGCT CGCGGAATCC GCTTTCGCGA GTACCCGGGC GCCGGCTGGG CGGACAACCC GTTCCTCGTC GAAACGCTGT GCCAAACGGC TCGCATCGCC TATCTGGACA CGCCGTACTA TTGCTATCGC GAGGAGACGC CTGAGAAATC GAAGTCGTTC GCGCTGAACA ACACGCTGTT GCCCATAGAG CGCTGGAACG ACATGATGGA CGTGCTTGAA AACCTCGGGA TGCGCGACGA AGCCGTGCTG CGCGCCCATA ACAGCCGCGG GTTCACCTAT TTGAGCGGCA TCATCGAAGA AGTGCCCCTC ACCAGAAGCG ACGTCCGCGA AGCAGCCACG CGCATGTTCG AGCGCATGGA CGCCAACCTC GTGCTTTCGG ATGCGGAGAT ATCTCCCGGA TGCAAGCGGA TGTTTGCCGA CCTGCGCAGC ATGCCCGAAC CCCGCATCAG CAGCATCCCC TACAGTTGGG GACTCGTAAA GCAGGGGCTG TACAACTTGA AAAACGTCGG CCCTTCGTTC ACCTGGTACG CCATGAAAAG CTATTTCGCG AAAAAGGGCA GCCGCGAAGG CAAGGCCTAG
|
Protein sequence | MKTVPSHLSV IVPIFNAEPY LEQCLDSVLA QTHRELDIIC LNDGSTDGSL AIMQAYADRD ERIRVIDKQN QGYGATCNRG LEEAHGTWIS IVEPDDWIEP GMYADMLGFA ATLDGPVDIV KTPYWRIWMP DTPEQRKLNC SYRNRIKPSR QPFAIGDAAH LLTHHPSIWS AIYRKEFLDA RGIRFREYPG AGWADNPFLV ETLCQTARIA YLDTPYYCYR EETPEKSKSF ALNNTLLPIE RWNDMMDVLE NLGMRDEAVL RAHNSRGFTY LSGIIEEVPL TRSDVREAAT RMFERMDANL VLSDAEISPG CKRMFADLRS MPEPRISSIP YSWGLVKQGL YNLKNVGPSF TWYAMKSYFA KKGSREGKA
|
| |