Gene Elen_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2037 
Symbol 
ID8416348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2386616 
End bp2387725 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content60% 
IMG OID645025014 
Productglycosyl transferase family 2 
Protein accessionYP_003182390 
Protein GI257791784 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.537056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAG TCCCATCGCA CCTTTCCGTC ATCGTTCCCA TATTCAACGC CGAGCCGTAT 
CTCGAACAAT GCCTCGACAG CGTTTTGGCG CAGACGCACC GCGAGCTCGA CATCATCTGC
CTCAACGACG GCAGCACCGA CGGTTCGCTT GCCATCATGC AGGCATACGC CGATCGCGAT
GAGCGCATCC GCGTCATCGA CAAGCAGAAC CAAGGCTACG GCGCAACCTG CAACCGCGGT
CTTGAGGAGG CGCACGGCAC CTGGATATCC ATCGTCGAGC CCGACGACTG GATCGAGCCC
GGCATGTACG CCGACATGCT CGGCTTCGCA GCAACGCTTG ACGGCCCGGT GGACATCGTG
AAGACCCCGT ACTGGCGCAT CTGGATGCCC GACACTCCCG AGCAGCGCAA GCTTAACTGC
AGCTACCGCA ACCGCATCAA GCCAAGCCGG CAGCCTTTTG CAATCGGCGA TGCGGCGCAT
CTGCTGACCC ACCACCCGTC CATTTGGTCG GCTATCTATC GCAAGGAGTT TCTCGATGCT
CGCGGAATCC GCTTTCGCGA GTACCCGGGC GCCGGCTGGG CGGACAACCC GTTCCTCGTC
GAAACGCTGT GCCAAACGGC TCGCATCGCC TATCTGGACA CGCCGTACTA TTGCTATCGC
GAGGAGACGC CTGAGAAATC GAAGTCGTTC GCGCTGAACA ACACGCTGTT GCCCATAGAG
CGCTGGAACG ACATGATGGA CGTGCTTGAA AACCTCGGGA TGCGCGACGA AGCCGTGCTG
CGCGCCCATA ACAGCCGCGG GTTCACCTAT TTGAGCGGCA TCATCGAAGA AGTGCCCCTC
ACCAGAAGCG ACGTCCGCGA AGCAGCCACG CGCATGTTCG AGCGCATGGA CGCCAACCTC
GTGCTTTCGG ATGCGGAGAT ATCTCCCGGA TGCAAGCGGA TGTTTGCCGA CCTGCGCAGC
ATGCCCGAAC CCCGCATCAG CAGCATCCCC TACAGTTGGG GACTCGTAAA GCAGGGGCTG
TACAACTTGA AAAACGTCGG CCCTTCGTTC ACCTGGTACG CCATGAAAAG CTATTTCGCG
AAAAAGGGCA GCCGCGAAGG CAAGGCCTAG
 
Protein sequence
MKTVPSHLSV IVPIFNAEPY LEQCLDSVLA QTHRELDIIC LNDGSTDGSL AIMQAYADRD 
ERIRVIDKQN QGYGATCNRG LEEAHGTWIS IVEPDDWIEP GMYADMLGFA ATLDGPVDIV
KTPYWRIWMP DTPEQRKLNC SYRNRIKPSR QPFAIGDAAH LLTHHPSIWS AIYRKEFLDA
RGIRFREYPG AGWADNPFLV ETLCQTARIA YLDTPYYCYR EETPEKSKSF ALNNTLLPIE
RWNDMMDVLE NLGMRDEAVL RAHNSRGFTY LSGIIEEVPL TRSDVREAAT RMFERMDANL
VLSDAEISPG CKRMFADLRS MPEPRISSIP YSWGLVKQGL YNLKNVGPSF TWYAMKSYFA
KKGSREGKA