Gene Elen_2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2038 
Symbol 
ID8416349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2387722 
End bp2388744 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content62% 
IMG OID645025015 
Productglycosyl transferase family 2 
Protein accessionYP_003182391 
Protein GI257791785 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.723912 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCTG TTTCGATCAT CGTTCCCATG TACAACGTGC GCCCGTATAT CGGCGAATGC 
CTTGCGTCGC TCAAGCGGCA GACGTTTTCT GATTTCGAGG CTATCTGCAT CGACGACGGG
TCGACGGACG GCACGCTCGA AGCCGCGCGC AATGCGGCAG GGGATGACGA ACGATTCGTT
TTCGTCGGCC AAGTGAATGC GGGACAGTCT GTCGCGCGCA ATGTGGGGAT CGGCCTTGCG
ACAGGGCGTT CCGTCCTTTT CCTGGACTCC GACGACTACT ATGAGGATCG AGCGCTGCAG
ACGCTGTTCG ATCATGCCGA GCGCGATGAG CTCGACGTCC TGTTCTTCTC CGCTCGCACG
GTGTACGAGG ATGCGGAGTC CCGTGCGCGG TATCGCGACG ATTATGAGGA TCGAGCGGCC
ATCGAAGGCA TTATGACAGG GCAGGAGCTT TTGGAGCGGT TTGCCGAGAA CCGCTCGTTT
TCGGTTTCGC CGGCGCTTCA GCTGATTAGG CGCGACTTCC TGACGGAATC GGGCATCTCG
TTCTTCGAGG GCATCGTGCA CGAGGACAAC TTGTTCACGG GTCTTATTCT TGCCCAAGCC
TCGCGTGCGG CGTTCCTTAA CGAGCAGCTG TATGTCCGTA GGGTGCGTGC GGGCTCTACG
ATGACGTCGC AGCGCGAGCT GCGCCACGTG TACGGGCATT TCAAGAGCGC CTACGAGCTG
GAAGCCTGGT TGCGCGCACA TGTGGCGTCC TGTCGTCCTG GGTTCGCCCG CGCGCTGCTG
CGCCATATCG CCTTTTGCTA CGATAGGGCT GCGTTCGACG CGCTGTCGCT CGGTGGGGAG
GGGCTTGAGG GTTGCGCCGG CTCGCTCGAC GCTGACGAGG AGCTGTCGTT TCGCCTGCAC
GTGATCGAGC ATGCGAACGA GATGCGCGCC GTACGCTCCG AGTACGCCGA TTCGACGTCG
TATAAAGTGG GCAGCGCCGT CATGGCGCTG CCGTGCTGGG TGAAGGATCG ACTCGGGCGC
TAG
 
Protein sequence
MAAVSIIVPM YNVRPYIGEC LASLKRQTFS DFEAICIDDG STDGTLEAAR NAAGDDERFV 
FVGQVNAGQS VARNVGIGLA TGRSVLFLDS DDYYEDRALQ TLFDHAERDE LDVLFFSART
VYEDAESRAR YRDDYEDRAA IEGIMTGQEL LERFAENRSF SVSPALQLIR RDFLTESGIS
FFEGIVHEDN LFTGLILAQA SRAAFLNEQL YVRRVRAGST MTSQRELRHV YGHFKSAYEL
EAWLRAHVAS CRPGFARALL RHIAFCYDRA AFDALSLGGE GLEGCAGSLD ADEELSFRLH
VIEHANEMRA VRSEYADSTS YKVGSAVMAL PCWVKDRLGR