Gene Elen_2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2441 
Symbol 
ID8416765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2861584 
End bp2862639 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content55% 
IMG OID645025423 
Productglycosyl transferase group 1 
Protein accessionYP_003182786 
Protein GI257792180 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.719271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCC CCAGTCTAGA ATCGCATGGG GGTATGGCAT CGTGCGCTTC CACGCTTTTA 
AACGGAGGCC TGGACAGAAG GTGCAATGTT CGCTATCTCG CGACGACCGA GGAGGGATGC
AAAGCACGCA AGTTGGCCTG CGGACTGAGC ACCCTGTTTG TGTTCTCCAG AGAGGCGAAG
GACTGCGATC TGGTTCATAT TCATTTTTCG TACGGCGTAA GCATGACCCG CAAAGCTCTA
TTCGTCCGAC GTGCTAAGAA CATGGGTAAG AAAGTGATTC TTCATTCCCA TTCGAGCGCA
ATGGAGCGGG CAATCCTTGA AGGCGATTCG GGATCGAAAA ACGAGATTAA GAAGTTCCTG
TCGCTCGCGG ATGCTCTGAT CGTCCTTTCC TCGAAGTGGA AGGACCTGAT CTGCGACGAG
CTGGACATCA AACGCTCGAT CGTCCACGTG ATCCCGAACG GCGTTCCATT GGGCGATCCG
AGCGCAAAGC CCGATCACGA TGAAAGATCC TGCTGCAACA TCTTGTTCTT AGGCAGGCTG
GAGGAGGAAA AGGGCGTCGG TACGCTGATA GAAGCCACGG GAGCCCTCGT TCGAAACGGC
GCCGTAATCG AACTCGTGCT GGCCGGATCG GGAAGCGACG AGGAGACTCA GAGATACCAG
CTTCTGGCTC GACGGGAAGG CGTAAATTGC AGCTTCGTGG GATGGGTGGA TTCGGAAAAG
AAGAGGGATC TGCTGGTCGA GGCCGACGTG TTTGCCCTTC CTTCAAAGCG AGAGGTTCTT
CCCATATCCC TTCTCGAAGC CATGGGAGCC GGCGTCGCTT CGGTCGCTTC CGATTGCGGA
TCCGTCCCAG AAGTCATACA CCACGGACGA AACGGGATGC TTTGCGAACC GGGGGATCCC
GAATCGCTCC GCACATGCCT GGCTTTGCTC GTGCAGGACC CTGTGTTGCG CAAGAGGCTC
GCAATACAGG GATTCGAAAC CGTAAAAAAG GGCTACTCGG TCGAAAACTC AGTCGACGCG
TTGCTTGGTT TGTACAAAGA GGTGCTTCAT GGATGA
 
Protein sequence
MLGPSLESHG GMASCASTLL NGGLDRRCNV RYLATTEEGC KARKLACGLS TLFVFSREAK 
DCDLVHIHFS YGVSMTRKAL FVRRAKNMGK KVILHSHSSA MERAILEGDS GSKNEIKKFL
SLADALIVLS SKWKDLICDE LDIKRSIVHV IPNGVPLGDP SAKPDHDERS CCNILFLGRL
EEEKGVGTLI EATGALVRNG AVIELVLAGS GSDEETQRYQ LLARREGVNC SFVGWVDSEK
KRDLLVEADV FALPSKREVL PISLLEAMGA GVASVASDCG SVPEVIHHGR NGMLCEPGDP
ESLRTCLALL VQDPVLRKRL AIQGFETVKK GYSVENSVDA LLGLYKEVLH G