Gene Elen_0633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0633 
Symbol 
ID8414923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp808457 
End bp809458 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content50% 
IMG OID645023610 
Productglycosyl transferase family 2 
Protein accessionYP_003181007 
Protein GI257790401 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.718934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGC TTGTCAGTGT TATCATCCCT GTTTATAACG TTGAGTCGTA CGTGTCGGAA 
TGCATAGAGA GCGTGATCGC TCAAACTTAT AGCAATATCG AGATCGTTGT AGTAAACGAC
GGATCGACAG ATGGATCGGG GTTTTCTTGC GATCAATACG CTTGTAGCGA TTCCCGTGTG
GTTGTGGTGC ACAAAGAGAA CGAAGGGCTG AGTGCTGCCC GAAACGCTGG AATAGCCGTT
TGTCGGGGTG ATTTCGTGGC CTTCGTTGAC GGCGATGATT TCGTGTCCCC TGTTTTCATT
GAGACGCTTA TGCATGCTAT CGAAGTGTGC GACTGTGAGA TAGCGGCGAT ACCTTGTGGT
ACGGCATTCG AGGACGGCTC TTCATGCGAG CTTGTTGCGA AGGCGGCCTT TATTCCCGAC
GCAAGGGTTA TGGATTCATA TACGGTTCAG AAGTTGATGC TCTATCAAAG GCTCGATACG
GGAGTTCCAT GGAGGCTTTA TGCTAGACGT ATACTCGGCG ATGCTCCTTT TGCTGTGGGG
TTGTATTACG AGGACCTGGC AAGCGTTTAC AAGTTCATTC ATGATGTAGA TCGTGTTGCG
TTGGTTGATT GCAGGGCCTT GTATGCGTAT CGTCTGCGGA AATCCGGAAT CATCAGTCAG
GCGTACAGTC CGATAAAAGC CTTATCTGCC ATTGAGGTTT CACGACGACT TAGCTCGGAT
ATGCAGGAAT GGTATCCGGA TCTAGCCGTT GCATCCGCAT CGCGCTGTTT TTCTCTTTGC
CGGATGGTTT ATGCACAGAT ACCGGTAGAA TACGAGCTTT CCAGTAAGTT CGAGAATGAT
CGTCGAGCTT TGTGGGGCGA ACTCAAGGAG AGAAGAAAGA TCGTTCTGAG TGATTCTTCG
GCTCGGAAGA GGGAACGACT TGCTGCAGCC ATTGCCTTGA TCGGCGAAGC TCCGTTTGCT
CTTTTTTGCC ATGCGTGTAG AAAAGCCGGT CTTCTCAGAT GA
 
Protein sequence
MNPLVSVIIP VYNVESYVSE CIESVIAQTY SNIEIVVVND GSTDGSGFSC DQYACSDSRV 
VVVHKENEGL SAARNAGIAV CRGDFVAFVD GDDFVSPVFI ETLMHAIEVC DCEIAAIPCG
TAFEDGSSCE LVAKAAFIPD ARVMDSYTVQ KLMLYQRLDT GVPWRLYARR ILGDAPFAVG
LYYEDLASVY KFIHDVDRVA LVDCRALYAY RLRKSGIISQ AYSPIKALSA IEVSRRLSSD
MQEWYPDLAV ASASRCFSLC RMVYAQIPVE YELSSKFEND RRALWGELKE RRKIVLSDSS
ARKRERLAAA IALIGEAPFA LFCHACRKAG LLR