Gene Elen_1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1210 
Symbol 
ID8415501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1451118 
End bp1452452 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content59% 
IMG OID645024173 
Productglycosyl transferase family 2 
Protein accessionYP_003181569 
Protein GI257790963 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.47071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.038027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGACC AGTTTTTTTC GCAGATCTCG TTCGTTGACA TATTCAACTT CTGCGTGTTT 
CTCACGTTCA CGATCTGCTA CACGTATCAG CTCTACTACG TGTTCGTGGT GCTGACGCGC
AAGCCCAAGG AGCTCACGGC GAAGAAGAAC CACAAGTTCG CCGCAGTCAT CTCGGCTCGC
AACGAGAGCG CCGTCATCGG CGACCTCATC CACTCCATCA AGGTGCAGAA CTATCCGTCC
GAGCTCATCG ACGTGTTCGT CATCGCCGAC AACTGCACGG ACGACACCGC GCGCGTGGCC
CGCGAGGCAG GCGCCATCGT CTTTCCCCGC AGCAACGACA AGGAAGTTGG CAAGGGCTAC
GCGCTCGACT ACGGCTTCCA GTGCATTCGC GAGCGCTACG CCGACAAGGG TTACGAGGCG
TACTTCGTGT TCGACGCCGA CAACGTGCTG GATGTGAACT ACTTCCGCGA GATGAACAAG
ACCTTCGACA ACGGAGCGAA GGCCTCGACC AGCTATCGAA ACTCCAAGAA CTACGACTCC
AACTGGATAT CCGCGGGCTA CGCCGTGTGG TTCCTGCGCG AGGCGAAGTT CCTGAACCAG
GCGCGTCTCA CGCTGAACAC CAGTTGCGCC GTGTCGGGCA CGGGCTTCTT CATAGCCGCC
GACATCATCG AGAAGAACGG CGGTTGGAAG TGGCACCTGC TCACCGAGGA CATCGAGTTC
TCTGCGAACA GCATTCTCGA GGGCACGCGC ATCAGCTACA CGCCCACGGC CATCCTCTAC
GATGAGCAGC CCATCACGTT CCGCGACTCG TGGAACCAGC GCTTCCGCTG GGCGAAGGGC
TTCTACCAGG TGTTCTGGCA CTACGGTGCC CGCCTGGCGA AAGGCATCGC CGTGAACCCC
AAGGGCGCGC GCTTCGCTTG CTACGACATG CTCATGACCA TCGCGCCGGG CATGCTGCTT
ACCATCGTGT CGGTGCTGTT CAACGCCATC ATCGTGTTCC TCAGCCTCAC CGGAGCCATG
TCCACGGGCA TCATGGTTGC CTCCTCGCTG TCGTCCATCT TGTTCTGCCT GCTGAACTAC
TTCATCTTCA TGTTCATGTT CGGCGTGCTG ACCACGTTCG TGGAATGGGA CTCCATCCGT
TCCACCACGG GCAAGAAGGT TCTGTACATG TTCACGTTCC CCGTGTTCAT GATGACCTAT
ATCCCCATCG CGCTGGTCGC GCTCGTGAAG AAGTGCAACT GGAAGCCCAT CAAGCACAGC
ATCTCGGTTG ATGTGGCCGA GCTCTCCGAC GCGGCAAGCG CCGCACCCCA AAAGCAGCGT
GAGCGCACCA TGTAG
 
Protein sequence
MLDQFFSQIS FVDIFNFCVF LTFTICYTYQ LYYVFVVLTR KPKELTAKKN HKFAAVISAR 
NESAVIGDLI HSIKVQNYPS ELIDVFVIAD NCTDDTARVA REAGAIVFPR SNDKEVGKGY
ALDYGFQCIR ERYADKGYEA YFVFDADNVL DVNYFREMNK TFDNGAKAST SYRNSKNYDS
NWISAGYAVW FLREAKFLNQ ARLTLNTSCA VSGTGFFIAA DIIEKNGGWK WHLLTEDIEF
SANSILEGTR ISYTPTAILY DEQPITFRDS WNQRFRWAKG FYQVFWHYGA RLAKGIAVNP
KGARFACYDM LMTIAPGMLL TIVSVLFNAI IVFLSLTGAM STGIMVASSL SSILFCLLNY
FIFMFMFGVL TTFVEWDSIR STTGKKVLYM FTFPVFMMTY IPIALVALVK KCNWKPIKHS
ISVDVAELSD AASAAPQKQR ERTM