Gene Elen_1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1212 
Symbol 
ID8415503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1454080 
End bp1455087 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content66% 
IMG OID645024175 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003181571 
Protein GI257790965 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000663729 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0536092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTT ATAAAACGCT CGCGTCCTGC TGCATGGCGG TCGCGTGCAT GGCGGCCCTG 
CTCGTCGTTC TGACGGGCTG CTCATCGCAG CAGAGCTACA CTCCCCCCGA GAAGACGCCC
ACGCTATCCT CGCCGACCAT CGGCAAGGAC GGTACGCTGC GCGTCGGTGT GAACACCGAC
AACCAGCCCC TGGCGGGACA GCCTTCCTCC TCGTCCAAAA TCGTCGGCAT CGACGTGGAC
GTGGCGGCGG CGCTGGCTGA CAGCTTCGGG CTGAAGCTCG AGGTCGTCAA CGTGGGATCG
GATGCCGAAT CGGCTCTCAA AGAGGGAACG GTCGACATCG TCATGGGCAT CGACAAGTCC
GACAGCAGCA CCTCGTTCTG GAAGTCTGAC GCGTACCTGC CTACGGCCGT GGCGTTGTTC
TCCGCGCCGT CCAACACGCA GGTTCCCACG AACGTCGTCG AGACGAAGAT CGCCGCGCAG
GTGTCGTCGA AGAGCGCTTG GGCGGTGACG AACGAATTCG ACAAGGCAAC CTTCTCCACG
ACCGACGACC TCAAGAGCGC GTTCGCCGAG CTGGCCTCGG GCCAGGTGCA GTACGTGGCG
GCCGATGCCA TCATCGGGAC GTACGCGGCG CACAGCGCGG GCGACGACGT GCATATCGTG
GCGCTCATGC AGCAGGCGGG CGGCTACGGC GTGGGCGTGT CGGATGCGAA CACCGATCTC
AAGCAAGCGG TCTCCGAAGC CCTCGCCACG CTGACCGGCA ACGGCACCAT CGGCGTCATC
GAGACGAAGT GGCTGGGTAC CGCGCTCGAC CTTTCGTCCA CGCCGCTGAC TGCCGGCGCC
ACCAAGTCCA CGGACGCGGG CGCGACCGTT GCTTCGAAGG AGCCGAAAGA CGAGAGCGAA
GGCGAGAACG CTGACGGGGA CGCTGCTCCT GCCGACGAAG GCACGGGCGC CGGCGACGAG
GTGAACGCGG GCGAGAACGC CGTGCAGCCT GGAGACGTCG CTGCTTAG
 
Protein sequence
MKRYKTLASC CMAVACMAAL LVVLTGCSSQ QSYTPPEKTP TLSSPTIGKD GTLRVGVNTD 
NQPLAGQPSS SSKIVGIDVD VAAALADSFG LKLEVVNVGS DAESALKEGT VDIVMGIDKS
DSSTSFWKSD AYLPTAVALF SAPSNTQVPT NVVETKIAAQ VSSKSAWAVT NEFDKATFST
TDDLKSAFAE LASGQVQYVA ADAIIGTYAA HSAGDDVHIV ALMQQAGGYG VGVSDANTDL
KQAVSEALAT LTGNGTIGVI ETKWLGTALD LSSTPLTAGA TKSTDAGATV ASKEPKDESE
GENADGDAAP ADEGTGAGDE VNAGENAVQP GDVAA