Gene Elen_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0084 
Symbol 
ID8414365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp110532 
End bp112118 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content65% 
IMG OID645023061 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_003180467 
Protein GI257789861 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.689632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTCTGA CCGAAACCCT GGCCTTGCTC GTTGAAAAAC GCGATTGGTT TTTCGAGCTC 
CTTCTGCAGC ATATCGGCAT CTCGCTCATA TCCATTGCAC TGGCGGCGCT CATCGGACTG
TCGCTGGGAA TCGCCATCGC CACATGGAGG CGAGGGGCGA AGCCCGTGCT TGCGCTGGTC
AATTTCGTGT ACACCATCCC ATCCATTGCG CTGTTCGGCT TCCTCATCCC CATCACGGGC
ATCGGCGACC CTACCGCCAT CGTGGCGCTC ACCGTGTACG CGCTGCTGCC GATGGTGCGC
AACACCTACA CCGGCCTCAC CACCATCGAC CCTGCCATCA TCGAGGCGGC GCGCGGCATG
GGCTCCACCG ACCGCCAGCT GCTGTACCGT ATCGAACTGC CGCTGGCCGC GCCCGTCATC
ATGAGCGGCA TCCGCAACAT GGCCACCATG ACCATCGCGC TGGCCGGCAT CGCCACGTTC
ATCGGCGCAG GCGGCTTGGG CGTGGCCATC TTCCGCGGCA TCACGACGTA CAACCTGGCC
ATGACCCTGG CGGGCAGCGT GCTGATCGCG CTGCTGGCCA TCGTGGTGGA CCTGCTGCTG
GGCTTAGCCG AGAAGTCCAC ACGACGCCAC CTGGAGCCTT CGAACGCCCG GCGACGGAAG
CGTTCCGGCG CGCGACGGGT TTCGCGTCGC AAGCTGGCCC CGGCCGTCGC GGCGGGCGCG
GCCGTGGTGC TGATAGCGGG CGGGGCCTTC GCGTTCGCGA ACCGCGGCGG CGGAGAGAAC
GTGGTGAACA TCGCCACGAA GCCCATGACC GAGCAGTACA TCCTCGGCGA GATGCTGAAC
ACGCTGATCG AGCACGATAC CGACCTCAAG GTGGAGCTCA CGCAGGGCGT GGGCGGCGGC
ACGTCGAACA TCGAGCCCGG CATGGAGAAG GGCGACTTCG ACCTCTACCC CGAGTACACG
GGAACGGGAT GGAACGCCGT CCTCAAGCAC GACGACACCT ACGACGAGTC GATGTTCGAC
ACGATGCAGC AGGAATACGA ATCGCAGCTA GGCCTCACCT GGGTGGGCAT GTACGGGTTC
AACAACACGT ACGGCCTCGC TGTGAGCCGC GACGCGGCCG AGCGCTACAA CCTGCGCACG
TATTCTGACC TGGCGGGCGC AGCGGGCGGC CTGACGCTGG GCGCCGAGCC CGACTTCTTC
GACCGGCAGG ACGGCTACCC CGGATTGCAG CAAGCCTACG GCATGAACTT CGGCACCACG
AGAGACATGG ACATCAGCTT GAAGTACCAG GCGCTGTTCG AGGGCCAGGT GGATGCCATC
GTCGTGTCCA CCACCGACGG GCAGGTGGCC GACGAGCGCC TCGTGGTGCT GGAGGACGAC
CGGCACTTCT ATCCGTCGTA TCTTTGCGGC AACGTCGTGC GACAGGACGC GCTGGAAAAG
CACCCCGAGC TGCGCGGCGA GCTGCTGAAG CTGCAAGGGG CCATCACCGA TGCCGATATG
GCGCGCATGA ACAACGAAGT GGAGACGCAA GGGCAGGAGC CGAAAGCAGT GGCCGACGCG
TTCCTGGCCG AGAAGGGGCT GATGTAG
 
Protein sequence
MILTETLALL VEKRDWFFEL LLQHIGISLI SIALAALIGL SLGIAIATWR RGAKPVLALV 
NFVYTIPSIA LFGFLIPITG IGDPTAIVAL TVYALLPMVR NTYTGLTTID PAIIEAARGM
GSTDRQLLYR IELPLAAPVI MSGIRNMATM TIALAGIATF IGAGGLGVAI FRGITTYNLA
MTLAGSVLIA LLAIVVDLLL GLAEKSTRRH LEPSNARRRK RSGARRVSRR KLAPAVAAGA
AVVLIAGGAF AFANRGGGEN VVNIATKPMT EQYILGEMLN TLIEHDTDLK VELTQGVGGG
TSNIEPGMEK GDFDLYPEYT GTGWNAVLKH DDTYDESMFD TMQQEYESQL GLTWVGMYGF
NNTYGLAVSR DAAERYNLRT YSDLAGAAGG LTLGAEPDFF DRQDGYPGLQ QAYGMNFGTT
RDMDISLKYQ ALFEGQVDAI VVSTTDGQVA DERLVVLEDD RHFYPSYLCG NVVRQDALEK
HPELRGELLK LQGAITDADM ARMNNEVETQ GQEPKAVADA FLAEKGLM