Gene Elen_0562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0562 
Symbol 
ID8414847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp715135 
End bp716355 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content65% 
IMG OID645023534 
Producthypothetical protein 
Protein accessionYP_003180936 
Protein GI257790330 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAACG TATTCAAAGG CGCACTGCTC GCGCTGGTGC GCGAGAAGAG CGTGTTCATC 
TGGTCGCTGG CGTTTCCCCT GATCCTGTCC ACGATGTTCG TGTTCATGTT CGCGAACCTG
GACGAGGCGG GGCAGTTCGA GCCCATCCCC ACCGCGGTGG TGGCCGACGA GAACTACGAC
GCGGCGCCGG GGTTCTCGGA GATGATCGAC ACGCTGGCGG AGCCGGGAGC CGACCAGATG
CTCGACGTGG CGCGCGTAGC CACCGAGCAG GAGGCGCGCG ATCTCATGAG CGGAAACGAT
ACCGCAGGAG CGGGCTACTT CAACATCTCG GGCGATGGAG CGGCCGGGTA TTTCACGGTT
GACGCCGACG GCATGCCCAC CGTGCACGTG AAGGCGGGGG TCACGCCCGA CTCGCTGGAC
AGCGCCTACC AGTCCATCTT GAAGACCATC GGCGACGGAT ACGTGCGCAA CGCGGCGCTC
ATCGAAGACG TCGCCGCCGA GAACCCCGCC GCGCTGGCCG ACATGGCGGC GGTGGAAAAG
CTGCTGGACG CCGGCGATCT CACCGAGAAG ATCGACGTCA CGCAGAACCC GCCCAAAGAA
TCCGTGCGCT ACTTCTTCGC ATTGCTGGGC ATGGCGGCAC TGTTCGGTGG GCAGATCGGG
ATGATCGCTA TCTGCCGCAC GCAGCCGAAC CTGAGCGCGC TGGGGGCGCG GCGCGCCGTG
GGAGCGCTCA GCCGCGCGAA GACGCTGACG GCGACGCTGG CCGCCAGCTG GGTGCTGACG
TTCGCCTGCA TCGCCATCGC GTATCTGTAC ATCCGGTTCG TCGCCGGCGT GGATTTCGGC
GGACGAGATG CGATATGCAT CGCCGTGATC GCCGCCGCGG CCTTGACGGC GACGGCGTTC
GGCACGCTGC TGGGCTCGCT GCCGAAGATC GACGAAAGCG TGAAGGGCGG CATGCTGTCC
GGCATCGTGT GCTTCGCCTC GCTGTTCGCC GGGCTGTACG GCTCGCCCAC GATGAAGCTG
GCCGATACCG TGAACGCGGC GGTGCCCGCG GCGCAGCTGG TCAACCCGGC CGTGCAGATA
TCCCAAGCGT TCTACAGCAT CATGTACTAC GACACCTACC AGCGCACGAT CGAGCACATC
CTGATCCTGC TGGCCATGGC TGCGGTACTG TTCGCCGCGT CGGCTCTGTT CATAAGGAGG
CAGCGCTATG CAAGTCTTTA A
 
Protein sequence
MFNVFKGALL ALVREKSVFI WSLAFPLILS TMFVFMFANL DEAGQFEPIP TAVVADENYD 
AAPGFSEMID TLAEPGADQM LDVARVATEQ EARDLMSGND TAGAGYFNIS GDGAAGYFTV
DADGMPTVHV KAGVTPDSLD SAYQSILKTI GDGYVRNAAL IEDVAAENPA ALADMAAVEK
LLDAGDLTEK IDVTQNPPKE SVRYFFALLG MAALFGGQIG MIAICRTQPN LSALGARRAV
GALSRAKTLT ATLAASWVLT FACIAIAYLY IRFVAGVDFG GRDAICIAVI AAAALTATAF
GTLLGSLPKI DESVKGGMLS GIVCFASLFA GLYGSPTMKL ADTVNAAVPA AQLVNPAVQI
SQAFYSIMYY DTYQRTIEHI LILLAMAAVL FAASALFIRR QRYASL