Gene Elen_0066 details


Gene Information

Locus tag: Elen_0066
Symbol:
ID: 8414346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 85881
End bp: 86987
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 48%
IMG OID: 645023042
Product: hypothetical protein
Protein accession: YP_003180449
Protein GI: 257789843
COG category:
COG ID:
TIGRFAM ID:
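
The coordinate and length fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the numbers in this record) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(85881, 86987)
print(length)                     # 1107
print(protein_length_aa(length))  # 368
```

Both results agree with the Gene Length and Protein Length fields of this record.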


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCC GAAATTCTGT TGAAACGGCA CGCGCATCCG TTTCAAATAT CGAATCATAT 
ATTAGTAAAA AATGGCCGGG GTTTTTTGAC GAGATAGCCA CCACGCTATC CAAGCCTTTA
TGGCGCATGG GCAAGAACGA CATGTTCGCG ATATGCGACC AGGATATCGA CGATATCAAC
AAGGTCCAGT TCCTGGCAAT ACAAGAACGC GCGATCCAGC GCATTTTGCT TTCGAGCAAA
ACCCTGCAAC GCCCAGGAGG AGCATGGGAC TGCATACCTC CAATAGATTT AGCAAATCTT
TGCTGCGATT GGAAGAGATC CCCCGTGGTA CTCGAGTTCA ACTCTTCACT CGTTGAAGAA
CTTCAGCGTT CCGACATCGA TCCAGACGTT GATCTTTCGG AGCATCTTGA ACACCTCCCG
TTCTCCTGCT TCTTTATATC TGCCGAACAT CTTGGCTTCC GGCTCTCGAA TGCAGACGGC
AAGCTCAAAC ATGCGGTTGG GTTTTTCCTG GATTACGCGT GGATGCCTCG ATCAGACCAA
CCTCACCGAG TAGAAAAACA TTTCATAATT ACGATAATCG GAAGCAACGG CTACACGATC
CCCGTCGTTA TCCCCTTAAG ATTTTCGACG ATAAAAGATT TATCCGCATA TGTAGTTGAA
ACATATCTGC AAGCGAATGG CGGCAAGAGC AAGATGATCA AAGTGTTCGT CGACGAGGAT
CTTCATACGA TTCTTAGCCT GCTTCTCTAT ATCGCTTCAA AAGAGCCGGA TATAGTAGAG
AAGGAAATCG CACGGAGAAA GGACACGAAC GATCTTGATC GCAGTTCCAC TTTCAACAAC
GATCAAGAGC CACTGGACGA GCCGCGAACG TTTCTCGTTG GGGGAAAAAT CGGGCCCTCT
ATCGAGGCGC ATCGACACGC GAACAAAAAC GCAGGCAGCG GCCGTGCGAT AACTCCTCAT
ATTCGCCGAG CCCATTTTCA CACGTATCTC ACGGGTTCAC GAAAAGACAG AACTCAAAAG
AGAATTCTGA AATGGGTCGC ACAGACATCA GTGAACATGG AAAAGGAAGG CGATGCTTCG
ACTGTCATGG TGAGAAAAGT CGAATAA
 
Protein sequence
MTIRNSVETA RASVSNIESY ISKKWPGFFD EIATTLSKPL WRMGKNDMFA ICDQDIDDIN 
KVQFLAIQER AIQRILLSSK TLQRPGGAWD CIPPIDLANL CCDWKRSPVV LEFNSSLVEE
LQRSDIDPDV DLSEHLEHLP FSCFFISAEH LGFRLSNADG KLKHAVGFFL DYAWMPRSDQ
PHRVEKHFII TIIGSNGYTI PVVIPLRFST IKDLSAYVVE TYLQANGGKS KMIKVFVDED
LHTILSLLLY IASKEPDIVE KEIARRKDTN DLDRSSTFNN DQEPLDEPRT FLVGGKIGPS
IEAHRHANKN AGSGRAITPH IRRAHFHTYL TGSRKDRTQK RILKWVAQTS VNMEKEGDAS
TVMVRKVE
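
The GC content field and the protein sequence can be checked against the gene sequence with a few lines of code. This is a minimal sketch, not the method the database itself uses: it assumes the sequence is given on the coding strand, contains only A, C, G and T, and is translated with the amino acid assignments of translation table 11 (which are the same as those of the standard code; alternative start codons are not handled). All names are hypothetical.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGACCATCC GAAATTCTGT TGAAACGGCA"))  # MTIRNSVETA
```

Passing the full gene sequence to `gc_content` and `translate` allows a comparison with the GC content field and the protein sequence listed in this record.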