Gene Elen_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1934 
Symbol 
ID8416239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2265914 
End bp2267572 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content66% 
IMG OID645024905 
Productprotein of unknown function DUF88 
Protein accessionYP_003182287 
Protein GI257791681 
COG category[S] Function unknown 
COG ID[COG1432] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000288494 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0274849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATCA AGCAATCGTC TGAAAAGCGG TTCGCACTTT TGATCGATGC CGACAACGTG 
TCGGCGAAAT ATATAAAGCC CATCACCGAC GAGCTGTCGA AGTACGGCAC CGTCACCTAC
AAGCGCATCT ACGGCGACTG GACGCTCACG CTCCATGCCA AGTGGAAAGA CGCGCTGCTG
GAGAACTCCA TCACGCCCAT CCAGCAGTTC GGCTACACTC AAGGCAAGAA TGCCACCGAC
TCGGCCATGA TCATCGACGC CATGGACATC CTGTACACGC GCTCGGTGGA GGGCTTCTGC
ATCGTGTCGA GCGACAGCGA TTTCACGCGT CTGGCCAGCC GTATCCGCGA AAGCGGCCTC
ACGGTCATCG GCATGGGCGA GAAGAAGACG CCCACGCCGT TCAGAAAGGC ATGCGACATT
TTCACCACGC TGGAGCTTCT GCTGGGCGAC ACGGGCGGCA AGTCGGGCGG ACGCAACAGG
AACCGTCACG ACCAGGGCTC GTCGTCGAAC GGCCAGGGCG CTGGAACCAC CACCATGAGC
AAGGACGAGA TCGAGCAGGC CGTGGTGAAC ATCATCACGG ACAACCAGAA CAACGGCAAG
TCGACGGGGC TCGGAGAGGT GGGCAGCCGT CTGCTGAAGC GCTACCCCGA CTTCGACGTG
CGCAGCTACG GCACGAACCT GCTGTCGAAG CTGCTCGACG AGTTCGCCAG CGTGCAGATC
ATCAAGGACG GCAGCTCGGT GGCCGTGGTG CTGGCTGAGG GTGCGAACGC TCCGAAGGAC
GCTTCCCCGG AGGCAGAGCA GGCACCCGAG ACCAAGCAGG CCGACGACGT GAAAGACGCG
CCGGTTGCAG AATCTGAAGG ATCGACGGAC GCACAGGGCG CCGCTGAGGC GAAGCCGGTC
GAGAAGAAGC CCGCGTCGCG CCGTCAGCCG CGTCGTCGCA AGGATCAGGT CGCGGCGCAG
CAGGGATCCG AGGCTACCGA GGAGAAGCCG GTTCAGGAAC CGGAACTCTC AGCGGAGCCC
GCAGGCGAGC AGCATGATCG TCTGGCCGAA CCTGTCGTCG AAGCCGAGCC TGCCGAGCAG
CCTCCCTCGG ACAACCGTCC CGGACGTGCC GCCCGCATGC GCGCGGCTGC TTCTCGCTCG
AGAGGCTCGG AAGGGCGCAA GCAGGCGGGG AAGAAGCAGA CCGAGAAGGG TGAGCGCTCG
GACGGCGAGG TGCCTGCCCA GGCCGCCGCG CCAACCGAGG AGCAGAAGCC CGCTGCGAAG
CCGAAGCGCA AGCCCGCGAG GGCGAAGGCC CCCAAGGCCG AGCAGCCGGT CGCCGAGGCT
ACGGCGACGC AGGAGGAGCC CGTCGGGGAA GCGCCGAAGC GGGAGTCCGA AGCGCCCGCG
AAGCGCGCAC CGAAGCGTCC AGCCAAAGCG ACTGCGAAGG CTGTCGCCGA AGGCGCCGCT
GCCCCGTCCG ACCCCGAGGC GTTCATCCGC CAGACCGTGG CCGCCGCCGA GCCGGAGGGG
ATCGCGCTGT CCGTGCTGGG CAAGCGCGTG CGCGGCAAGT TCCGCACGTT CAAGCTGCGC
GATCTGGGCT ACGCGCAGTT CAGGCCCTAT CTCGACGACC TGGACGGCAT CAAGGTGGAG
CAGCGCGACG GCCAATCCTA CGCCCGCCTC GACCGATAA
 
Protein sequence
MDIKQSSEKR FALLIDADNV SAKYIKPITD ELSKYGTVTY KRIYGDWTLT LHAKWKDALL 
ENSITPIQQF GYTQGKNATD SAMIIDAMDI LYTRSVEGFC IVSSDSDFTR LASRIRESGL
TVIGMGEKKT PTPFRKACDI FTTLELLLGD TGGKSGGRNR NRHDQGSSSN GQGAGTTTMS
KDEIEQAVVN IITDNQNNGK STGLGEVGSR LLKRYPDFDV RSYGTNLLSK LLDEFASVQI
IKDGSSVAVV LAEGANAPKD ASPEAEQAPE TKQADDVKDA PVAESEGSTD AQGAAEAKPV
EKKPASRRQP RRRKDQVAAQ QGSEATEEKP VQEPELSAEP AGEQHDRLAE PVVEAEPAEQ
PPSDNRPGRA ARMRAAASRS RGSEGRKQAG KKQTEKGERS DGEVPAQAAA PTEEQKPAAK
PKRKPARAKA PKAEQPVAEA TATQEEPVGE APKRESEAPA KRAPKRPAKA TAKAVAEGAA
APSDPEAFIR QTVAAAEPEG IALSVLGKRV RGKFRTFKLR DLGYAQFRPY LDDLDGIKVE
QRDGQSYARL DR