Gene Elen_1240 details

Gene Information

Locus tag: Elen_1240
Symbol:
ID: 8415531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1485267
End bp: 1486787
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 71%
IMG OID: 645024203
Product: DEAD/DEAH box helicase domain protein
Protein accession: YP_003181599
Protein GI: 257790993
COG category: [J] Translation, ribosomal structure and biogenesis; [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
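
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, although its numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1485267
end_bp = 1486787
gene_length_bp = 1521
protein_length_aa = 506


def span_length(start, end):
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one stop codon.

    Assumes an unspliced CDS with no frameshift and one terminal stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Here 1486787 - 1485267 + 1 = 1521 bp, and 1521 / 3 = 507 codons, which is 506 residues once the stop codon is excluded.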


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.502604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAAT TCAACGAACT CGGCCTGTCC GACCAGGCTC TTGAAGCCGT CGCGCGCCTG 
GGCTACGAAG CGCCCACGCC CGTCCAGGAG CAGGCCATCC CTCTCGCGCT CGAAGGGCGC
GACCTCATCG CCGCCGCCAA GACCGGCACC GGCAAGACCG CGGCCTTCTC GCTGCCCTCG
CTCGACCGCC TGGGCCATGC CAAGGGCGGG CAGGGCCCGC TCATGCTCGT GGTCACCCCC
ACGCGCGAGC TGGCCCAGCA GATCGGCGAG GTGTGCACCG CCATCGCCGC CTCCACGCAT
CACCGCATCC TCACCGTGGT GGGCGGCCTG TCCTACACCC CGCAGATCAA CAAGCTCAAG
CACGGCGTGG ACATCCTCAT CGCCACGCCG GGCCGCCTGG TCGACCTCAT GGAACAGGGC
GCCGTGCGCC TCGGCGACGT GGAGGTGCTC GTGCTGGACG AGGCCGACCG CATGCTGGAC
ATGGGCTTCT GGCCGGCCAT GAAGAAGATC ATCGGCGCCA CGCCCGCCTC GCGCCAGACG
CTGCTGTTCT CGGCCACCAT CGACGCGTCC ATCAAGAACA GCGTGGGCAA GCTGCTGCAC
GACCCGGCGT TCGTCGAGAT CGCCCACAAG GGCGAGACGG CCGACACCGT AGAGCAGTAC
ATCGTCCACG TGGCGCAGAC CCTCAAGCCG GCGCTGCTCA AGGCCGTGCT GGCCGAGAAG
GGCTCCGACC GCGTCATCGT GTTCGCCCGC ACGCGCAGCC GCGCCGATTC CACGTGCCGC
CGCCTCAAGC GCGCGGGCTA CACCGCCGAG GCTATCCACT CCGACCGCAG CCAGGCGCAG
CGCCGCCGCG CGCTCGACAA CTTCGCCGCC GGCAAGACGG GCGTGCTCGT GGCAACCGAC
GTGCTGGCGC GCGGCATCGA CGTGGAGGAA GTGGACTACG TCGTGAACTA TGACCTGCCC
ACCCAGCCCG AGGATTACGT CCACCGCATC GGCCGCACGG GTCGAGCGGG CGCCGCGGGC
TTCGCCGTGT CGTTCGTAAG CCCCGAGACG GCCGACGCGC TTCGCGACAT CGAGAAGCTC
ATCAAGCGCC CCATTCCCGA GATGGAGGTG CCTTCGTTCG ATGCCGAGCA GGCCGCCGAA
GAGGCCGCGG GCAAGGCCGC CCGCGCCGAC GCGCGCCGCG ACCCCGAGAT CAAGCAGGCC
GCCAAGGAGA TGGCCGCGCG CGAGCGCAAG AAGGCCAAGG CCCGCGAGCA GGCGCAGGCC
GAAGACGCGC CGAAAAACGC CCCGAAGCGC AAGGCGCCGA AGAAGCCCGT CGCGTCGAAG
GGCGCCCAGC CGAACCGCCA GCAGCCGGGC CGGGCGAACG CGAAGCAAGG GCAGCCCGGC
CAGCGCAGCG GCAAGCCCGC GTCCGGAGGC GCGCGCCCCG CGCAGTCTTC GGCGAAGAAG
GGCGGTCCCG ATCTCCGTCC GGGCCGCGCT CATCGCGCCA GCCTCGCCCA GCAGCGCAGC
AGAGGCGGCA AGCGCAAATA G
 
Protein sequence
MKQFNELGLS DQALEAVARL GYEAPTPVQE QAIPLALEGR DLIAAAKTGT GKTAAFSLPS 
LDRLGHAKGG QGPLMLVVTP TRELAQQIGE VCTAIAASTH HRILTVVGGL SYTPQINKLK
HGVDILIATP GRLVDLMEQG AVRLGDVEVL VLDEADRMLD MGFWPAMKKI IGATPASRQT
LLFSATIDAS IKNSVGKLLH DPAFVEIAHK GETADTVEQY IVHVAQTLKP ALLKAVLAEK
GSDRVIVFAR TRSRADSTCR RLKRAGYTAE AIHSDRSQAQ RRRALDNFAA GKTGVLVATD
VLARGIDVEE VDYVVNYDLP TQPEDYVHRI GRTGRAGAAG FAVSFVSPET ADALRDIEKL
IKRPIPEMEV PSFDAEQAAE EAAGKAARAD ARRDPEIKQA AKEMAARERK KAKAREQAQA
EDAPKNAPKR KAPKKPVASK GAQPNRQQPG RANAKQGQPG QRSGKPASGG ARPAQSSAKK
GGPDLRPGRA HRASLAQQRS RGGKRK
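
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be stripped of whitespace, checked for GC content, and translated. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence block in this record.
first_line = "ATGAAGCAAT TCAACGAACT CGGCCTGTCC GACCAGGCTC TTGAAGCCGT CGCGCGCCTG"
print(translate(first_line))  # MKQFNELGLSDQALEAVARL
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.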