Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1240 |
Symbol | |
ID | 8415531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1485267 |
End bp | 1486787 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645024203 |
Product | DEAD/DEAH box helicase domain protein |
Protein accession | YP_003181599 |
Protein GI | 257790993 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.502604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAT TCAACGAACT CGGCCTGTCC GACCAGGCTC TTGAAGCCGT CGCGCGCCTG GGCTACGAAG CGCCCACGCC CGTCCAGGAG CAGGCCATCC CTCTCGCGCT CGAAGGGCGC GACCTCATCG CCGCCGCCAA GACCGGCACC GGCAAGACCG CGGCCTTCTC GCTGCCCTCG CTCGACCGCC TGGGCCATGC CAAGGGCGGG CAGGGCCCGC TCATGCTCGT GGTCACCCCC ACGCGCGAGC TGGCCCAGCA GATCGGCGAG GTGTGCACCG CCATCGCCGC CTCCACGCAT CACCGCATCC TCACCGTGGT GGGCGGCCTG TCCTACACCC CGCAGATCAA CAAGCTCAAG CACGGCGTGG ACATCCTCAT CGCCACGCCG GGCCGCCTGG TCGACCTCAT GGAACAGGGC GCCGTGCGCC TCGGCGACGT GGAGGTGCTC GTGCTGGACG AGGCCGACCG CATGCTGGAC ATGGGCTTCT GGCCGGCCAT GAAGAAGATC ATCGGCGCCA CGCCCGCCTC GCGCCAGACG CTGCTGTTCT CGGCCACCAT CGACGCGTCC ATCAAGAACA GCGTGGGCAA GCTGCTGCAC GACCCGGCGT TCGTCGAGAT CGCCCACAAG GGCGAGACGG CCGACACCGT AGAGCAGTAC ATCGTCCACG TGGCGCAGAC CCTCAAGCCG GCGCTGCTCA AGGCCGTGCT GGCCGAGAAG GGCTCCGACC GCGTCATCGT GTTCGCCCGC ACGCGCAGCC GCGCCGATTC CACGTGCCGC CGCCTCAAGC GCGCGGGCTA CACCGCCGAG GCTATCCACT CCGACCGCAG CCAGGCGCAG CGCCGCCGCG CGCTCGACAA CTTCGCCGCC GGCAAGACGG GCGTGCTCGT GGCAACCGAC GTGCTGGCGC GCGGCATCGA CGTGGAGGAA GTGGACTACG TCGTGAACTA TGACCTGCCC ACCCAGCCCG AGGATTACGT CCACCGCATC GGCCGCACGG GTCGAGCGGG CGCCGCGGGC TTCGCCGTGT CGTTCGTAAG CCCCGAGACG GCCGACGCGC TTCGCGACAT CGAGAAGCTC ATCAAGCGCC CCATTCCCGA GATGGAGGTG CCTTCGTTCG ATGCCGAGCA GGCCGCCGAA GAGGCCGCGG GCAAGGCCGC CCGCGCCGAC GCGCGCCGCG ACCCCGAGAT CAAGCAGGCC GCCAAGGAGA TGGCCGCGCG CGAGCGCAAG AAGGCCAAGG CCCGCGAGCA GGCGCAGGCC GAAGACGCGC CGAAAAACGC CCCGAAGCGC AAGGCGCCGA AGAAGCCCGT CGCGTCGAAG GGCGCCCAGC CGAACCGCCA GCAGCCGGGC CGGGCGAACG CGAAGCAAGG GCAGCCCGGC CAGCGCAGCG GCAAGCCCGC GTCCGGAGGC GCGCGCCCCG CGCAGTCTTC GGCGAAGAAG GGCGGTCCCG ATCTCCGTCC GGGCCGCGCT CATCGCGCCA GCCTCGCCCA GCAGCGCAGC AGAGGCGGCA AGCGCAAATA G
|
Protein sequence | MKQFNELGLS DQALEAVARL GYEAPTPVQE QAIPLALEGR DLIAAAKTGT GKTAAFSLPS LDRLGHAKGG QGPLMLVVTP TRELAQQIGE VCTAIAASTH HRILTVVGGL SYTPQINKLK HGVDILIATP GRLVDLMEQG AVRLGDVEVL VLDEADRMLD MGFWPAMKKI IGATPASRQT LLFSATIDAS IKNSVGKLLH DPAFVEIAHK GETADTVEQY IVHVAQTLKP ALLKAVLAEK GSDRVIVFAR TRSRADSTCR RLKRAGYTAE AIHSDRSQAQ RRRALDNFAA GKTGVLVATD VLARGIDVEE VDYVVNYDLP TQPEDYVHRI GRTGRAGAAG FAVSFVSPET ADALRDIEKL IKRPIPEMEV PSFDAEQAAE EAAGKAARAD ARRDPEIKQA AKEMAARERK KAKAREQAQA EDAPKNAPKR KAPKKPVASK GAQPNRQQPG RANAKQGQPG QRSGKPASGG ARPAQSSAKK GGPDLRPGRA HRASLAQQRS RGGKRK
|
| |