Gene Information |
Locus tag | Elen_1408 |
Symbol | |
ID | 8415706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1680663 |
End bp | 1681580 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645024377 |
Product | DNA protecting protein DprA |
Protein accession | YP_003181766 |
Protein GI | 257791160 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
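The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption; the record does not state it. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 1680663
end_bp = 1681580

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under that reading the computed values match the listed Gene Length (918 bp) and Protein Length (305 aa).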
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.313803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGC TCGCTGGCGA GCGCACGGTG CTCGCGCGCG GCGGAGAAGG CTTCCCTTCT GCGCTGGAAA GCGTGGCGCA GCCGCCGGAT CGGCTGTACG TGGTAGGCGA TCCTTTCGCG CTCCAGGAGG GCATCGCCAT CGTGGGCGCG CGCAGGGCCA CGCCGTACGG GCGCGGGTGC GCCAAGCGCT TCGCGGCCTT GGCCGCCCGG CGCGGCATCG TAGTGGTGTC GGGCGGGGCG CGCGGCTGCG ACGCGGCGGC GCATGCGGCC GCGCTGGAGG AGGGCGGGCG CACGGTGGCG TTTCTGGGCG GCGGGTGCGA CCGGCTGTAC CCCGCCGAGC ACGAGGGGCT GTTCCAGCGC ATCGTCGACG AGGGCGGAGC GGTGGTCTCC GAGCACGCGT GGGACGAGGA CCCCAAGCCG TACCGTTTCC GCCTGCGCAA CCGCCTCATC GCCGGGCTCG CCCGCGCCAC GCTCATCGTG GAGGCGGGAC TGCCGTCGGG AACCTTCTCC ACGGCCGACG AGGCGCTGGC GGCGAACCGC GACGTGCTTG TGGTGCCCGG CGCCATAACG GCCGTGTCCT CGCGCGGCGC GAACCGCCTC ATCTACCAGG GCGCGACGCC CGTCATCGAT GACGAGACGT TCGAGGATGC GCTGTTCTCG CTGTTCGGAT GCCTGAAGCA GGAGACCGTC CCTTCCGTCG AGCGCACCGG TGCGCCGCGT GCGGACGAGC CGTCGAACCC GGTGGCGGAC GCGCTGCGCG CCGAGCCGCT GAGCATGGAG CAGCTCTACG CGATCGCCGC CTCATCGTGC GGCGGGGAGG ATGCGCGATC CTGGCTCATG GAGCGCCTCG TCGAAGCCGA GCTCGCCGGC ACCGTCGCCC GCCACCCCGA CGGCCGCTGG GGTCCGGCGG TGAGATGA
|
Protein sequence | MSALAGERTV LARGGEGFPS ALESVAQPPD RLYVVGDPFA LQEGIAIVGA RRATPYGRGC AKRFAALAAR RGIVVVSGGA RGCDAAAHAA ALEEGGRTVA FLGGGCDRLY PAEHEGLFQR IVDEGGAVVS EHAWDEDPKP YRFRLRNRLI AGLARATLIV EAGLPSGTFS TADEALAANR DVLVVPGAIT AVSSRGANRL IYQGATPVID DETFEDALFS LFGCLKQETV PSVERTGAPR ADEPSNPVAD ALRAEPLSME QLYAIAASSC GGEDARSWLM ERLVEAELAG TVARHPDGRW GPAVR
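The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is a minimal pure-Python version, not the database's own pipeline. The function name `translate_bacterial` is hypothetical, and the start-codon set is a simplification covering only the most common bacterial initiators.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in which codons may start.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the most common initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_bacterial(dna: str) -> str:
    """Translate a CDS, reading an initiator codon in first position as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codons are decoded as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_bacterial("GTGAGCGCGC TCGCTGGCGA GCGCACGGTG"))  # MSALAGERTV
```

The gene begins with GTG, which encodes valine internally but is read as methionine at the start position. That is why the listed protein begins with M rather than V.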