Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2003 |
Symbol | |
ID | 8416314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2347849 |
End bp | 2350452 |
Gene Length | 2604 bp |
Protein Length | 867 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024980 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003182356 |
Protein GI | 257791750 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.228987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000000440234 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGGAAT TCCTCAAGAA GGCGTCGTGC CGAGGCCGTC GTCCGCTGCA TCTCGTCCCG CGGCGCCAGA TACCTCGCCC CAACCTCATC GCTAAGCTTT TGCGCGAGCG GCACGTCGCG CGGTTCATCG TCGCGCCCGA CGGCTTCGGC AAGACGGGTC TTGCGATGGA ATACGCCGAT ACGGTGTTCT CGTTCGAGCA TGTGGTCTGG CTCGACGGCC GCAGCCCGTG CTTCTTGCGC GATCTCGATC GCGGCATCAT AGCGCGCACG CTTTTGGAGG GCGACCCGCA GCCGTTCCTC GTCGTCATAG ACGATGTGCC GCTGCTCGAT CCTGAGCGCG CCGAGCTGTT ATCCGACGAG CTCGATCTGC TGCTCGACCG CGATTGCGAG GTGATGGTGG CCTGCATGCC ATCGCGCGAC GCCTTCGCGC GTCATCGCGA CCGCATCAAG CTCACGGCCG TCGACCTGCT GCTCACCGAT GAGGAAGTGG ACCTGCTCCG CACACCGGGG GAGCGCTCGA GCGATCCCGC ATCGACGATT GCGCCCGCGT GCCGCGTCGC CACGTTCGTA TGGGGGTCGG ACGAGGAGCG CGCGGGGTTC TTGTCCGCCG TGCTTGGCGA GGAGCTTCCG GCCGACCTGC TGCTGCCGCT GTTCGTGATG GCGTCGCTGG GGAAGGGGAC GTTCGAGGAC GTGGCGGCGT TCGGCCCGTT CGGCCCCGAT CAGGACGCGC TGCTCGCCGA GCAGTACCCC TATCTCGGCA TCGACTCCCG CAGCGGCCGG TTCGAGGCCG CGCCGTTCGA GGTCGAAGCC CTCGCCGGCG CGTTTGCGCG CAAGCTGGGC GCGTTGGCCG ATCGGTCGTT GTTCTCCGAT GCGAGCACGC TCGTCGTGCG CTTGGGCGAT GCGCTCGCGA TGCGCGGCGA GCACGAACGG GCCTGCGATT TCGTACGTTT GCTGGCGTCG CGCGCCCCGC GAGCGTCCTG GCTCGCGAAA CACGGGGCGG CGTTGCTCGA AGCCGCTTGC CTCGTGCCGG CCTGCGAGGT GCACCGCTCC TTATCGGGCG AGACGGCGGG GAAGGGCGCT CTGCTCAGCG CCCACGAGGC GGTTCGCCGC GCGCTTTTGG GGGATCGACA AGCGGCCTGC GTTGCCGCGC GCAAGGCGGT GCGCGACAGG AACGCGCCGT CCTCCGTGCG GATGACGGGC GCGTTGGTGC TCGCGCAATG CGCGGAAGCG GACGAGCGGC GGCGCGCCGA CAAGCTCGTC GTGTCGCTGC TCGCGGCCGC GGGCATCGCG GATGCGGCCG AAGCGCCCCG TGAGGCCGTC TGCGCGAAGG CGCCGGAAGA TCGGGGATGG CTCGCTGCGG GATGCGTGCA TGCGCAGCTG CGGAAAGCGG GCTGCCCCGA GGCTGCTCGG GTATGGCTCG ATTGGCACGA CGACGGCGCG CGCGGGAGCT TCCTCAAGCA GTCGGCCGCC GAAGTGCTGC GTCAAGCTGC GGCGTCGGGA GCCGGCGAGG CCCGATCGAT CGAGCTCGAC CGTTTGGGCG CCTTCGTGCG CAAGGAGGTG TCCCAGGCGA GCCGCGGCAC GCTCGGGCTC GGCGACGCTT TGGCCGGCAT CGCCTACGAG CGGGCCTGCG AGCGCGGGGC GATCGCCGTT CCCGCGCTCG ACGCGCAGGC GGCCCTCGCC GTCAGGCGCA TAGAGATGCG CCTGTTCTCC CAGCGCAACG CGTGCGAGCG GCTGGAGCTG GAGCGATTGG AACGCGAGCG CGTATTCGTC GCCACCCATC CCGACGCATA CCGTGAAGAG GGTCGCGCGA CTCGCCTGCC GAAGCTGCCG TCCTCCGTGC CCGCGCTCAC CGTCAACCTG TTCGGGGGCC TCGACGTGCT GATGGGGGAT GAGCGGGTGG ATCCGTCGCT GTTCAGCCGC CAGAAGGTCA AGACGCTGCT CGCCCTGCTC GTGCTGCACA ACGGCCGCGA GTTCTCGCGC GACAAGCTGG TGGGTCTTCT GTGGCCCGAC AGCGAGATCA TGCACGGGCG CAAGAACTTC TACGGCATTT GGGCGATGCT GCGCCGCGCG CTGACGCTGC CGTCGGGCGA GTGCCCCTAT CTCATTCGCC AGCAGCAGGG GTTGAGACTT GACGCGAGCC TGCTGACGAG CGATGTGGCG CAGCTCGAGG ACGTGTGCCG CACGCTGCTG TTCGAGCGGC CCGGATACGG CGGCTGGGCG CAGGTGTACT CGCAGGTCAA CGATCGCTTC TCGGACGATC TTCTGCCCAG CGAGAACGGC AACGACGCCC TCGCGTCGCT GCGCGTGGAC TACCGCAACC GCCTCGTGGA CGCGCTGGTG GCCGCCTCGA CGCGCCTCGT CGCTGCAGGC GAGGCCCAGG AGGGACTCTG GTTCGCCCGT GCGGCGCTCC AGCGCGACCG CTCGCGCGAA GACGCCTACA TCTGCCTCAT GCAGGCCCAG CTGGCCGCGG GGCAGCGTAC GGCCGCTCTC GAGACCTACT TTGCCTGCCG CCGCTTCCTC ACCGACGAGC TGGGTATCGA CCCCTCCCTC GAAACGATGC GCCTCTATCG CAGCATCATC GAAACCGAAA CCGATTTCGA GTGA
|
Protein sequence | MPEFLKKASC RGRRPLHLVP RRQIPRPNLI AKLLRERHVA RFIVAPDGFG KTGLAMEYAD TVFSFEHVVW LDGRSPCFLR DLDRGIIART LLEGDPQPFL VVIDDVPLLD PERAELLSDE LDLLLDRDCE VMVACMPSRD AFARHRDRIK LTAVDLLLTD EEVDLLRTPG ERSSDPASTI APACRVATFV WGSDEERAGF LSAVLGEELP ADLLLPLFVM ASLGKGTFED VAAFGPFGPD QDALLAEQYP YLGIDSRSGR FEAAPFEVEA LAGAFARKLG ALADRSLFSD ASTLVVRLGD ALAMRGEHER ACDFVRLLAS RAPRASWLAK HGAALLEAAC LVPACEVHRS LSGETAGKGA LLSAHEAVRR ALLGDRQAAC VAARKAVRDR NAPSSVRMTG ALVLAQCAEA DERRRADKLV VSLLAAAGIA DAAEAPREAV CAKAPEDRGW LAAGCVHAQL RKAGCPEAAR VWLDWHDDGA RGSFLKQSAA EVLRQAAASG AGEARSIELD RLGAFVRKEV SQASRGTLGL GDALAGIAYE RACERGAIAV PALDAQAALA VRRIEMRLFS QRNACERLEL ERLERERVFV ATHPDAYREE GRATRLPKLP SSVPALTVNL FGGLDVLMGD ERVDPSLFSR QKVKTLLALL VLHNGREFSR DKLVGLLWPD SEIMHGRKNF YGIWAMLRRA LTLPSGECPY LIRQQQGLRL DASLLTSDVA QLEDVCRTLL FERPGYGGWA QVYSQVNDRF SDDLLPSENG NDALASLRVD YRNRLVDALV AASTRLVAAG EAQEGLWFAR AALQRDRSRE DAYICLMQAQ LAAGQRTAAL ETYFACRRFL TDELGIDPSL ETMRLYRSII ETETDFE
|
| |