Gene Information |
Locus tag | Elen_2449 |
Symbol | |
ID | 8416773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2868457 |
End bp | 2869677 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645025431 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_003182794 |
Protein GI | 257792188 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
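The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(2868457, 2869677)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both results agree with the Gene Length and Protein Length rows. The strand ("-") does not affect the length calculation, only the orientation in which the sequence is read from the replicon.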
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00377769 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTCG ACGACAACAA CAGACAGACG GCTTCGGGTC GCCATGGTGC CTCCGCTTCG CGCGGAGGTT CTCACGCCTA TGGGCAACCT CCTGCGGACG GTTTCGGTCA GGCTCCCCAT GGCAGCCGGC CTTATCCGCA GGGCGGTGGG TACGCTTCGC GCCCGACGGG GCAGACGCCG TATCCAAACC AGCGACCGGC GAACTACGGC TACGCTTCGC AAGCTGACGA GATCGTGCGC GTGCGCAAGA AGCGCAAGAA GCACACCAAG CTGAAGCGGG CGGCGCTTAT CGCGCTCGCC GTCGTCGTGG TGCTCGTGGG CGGCGTCGCC GCGTACGGCG CTTGGTACAC GAGCAGCCTC GCAAGCAACA TGGCGCTCGA CGCTCAGGAG CAAAGCGAGT TGAGCAGCGT TCTCGCCCCC TCCGATTCCC AGATGGAGCC GTTCTACGTG CTGCTGGTGG GCTCGGACAA CTGGGAGACG TACGGGGAGC GCTCCGACGC CCTCGTGCTC GTGCGCATCG ATCCGGTGGG CCACGTCATC ACCATGGTGT CGGTTCCGCG CGACACGCCG TACGAGTACA ACGGCAAGGT CGAGAAGATC AACCAGATGT TCGCCGTGAA CGGGGCGGCT GGCGCCGTTA CGGCGGTGCA GGACCTCACC GGCGTGAAGA TCTCCGACTA CGTGGAAATC GAGTTCGCCG GGTTGGCCGA GTTCGTGGAT TCGATCGGCG GCATCTACGT GGACGTGCCC TACACCATCG ACTACCAGGT GTACACGCAG GATCAGGCGC CCGTCCATAT CGAGGCCGGC AACCAGCTGC TCAACGGCGA GCAGTGCGTG GCGCTTGCCC GCATGCGCAC CGCCTATGGC GACGACCAGG AGGCCATTCG CCAGTCGAAC GTGCGTGCGA TGGCCATGGC GCTCATGAAG AACGTGCTGC AAGCTCCTCC GGTCGAGATC CCCGGCCTGA TCCAAAACCT CTCCCAGTGC GTGTCTACGA GCATCGACTT GCAGACGATG ATCTCGCTTG CGACCGATTT CGCACAGGCG GGCAACCCCA CCATCTACAC GTGCACCGGT CCGTACAAGG GCGACTTCAT GGAGGAATAC GGCGGCCTGT GGCTGTGCTA CGAGGATCCC GAGGGTTGGG CTACACTGAT GAAGGCGGTC GACGCCGGCG AGAATCCCGA AGCTGCCGAG ACCACCGTTA ACGGCAAGTA A
|
Protein sequence | MGFDDNNRQT ASGRHGASAS RGGSHAYGQP PADGFGQAPH GSRPYPQGGG YASRPTGQTP YPNQRPANYG YASQADEIVR VRKKRKKHTK LKRAALIALA VVVVLVGGVA AYGAWYTSSL ASNMALDAQE QSELSSVLAP SDSQMEPFYV LLVGSDNWET YGERSDALVL VRIDPVGHVI TMVSVPRDTP YEYNGKVEKI NQMFAVNGAA GAVTAVQDLT GVKISDYVEI EFAGLAEFVD SIGGIYVDVP YTIDYQVYTQ DQAPVHIEAG NQLLNGEQCV ALARMRTAYG DDQEAIRQSN VRAMAMALMK NVLQAPPVEI PGLIQNLSQC VSTSIDLQTM ISLATDFAQA GNPTIYTCTG PYKGDFMEEY GGLWLCYEDP EGWATLMKAV DAGENPEAAE TTVNGK
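The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It assumes the gene sequence is given on the coding strand (it begins with ATG and ends with TAA), and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in the set of permitted start codons, which this sketch does not handle. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# identical in translation table 11); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = strip_blocks("ATGGGATTCG ACGACAACAA CAGACAGACG")
print(translate(first_blocks))    # MGFDDNNRQT
print(gc_percent("ATGGGATTCG"))   # 50.0
```

The translated prefix matches the first block of the protein sequence. Running the same functions on the full gene sequence would allow the GC content and Protein Length rows to be checked as well.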
|
| |