Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_3081 |
Symbol | |
ID | 8417416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3582328 |
End bp | 3583923 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645026060 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_003183412 |
Protein GI | 257792806 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCAA GAAACTCTCG AACGAACCGT AAAGAAGCGC GCTCATCGCG GGCTCCTCAG TCTGCTCGCG CCGTGCGGTC GGCCAAGGCC GCGCAGTCGG GTCGCATCGC CCCGTACAGC CAGGCAAGCG CTTTCTCCCG CGATGCGTAT GGCGATGGCT CTTCGTACCG TGCGGCCTAC ACGCCCGGCA GCCAAGGTTC CGGCGCGAAC GGCGCGTACG CCCGGCAGAC TGCGGCAAGT CAATATTCGC GCAACAATCC AAGCTACTCG GCAGCCCGCA AGAAAGCCGG GCGCGGCAAG AAGATAGCCC TCGGCGTGAT CATCGCGATT CTCGTGGTGG CCGTGGGGGG CGGTTCGGCG TTCGCCTTGT GGAAGAACTC CGTCAACGAG AAGCTGATCA AGGGAAACAA GTCTGACGAA GAGATCATGG CCATCAACGA TGCGCTCAAG CCTGAAAAGG ACATGACCTT CACCGAGCCC TTCTACATGA TCCTCATCGG CACCGACGAG GCCGAGGACA GCACCGAGGA CATGCATCGT TCCGACACGA ACATCGTGGT GCGCATCGAT CCTGCAAAGA ACCAGGCCAC GATGGTATCC ATCCCCCGCG ATACGAAGAT CGACATCGAC GGCTACGGGA CGAACAAGTT CAACGCCGCT TACGCTTACG GCGGCGCTGC CGGAACCATC CGCGAAGCGA ATCAGCTGCT GGGCATCGAG ATATCGCACT ACGCGGAAGT GAACTTCGGC AAGTTGAAGG AATTGGTCGA CGCAGTTGGC GGCGTGGACG TCGAAGTGAC CGAGCGCGTC GATGATCCCG ACGCAGACGG TACCACGGCT CATCCCGAGT GGCCGCGCGT CATCATCGAG GAAGGCGAGC AGCACCTCGA CGGCAACCAA GCGCTGGTGT TCGCGCGCAG CCGCGCGTAT CCCGACGGAG ACTTCACGCG CACGGCGAAC CAGCGCAAGC TGATCATGGC CATCGTCAAC AAGGTGCTGG CACTGAACAC CGCCGAGCTT TTGGGTGCGG TTCAAGCAGC GGCCAACTGC GTGACCACGG ATCTTGCGGT GGGCGATATC GCCGCGTTGG CCCAGCAGTT CCAAGATGAC GGCGATCTTA CCATCTACTC TGCGATGGTT CCTTCTACCA CGGCGATGAT CGGCGACGTA TCGTACGTCA TCAACGATCC CGTGGCGACG AAGGAGATGA TGAAGCTGGT TGAGGCTGGA GAGGATCCGA GCTCGGTTGT CTCGTCAGGA TACGTTGATC CGGGCGACAC CACAGGCGGT GCCAGCACGT ACGGCAACGG TTACGGCAGC ACTGGTTCGG GCGCCGGCAA CGGCGCAGGA AACACCAATT ATTACGATCC GGGCTACACC GATGCATCGG GAACCGGAGG GGCTAACAAC GGCGGCTATG TTGATAACGG TTACGTTGAC AACGGTTACG TTGACAACAC CGGCGGAGCG GGTGGAACCG GCGGGTACGA CAATGGAACT ATTGCTGGCG GTGCGGGCGG TACCGACGGT GCAGGCGGAT ACGACCCCGG CTATACCGAT CCGGGCGCCT ACGACGGGAC GGCATCCGCT GCTTAG
|
Protein sequence | MASRNSRTNR KEARSSRAPQ SARAVRSAKA AQSGRIAPYS QASAFSRDAY GDGSSYRAAY TPGSQGSGAN GAYARQTAAS QYSRNNPSYS AARKKAGRGK KIALGVIIAI LVVAVGGGSA FALWKNSVNE KLIKGNKSDE EIMAINDALK PEKDMTFTEP FYMILIGTDE AEDSTEDMHR SDTNIVVRID PAKNQATMVS IPRDTKIDID GYGTNKFNAA YAYGGAAGTI REANQLLGIE ISHYAEVNFG KLKELVDAVG GVDVEVTERV DDPDADGTTA HPEWPRVIIE EGEQHLDGNQ ALVFARSRAY PDGDFTRTAN QRKLIMAIVN KVLALNTAEL LGAVQAAANC VTTDLAVGDI AALAQQFQDD GDLTIYSAMV PSTTAMIGDV SYVINDPVAT KEMMKLVEAG EDPSSVVSSG YVDPGDTTGG ASTYGNGYGS TGSGAGNGAG NTNYYDPGYT DASGTGGANN GGYVDNGYVD NGYVDNTGGA GGTGGYDNGT IAGGAGGTDG AGGYDPGYTD PGAYDGTASA A
|
| |