Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_3000 |
Symbol | |
ID | 8417333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3482701 |
End bp | 3484188 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645025978 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_003183332 |
Protein GI | 257792726 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAACA AACGCAGCCC TCAACGGCGC CCTGCGAGCA CCGAGCGTTT TTCCCGCTCC TCGTACGGCC GCTCCCGTCA GACTGAACCG GGTCGTCGCC CCAGCGTTCA AGGCGAATCC GCTCCCACGC CGCGCCGCGT GTCGCGAGCC GATTACGAGC GCACGCGCGC ACGCTCTCGT CGCCGCACGC TGGTTATCGC GCTTGCCGTT GTGGGCGTGC TGGTGCTGGG CGGGGCGGGG GCTGCGTTCG CGTACTACAA CGTACTCTCG GGAAACCTTC ACGACGGCGT CAGCGCCGAG CTGCGCAATG CGCTCGTGGA AACCGATTTG GCGAACGAGC CGTTCTATAT CCTGCTCATG GGAACCGACG GGTCGAACGA CCGCGAAGCG TCGGCGGAAT TCGCCGGCGA TCAGTTCCGC AGCGACAGCA TCATCCTCGC GCGCATCGAC CCGGTGGACA AGAAGGCGAC GCTCGTCTCC ATCCACCGCG ATACCCTGGT AGACATGGGG GAATACGGGC AGAACAAGCT GAACGCTGCA CATGCCATCG GCGGAGCCGC TCTCACGGTG AAAACCGTAT CGAAGTTGGC GGGTGTGCCC ATTTCCCATT ATGCCGAGAT CAACTTCGAC GGCTTCAAAG ACATTGTCGA TGCGCTGGGC GGCGTCGAAG TGGATGTTCC GATGGAGATC GACGACGAGG ATGCCGGCGG CCATCTGGAT GCCGGGCTGC AGACGCTGAG CGGCGATCAG GCGCTCATCC TGTGTCGTTC GCGCCATGCC TACGACGAGT ACGGCGACGG CGACTCCTAC CGTGCAGCGA ACCAGCGTCT CGTCCTGTCC GCCATCGCGA AGAAAATCCT GTCGGCCGAC GTGGCCACGA TGGCATCCAC CGTCCAGGCG CTTTCCCAAT ACGTGACTAC GGATCTCGAG ATTTCCGACA TCATCGGCCT CGCGCAGACG ATGCAGGGCC TCGACCCGGC CACCGACATC TATTCGGCCA TGGAGCCGAC TACGTCCAAG TACATCAACG ACGTCTGGTA CGAAATCAAC AACGTCGACG AATGGAAGAA AATGATGACT CGCGTTAACC AGGGCCTTCC GCCCACCGAC GAGGACATCG TCGACGAGAT GTCGAACACC GTGCTGGCAA CCACCGGCAG CGGCCAGACG CATTCCAGCA CGAACGAGGA CGGCACGCAG AAAAAAGCGA AGCGCACGGG CAGCGTGGCG GTGCGCAACG GCAACGGCAT CACCGGCGCA GGCACCGAGG CGTCCGAGCG CATCGAGGAA CTGGGCTATT CGGTGGAGTC GGGCAATGCC GACAGCTTCG ACTACCCTAA GACGCTGGTG ATTTACGACG ATGAGGCGAA GGCATCGCGC GCGCAGGAGA TCGCGGATGC TCTTGGCGTT GGAAAAGCCA TGAAAAACGA CGGATCGTAT CTGTTCGAGA GCGATTTCCT CGTGGTGTTG GGAAGCGACT GGAAGTAG
|
Protein sequence | MDNKRSPQRR PASTERFSRS SYGRSRQTEP GRRPSVQGES APTPRRVSRA DYERTRARSR RRTLVIALAV VGVLVLGGAG AAFAYYNVLS GNLHDGVSAE LRNALVETDL ANEPFYILLM GTDGSNDREA SAEFAGDQFR SDSIILARID PVDKKATLVS IHRDTLVDMG EYGQNKLNAA HAIGGAALTV KTVSKLAGVP ISHYAEINFD GFKDIVDALG GVEVDVPMEI DDEDAGGHLD AGLQTLSGDQ ALILCRSRHA YDEYGDGDSY RAANQRLVLS AIAKKILSAD VATMASTVQA LSQYVTTDLE ISDIIGLAQT MQGLDPATDI YSAMEPTTSK YINDVWYEIN NVDEWKKMMT RVNQGLPPTD EDIVDEMSNT VLATTGSGQT HSSTNEDGTQ KKAKRTGSVA VRNGNGITGA GTEASERIEE LGYSVESGNA DSFDYPKTLV IYDDEAKASR AQEIADALGV GKAMKNDGSY LFESDFLVVL GSDWK
|
| |