Gene Elen_2449 details


Gene Information

Locus tag: Elen_2449
Symbol:
ID: 8416773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2868457
End bp: 2869677
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 64%
IMG OID: 645025431
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_003182794
Protein GI: 257792188
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
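
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 2868457
END_BP = 2869677


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1221 406
```

Both results match the Gene Length and Protein Length values reported in the record.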


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00377769
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTCG ACGACAACAA CAGACAGACG GCTTCGGGTC GCCATGGTGC CTCCGCTTCG 
CGCGGAGGTT CTCACGCCTA TGGGCAACCT CCTGCGGACG GTTTCGGTCA GGCTCCCCAT
GGCAGCCGGC CTTATCCGCA GGGCGGTGGG TACGCTTCGC GCCCGACGGG GCAGACGCCG
TATCCAAACC AGCGACCGGC GAACTACGGC TACGCTTCGC AAGCTGACGA GATCGTGCGC
GTGCGCAAGA AGCGCAAGAA GCACACCAAG CTGAAGCGGG CGGCGCTTAT CGCGCTCGCC
GTCGTCGTGG TGCTCGTGGG CGGCGTCGCC GCGTACGGCG CTTGGTACAC GAGCAGCCTC
GCAAGCAACA TGGCGCTCGA CGCTCAGGAG CAAAGCGAGT TGAGCAGCGT TCTCGCCCCC
TCCGATTCCC AGATGGAGCC GTTCTACGTG CTGCTGGTGG GCTCGGACAA CTGGGAGACG
TACGGGGAGC GCTCCGACGC CCTCGTGCTC GTGCGCATCG ATCCGGTGGG CCACGTCATC
ACCATGGTGT CGGTTCCGCG CGACACGCCG TACGAGTACA ACGGCAAGGT CGAGAAGATC
AACCAGATGT TCGCCGTGAA CGGGGCGGCT GGCGCCGTTA CGGCGGTGCA GGACCTCACC
GGCGTGAAGA TCTCCGACTA CGTGGAAATC GAGTTCGCCG GGTTGGCCGA GTTCGTGGAT
TCGATCGGCG GCATCTACGT GGACGTGCCC TACACCATCG ACTACCAGGT GTACACGCAG
GATCAGGCGC CCGTCCATAT CGAGGCCGGC AACCAGCTGC TCAACGGCGA GCAGTGCGTG
GCGCTTGCCC GCATGCGCAC CGCCTATGGC GACGACCAGG AGGCCATTCG CCAGTCGAAC
GTGCGTGCGA TGGCCATGGC GCTCATGAAG AACGTGCTGC AAGCTCCTCC GGTCGAGATC
CCCGGCCTGA TCCAAAACCT CTCCCAGTGC GTGTCTACGA GCATCGACTT GCAGACGATG
ATCTCGCTTG CGACCGATTT CGCACAGGCG GGCAACCCCA CCATCTACAC GTGCACCGGT
CCGTACAAGG GCGACTTCAT GGAGGAATAC GGCGGCCTGT GGCTGTGCTA CGAGGATCCC
GAGGGTTGGG CTACACTGAT GAAGGCGGTC GACGCCGGCG AGAATCCCGA AGCTGCCGAG
ACCACCGTTA ACGGCAAGTA A
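
The record reports a GC content of 64% for this gene. One way such a figure could be recomputed from the sequence is sketched below. This is an illustrative helper (the name `gc_percent` is hypothetical) that ignores the spaces used to group bases in blocks of ten. How the database itself derives or rounds the value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the
    space-grouped blocks used in the record can be pasted directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, as listed in the record.
first_line = "ATGGGATTCG ACGACAACAA CAGACAGACG GCTTCGGGTC GCCATGGTGC CTCCGCTTCG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the whole-gene value to compare against the reported 64%.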
 
Protein sequence
MGFDDNNRQT ASGRHGASAS RGGSHAYGQP PADGFGQAPH GSRPYPQGGG YASRPTGQTP 
YPNQRPANYG YASQADEIVR VRKKRKKHTK LKRAALIALA VVVVLVGGVA AYGAWYTSSL
ASNMALDAQE QSELSSVLAP SDSQMEPFYV LLVGSDNWET YGERSDALVL VRIDPVGHVI
TMVSVPRDTP YEYNGKVEKI NQMFAVNGAA GAVTAVQDLT GVKISDYVEI EFAGLAEFVD
SIGGIYVDVP YTIDYQVYTQ DQAPVHIEAG NQLLNGEQCV ALARMRTAYG DDQEAIRQSN
VRAMAMALMK NVLQAPPVEI PGLIQNLSQC VSTSIDLQTM ISLATDFAQA GNPTIYTCTG
PYKGDFMEEY GGLWLCYEDP EGWATLMKAV DAGENPEAAE TTVNGK
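
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal translator, assuming the standard codon-to-amino-acid assignments. Translation table 11, named in the record, uses the same assignments as the standard code and differs in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical. Ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence in the record.
print(translate("ATGGGATTCG ACGACAACAA CAGACAGACG"))  # MGFDDNNRQT
print(translate("ACCACCGTTA ACGGCAAGTA A"))           # TTVNGK
```

The two outputs match the first ten and last six residues of the listed protein sequence.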