Gene Elen_3081 details


Gene Information

Locus tag: Elen_3081
Symbol:
ID: 8417416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3582328
End bp: 3583923
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 61%
IMG OID: 645026060
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_003183412
Protein GI: 257792806
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
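
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3582328
END_BP = 3583923


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1596
print(protein_length_aa(length))  # 531
```

Both results agree with the listed Gene Length (1596 bp) and Protein Length (531 aa).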


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAA GAAACTCTCG AACGAACCGT AAAGAAGCGC GCTCATCGCG GGCTCCTCAG 
TCTGCTCGCG CCGTGCGGTC GGCCAAGGCC GCGCAGTCGG GTCGCATCGC CCCGTACAGC
CAGGCAAGCG CTTTCTCCCG CGATGCGTAT GGCGATGGCT CTTCGTACCG TGCGGCCTAC
ACGCCCGGCA GCCAAGGTTC CGGCGCGAAC GGCGCGTACG CCCGGCAGAC TGCGGCAAGT
CAATATTCGC GCAACAATCC AAGCTACTCG GCAGCCCGCA AGAAAGCCGG GCGCGGCAAG
AAGATAGCCC TCGGCGTGAT CATCGCGATT CTCGTGGTGG CCGTGGGGGG CGGTTCGGCG
TTCGCCTTGT GGAAGAACTC CGTCAACGAG AAGCTGATCA AGGGAAACAA GTCTGACGAA
GAGATCATGG CCATCAACGA TGCGCTCAAG CCTGAAAAGG ACATGACCTT CACCGAGCCC
TTCTACATGA TCCTCATCGG CACCGACGAG GCCGAGGACA GCACCGAGGA CATGCATCGT
TCCGACACGA ACATCGTGGT GCGCATCGAT CCTGCAAAGA ACCAGGCCAC GATGGTATCC
ATCCCCCGCG ATACGAAGAT CGACATCGAC GGCTACGGGA CGAACAAGTT CAACGCCGCT
TACGCTTACG GCGGCGCTGC CGGAACCATC CGCGAAGCGA ATCAGCTGCT GGGCATCGAG
ATATCGCACT ACGCGGAAGT GAACTTCGGC AAGTTGAAGG AATTGGTCGA CGCAGTTGGC
GGCGTGGACG TCGAAGTGAC CGAGCGCGTC GATGATCCCG ACGCAGACGG TACCACGGCT
CATCCCGAGT GGCCGCGCGT CATCATCGAG GAAGGCGAGC AGCACCTCGA CGGCAACCAA
GCGCTGGTGT TCGCGCGCAG CCGCGCGTAT CCCGACGGAG ACTTCACGCG CACGGCGAAC
CAGCGCAAGC TGATCATGGC CATCGTCAAC AAGGTGCTGG CACTGAACAC CGCCGAGCTT
TTGGGTGCGG TTCAAGCAGC GGCCAACTGC GTGACCACGG ATCTTGCGGT GGGCGATATC
GCCGCGTTGG CCCAGCAGTT CCAAGATGAC GGCGATCTTA CCATCTACTC TGCGATGGTT
CCTTCTACCA CGGCGATGAT CGGCGACGTA TCGTACGTCA TCAACGATCC CGTGGCGACG
AAGGAGATGA TGAAGCTGGT TGAGGCTGGA GAGGATCCGA GCTCGGTTGT CTCGTCAGGA
TACGTTGATC CGGGCGACAC CACAGGCGGT GCCAGCACGT ACGGCAACGG TTACGGCAGC
ACTGGTTCGG GCGCCGGCAA CGGCGCAGGA AACACCAATT ATTACGATCC GGGCTACACC
GATGCATCGG GAACCGGAGG GGCTAACAAC GGCGGCTATG TTGATAACGG TTACGTTGAC
AACGGTTACG TTGACAACAC CGGCGGAGCG GGTGGAACCG GCGGGTACGA CAATGGAACT
ATTGCTGGCG GTGCGGGCGG TACCGACGGT GCAGGCGGAT ACGACCCCGG CTATACCGAT
CCGGGCGCCT ACGACGGGAC GGCATCCGCT GCTTAG
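
A GC content figure such as the one listed for this gene can be recomputed from a sequence given in this layout. The sketch below assumes the sequence is pasted as plain text in 10-base blocks separated by whitespace, as shown above; `gc_fraction` is a hypothetical helper name. It is demonstrated only on the first 10-base block, not on the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above: 3 G + 2 C out of 10 bases.
first_block = "ATGGCGTCAA"
print(gc_fraction(first_block))  # 0.5
```

Passing the full gene sequence as one multi-line string gives the value for the whole gene, which can then be compared with the listed GC content.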
 
Protein sequence
MASRNSRTNR KEARSSRAPQ SARAVRSAKA AQSGRIAPYS QASAFSRDAY GDGSSYRAAY 
TPGSQGSGAN GAYARQTAAS QYSRNNPSYS AARKKAGRGK KIALGVIIAI LVVAVGGGSA
FALWKNSVNE KLIKGNKSDE EIMAINDALK PEKDMTFTEP FYMILIGTDE AEDSTEDMHR
SDTNIVVRID PAKNQATMVS IPRDTKIDID GYGTNKFNAA YAYGGAAGTI REANQLLGIE
ISHYAEVNFG KLKELVDAVG GVDVEVTERV DDPDADGTTA HPEWPRVIIE EGEQHLDGNQ
ALVFARSRAY PDGDFTRTAN QRKLIMAIVN KVLALNTAEL LGAVQAAANC VTTDLAVGDI
AALAQQFQDD GDLTIYSAMV PSTTAMIGDV SYVINDPVAT KEMMKLVEAG EDPSSVVSSG
YVDPGDTTGG ASTYGNGYGS TGSGAGNGAG NTNYYDPGYT DASGTGGANN GGYVDNGYVD
NGYVDNTGGA GGTGGYDNGT IAGGAGGTDG AGGYDPGYTD PGAYDGTASA A
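
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the standard codon assignments (table 11 uses the same codon-to-amino-acid mapping as the standard code and differs in which codons may act as start codons). `translate` is a hypothetical helper and does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    # Ignore any incomplete codon at the end of the input.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCGTCAA GAAACTCTCG AACGAACCGT AAAGAAGCGC GCTCATCGCG GGCTCCTCAG"
print(translate(first_line))  # MASRNSRTNRKEARSSRAPQ
```

The output matches the first 20 residues of the protein sequence (MASRNSRTNR KEARSSRAPQ).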