Gene Elen_1812 details

Gene Information

Locus tag: Elen_1812
Symbol:
ID: 8416116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2126808
End bp: 2128025
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 67%
IMG OID: 645024783
Product: NusA antitermination factor
Protein accession: YP_003182166
Protein GI: 257791560
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
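
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the final stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2126808
end_bp = 2128025
gene_length_bp = 1218
protein_length_aa = 405

# Assumption: coordinates are 1-based and inclusive,
# so the covered span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is the stop codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields of the record.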


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.768572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.780968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAAGTT CAGAACTGAT TGAGGCGTTG CAGGCGCTGG CGCATGAGCG CAAGATCGAC 
GAGTTCTACC TCATCGAACG CCTCGAGGCA TCGCTCGCCA AGAGCTACCA GCACATCCTC
GATCTCGAGT GGGACGCCCG CGTGACCATC GACCGCCAGA CGGGCCACAT CTACGTGTAC
GAGCTGGTGC CGGTGGGCGA GCCCGACGAG GAGACCGGCG AGTACAGCGA GTTCGAGGAG
CGCGACGTCA CCCCCGACGA TGTCAGCCGC ATCGCCGCGC AGAACGCCAA GGGCGTCATC
GCGTCCATCG TGCGCGAAGC CGGCCGTCAG TCCATCTACG AAGAGTTCTC GGACCGCGTG
GGCGACCTCG TGACGGGCAC GGTGCTGCAG GGCACGCCGG ACTTCACCAT CATCAAGATT
CGCGACGGCG TGGAGGCCGA GCTGCCCCAT TACGACGTGA AGCGCAACCC CAACGAGCGC
AACGAGCGTC CGAGCAACGA GCACTACCGC CACAACCAGC GCCTCAAGGT GCTCATCATC
GAAGTGCGCG ACCCGAACTC CGACGCGCCG AAGATGCGCG GCGAGCAGGC GCGCCCGGCC
ATCGTGGTGT CGCGCACGCA TCCGGACCTC ATCCGCCGCC TGTTCGAGAT CGAGGTGCCG
GAGATCTACG ACGGCATGGT GGAGATCAAG TCCATCGCCC GCGAGCCCGG CGCCCGCTCC
AAGATCGCCG TGGCGTCGCG CGAGGCGAAC CTCGATCCCG TGGGCGCCTG CGTCGGCCCG
AAGGGCAGCC GCGTTCGCAT GGTGGTGGAA GAGCTGCGCA ACGAGCGCGT CGACGTGATC
CAGTGGGCGG AGGATCCGGC GGTGTACGTG GCCAACGCGC TGTCGCCTGC GAAGGTGACC
CGCGTCGTCA TCGACGAGGA CAACCACTAC GCCACCGTCG TGGTGCCCGA CGACCAGCTG
TCGCTGGCCA TCGGCAAGGA GGGCCAGAAC GCCCGTCTGG CTGCGCGCCT GACCGGCTGG
CATATCGATA TCAAGAGCGC CAGCTTCACG GGCGAGTCGC TGGCTCCGAT GGACAACATG
CTGATCGACG AGGACGAGGC CGCGGACGAC GAAGCCGGTC TGTGCGCCTA CGTGGGCGAG
GACGGGGTGC GCTGCCGCAA CCATGCCCGT CCGGGCAGCC GCTATTGCGG CGTGCACGCC
GACCTCGACG AAGCATAA
 
Protein sequence
MASSELIEAL QALAHERKID EFYLIERLEA SLAKSYQHIL DLEWDARVTI DRQTGHIYVY 
ELVPVGEPDE ETGEYSEFEE RDVTPDDVSR IAAQNAKGVI ASIVREAGRQ SIYEEFSDRV
GDLVTGTVLQ GTPDFTIIKI RDGVEAELPH YDVKRNPNER NERPSNEHYR HNQRLKVLII
EVRDPNSDAP KMRGEQARPA IVVSRTHPDL IRRLFEIEVP EIYDGMVEIK SIAREPGARS
KIAVASREAN LDPVGACVGP KGSRVRMVVE ELRNERVDVI QWAEDPAVYV ANALSPAKVT
RVVIDEDNHY ATVVVPDDQL SLAIGKEGQN ARLAARLTGW HIDIKSASFT GESLAPMDNM
LIDEDEAADD EAGLCAYVGE DGVRCRNHAR PGSRYCGVHA DLDEA