Gene Information |
Locus tag | Elen_1812 |
Symbol | |
ID | 8416116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2126808 |
End bp | 2128025 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024783 |
Product | NusA antitermination factor |
Protein accession | YP_003182166 |
Protein GI | 257791560 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
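The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, not statements from the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2126808
end_bp = 2128025

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1218 0 405
```

Under these assumptions the result agrees with the listed Gene Length (1218 bp) and Protein Length (405 aa).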
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.768572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.780968 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAAGTT CAGAACTGAT TGAGGCGTTG CAGGCGCTGG CGCATGAGCG CAAGATCGAC GAGTTCTACC TCATCGAACG CCTCGAGGCA TCGCTCGCCA AGAGCTACCA GCACATCCTC GATCTCGAGT GGGACGCCCG CGTGACCATC GACCGCCAGA CGGGCCACAT CTACGTGTAC GAGCTGGTGC CGGTGGGCGA GCCCGACGAG GAGACCGGCG AGTACAGCGA GTTCGAGGAG CGCGACGTCA CCCCCGACGA TGTCAGCCGC ATCGCCGCGC AGAACGCCAA GGGCGTCATC GCGTCCATCG TGCGCGAAGC CGGCCGTCAG TCCATCTACG AAGAGTTCTC GGACCGCGTG GGCGACCTCG TGACGGGCAC GGTGCTGCAG GGCACGCCGG ACTTCACCAT CATCAAGATT CGCGACGGCG TGGAGGCCGA GCTGCCCCAT TACGACGTGA AGCGCAACCC CAACGAGCGC AACGAGCGTC CGAGCAACGA GCACTACCGC CACAACCAGC GCCTCAAGGT GCTCATCATC GAAGTGCGCG ACCCGAACTC CGACGCGCCG AAGATGCGCG GCGAGCAGGC GCGCCCGGCC ATCGTGGTGT CGCGCACGCA TCCGGACCTC ATCCGCCGCC TGTTCGAGAT CGAGGTGCCG GAGATCTACG ACGGCATGGT GGAGATCAAG TCCATCGCCC GCGAGCCCGG CGCCCGCTCC AAGATCGCCG TGGCGTCGCG CGAGGCGAAC CTCGATCCCG TGGGCGCCTG CGTCGGCCCG AAGGGCAGCC GCGTTCGCAT GGTGGTGGAA GAGCTGCGCA ACGAGCGCGT CGACGTGATC CAGTGGGCGG AGGATCCGGC GGTGTACGTG GCCAACGCGC TGTCGCCTGC GAAGGTGACC CGCGTCGTCA TCGACGAGGA CAACCACTAC GCCACCGTCG TGGTGCCCGA CGACCAGCTG TCGCTGGCCA TCGGCAAGGA GGGCCAGAAC GCCCGTCTGG CTGCGCGCCT GACCGGCTGG CATATCGATA TCAAGAGCGC CAGCTTCACG GGCGAGTCGC TGGCTCCGAT GGACAACATG CTGATCGACG AGGACGAGGC CGCGGACGAC GAAGCCGGTC TGTGCGCCTA CGTGGGCGAG GACGGGGTGC GCTGCCGCAA CCATGCCCGT CCGGGCAGCC GCTATTGCGG CGTGCACGCC GACCTCGACG AAGCATAA
|
Protein sequence | MASSELIEAL QALAHERKID EFYLIERLEA SLAKSYQHIL DLEWDARVTI DRQTGHIYVY ELVPVGEPDE ETGEYSEFEE RDVTPDDVSR IAAQNAKGVI ASIVREAGRQ SIYEEFSDRV GDLVTGTVLQ GTPDFTIIKI RDGVEAELPH YDVKRNPNER NERPSNEHYR HNQRLKVLII EVRDPNSDAP KMRGEQARPA IVVSRTHPDL IRRLFEIEVP EIYDGMVEIK SIAREPGARS KIAVASREAN LDPVGACVGP KGSRVRMVVE ELRNERVDVI QWAEDPAVYV ANALSPAKVT RVVIDEDNHY ATVVVPDDQL SLAIGKEGQN ARLAARLTGW HIDIKSASFT GESLAPMDNM LIDEDEAADD EAGLCAYVGE DGVRCRNHAR PGSRYCGVHA DLDEA
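The gene sequence starts with GTG while the protein starts with M. Under translation table 11, GTG can act as a start codon and is then read as methionine. The sketch below shows one way to strip the 10-base blocks, compute a GC fraction, and translate a fragment. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, only the common start codons ATG, GTG and TTG are handled (a subset of the table 11 starts), and the fragments are the first 30 and last 18 bases of the gene sequence above, so the GC value printed is for the fragment only, not the whole gene.

```python
# Standard codon table, which table 11 shares for non-start positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the common bacterial start codons are treated as initiators.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


head = "GTGGCAAGTT CAGAACTGAT TGAGGCGTTG"  # first 30 bases of the gene
tail = "GACCTCGACG AAGCATAA"               # last 18 bases of the gene

print(translate_cds(head))  # MASSELIEAL
print(translate_cds(tail))  # DLDEA
print(gc_fraction(head))    # 0.5
```

The two translated fragments match the first and last residues of the protein sequence listed above.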