Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1810 |
Symbol | |
ID | 8416114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2123716 |
End bp | 2126478 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024781 |
Product | translation initiation factor IF-2 |
Protein accession | YP_003182164 |
Protein GI | 257791558 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0893364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.24179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCA TGCGAGTACA TGAGTTGGCG AAGGAATTCG ATATGTCGAG CAAAGAGCTG CTCGACAAGC TGCAGCAGAT GAAGATCCCC GCGAAGAGCC ATGCGAGCAT GCTCGCGGAC GCCTACGTCG ACAAGATCCG CAAGAACCTG GAGCCCGAGA TCAAGCAGCG CGCCGGGCAG TTGGAGGACG AGGAGGCCAG GAAGCTGGCC GAAGAGCGCG CCGAGGCCGA GCGCAAAAAG GCCGAGGAGG AGCGCGCCCG TCGCGAAGCG GTGGAGCAGG AGCGCGCCGC CCGCGAGGCC GAGCGCGCCC AGCGAGCCGA AGGCTCGTCC GACGACGGGC GCGCGCAGGG CGCCGACGGA GAAGGCGGCC CGAAGAAGGC GCCCGTTTCC TCGCCGTTCG AGAGCCTGGC GAGCCAGATC GAGAGCGAGA AGGAACGCGT GGCCCGCGAG GCCGCCGAGG CCCGCGCCCG CGCCCGCCAG GCCAAGATGG CTGCAGAGGT GGCCAAGAAG CAGGCGGTGG AAGAGGCGCT GCGCAACCGC AACGCGAAGG GCTCCAAGAG CGCTTCGAGC GCGTCGACGC CGAAGAGGCC CGCGCCGGTG CTGGGCGCCA AGAAGCCTGC GTTCGACTCG CTGCTGTCGC AGATCGAGGC CGAGAAGCAG CGCATCGAGG CGCAGAAGCA GGCTGCTCCC GCCGCCCGCG ACGCCGCGCG CAAGGGCTCC AAGCCCGAGC GTCCGGCCAA GAAGGGCAAG CGGGGCGGGC ACGTCGAGCA GATCGTGCCC GAGCTCGAGC AGCAGGCAGC CCAGCCCGAG GACCGCTACG CGCAGATGGC CGTGCAGGCC GAGAAGCTGC AGCGCGACAA GGTGCTGGCA GAGGCTCGTG CCGCCGTCGC CGCCGCGTCC ACGCATGAGG GCGAAGGCCG CCGTAAGAAG CGCAAGGAGA AGCGCGAGGC CGAGAACCGC GAGCGCATGG AGCTGGAGGC CATCGAGAAG GGCCTCGATC CCACGCTGGT GCTCGACGAT TCCGTCGTGG AGATCCCCCA GGGAGCCACG GTGGCGAAAT TCGCCGAGAT CCTGGGCGTG CAGCCCAACG ACATCATCAA GCGCCTGTTC ATGCTGGGCC AGGTGCTCAC CCTGACGCAG TCCATGAGCG ACGAGCTGGT CGAGCTCATC GCCGACGACA TGGGCCGCAA GGTGCGCGTC GTGTCTCCGG AGGAGGAGTA CGCGGTGGTG TACCACGACA AGGACGAGGA CCTCAAGCCC CGCCCGCCCG TGGTCACCGT CATGGGCCAT GTCGACCACG GCAAGACGTC GCTTCTGGAC GCCATCCGCG ACACGGGCGT CGTGGCCAGC GAGGCCGGCG GCATCACCCA GCACATCGGC GCGTCGGTGG TGGAGATCGA CGGCAAGCAG ATCACGTTCA TCGACACGCC GGGCCACGAG GCGTTCACGG CCATGCGCGC CCGCGGCGCC CAGGTCACCG ACGTCATCGT GCTGGTGGTG GCCGCCGACG ACGGCGTCAT GCCGCAGACC ATCGAGGCCA TCAACCACGC GAAGGCGGCC GAGGTGCCCA TCGTGGTGGC CGTCAACAAG ATCGACAAGC CGGGCGCGAA CCCCGACCGC GTGCGCCAGG AGCTGGTGGA ATACGGCGTC ATTCCCGAGG AGTGGGGCGG CACCAACATG TTCGTGGAGG TGTCGGCCAA GCAGCGCCTG CACATCGACG ACGTGCTGGA GACGATCATC CTCCAGGCCG ACGTGCTGGA GCTCAAGGCC AACCCCGACG CCGAGGCCTC CGGCTTCGTC ATCGAGGCGA ACCTCGACAA GGGTCGCGGT CCCGTGGCCA CCGTGCTCGT GCAGCGCGGC ACGCTGCACC CGGGCGACGT CGTGGTGGCC GGCACCTCGT ACGGCCGTGT CCGCGCGCTG GTGGATCCGC ATGGCAAGCA CGTCGATTCG GCCGGTCCCG CCGATCCGGT GGAGATCTTG GGCCTGAACA GCGTGCCCAC CGCGGGCGAC GAGTTCCGCG TGTTCGAGGA CGAGCGCGAC GCCCGCAAGC TGGCCGAGGA ACGCGCGTTG CGCGCCCGCC TGGCCGAGCA GGAATCCAAG AGCCACATGA GCCTGGACGA CCTGTTCAAC CGCATCGAGG AGGGCAAGCA GACCGACCTG AACCTCATCG TGAAGGCCGA CGTGCAGGGC TCCATCGAAG CGCTGCGCGA CGCGTTCGAG AAGATGGACC AGTCCGAGGT GCGCATCAAC ATCGTGCACT CGGCCGTGGG CGGCATCACC GAGACCGACG TCACGCTGGC ATCGGCCTCC GACGCCATCA TCATCGGCTT CAACGTGCGC CCCACGGGCA AGTCGAAGCA GCAGGCCGAG AAGGAGAAGG TGGACATCCG TCTGTACAGG ATCATCTACC AGGCCATCGA GGAGATCAAC GCGGCTCGCG TGGGTCTGCT GTCCCCCGAC ATCGTGGAGG AGGACACCGG CATCGCCGAG GTGCGCGAGA CGTTCAAGGT GCCGAAGGTG GGTACGATCG CCGGCTGCTA CATCGTGGAA GGCGAGATCC ATCGCGACGA CAAGGTGCGC ATTGTCCGCG ACGGCACCAT CATCTTCGAG GGCGTCATGG AATCGCTGCG CCGCTTCAAG GACGATGTGA AGTCCGTGAA GCAGGGCTAC GAGTGCGGCA TCGGCATCGA GAAGTTCCAG GACCTCAAGA TCGGCGACCA CATCGAAGGT TACACCGTGA AGGAAGTCGA GCGCACCGAG TAA
|
Protein sequence | MASMRVHELA KEFDMSSKEL LDKLQQMKIP AKSHASMLAD AYVDKIRKNL EPEIKQRAGQ LEDEEARKLA EERAEAERKK AEEERARREA VEQERAAREA ERAQRAEGSS DDGRAQGADG EGGPKKAPVS SPFESLASQI ESEKERVARE AAEARARARQ AKMAAEVAKK QAVEEALRNR NAKGSKSASS ASTPKRPAPV LGAKKPAFDS LLSQIEAEKQ RIEAQKQAAP AARDAARKGS KPERPAKKGK RGGHVEQIVP ELEQQAAQPE DRYAQMAVQA EKLQRDKVLA EARAAVAAAS THEGEGRRKK RKEKREAENR ERMELEAIEK GLDPTLVLDD SVVEIPQGAT VAKFAEILGV QPNDIIKRLF MLGQVLTLTQ SMSDELVELI ADDMGRKVRV VSPEEEYAVV YHDKDEDLKP RPPVVTVMGH VDHGKTSLLD AIRDTGVVAS EAGGITQHIG ASVVEIDGKQ ITFIDTPGHE AFTAMRARGA QVTDVIVLVV AADDGVMPQT IEAINHAKAA EVPIVVAVNK IDKPGANPDR VRQELVEYGV IPEEWGGTNM FVEVSAKQRL HIDDVLETII LQADVLELKA NPDAEASGFV IEANLDKGRG PVATVLVQRG TLHPGDVVVA GTSYGRVRAL VDPHGKHVDS AGPADPVEIL GLNSVPTAGD EFRVFEDERD ARKLAEERAL RARLAEQESK SHMSLDDLFN RIEEGKQTDL NLIVKADVQG SIEALRDAFE KMDQSEVRIN IVHSAVGGIT ETDVTLASAS DAIIIGFNVR PTGKSKQQAE KEKVDIRLYR IIYQAIEEIN AARVGLLSPD IVEEDTGIAE VRETFKVPKV GTIAGCYIVE GEIHRDDKVR IVRDGTIIFE GVMESLRRFK DDVKSVKQGY ECGIGIEKFQ DLKIGDHIEG YTVKEVERTE
|
| |