Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1936 |
Symbol | |
ID | 8416243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2268830 |
End bp | 2270635 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024909 |
Product | hypothetical protein |
Protein accession | YP_003182289 |
Protein GI | 257791683 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000210591 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.10336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAA ACGATTTCCG CAAGGAGTAC GAGCGCATGC AGCACCAGGT GCGCGCGTCG TCCGATCTCA AAGAACGCAC GCTAGCCGCC GCCGAGCGGG CAGCCGACCG CTTCGCCTCC TCCGCTCAGC CGGTCGCGAC CGCGTCAGCC AAGCGCCCAC ATCGGCGCGC CGGATCGCGC AGCGGCGGCG TCGCAGTTGC GCGTCGCTGG GGTCTGCCCG CCGCAGCCTG CCTCGTCGCC GCGGCCATCG TCGCCGGCGG CGTGCCCATG GTCATGGGCG CGATGGACGC GGACGGCCAT ACGGCCATCT CCCTGAGCGA CGCCCAACAG GCAAGCGGCT TCGCCGTGCG CGCCTACGCC TCCGACGGCA GCGCGCCGCT CGCGCCCGGC GAGGGGGGCA CCGTTGCGTT CGACCGCGAC TTGGGCTACC GCTTCTCAGG AGGCGACGAC TACAAAGTGA GCGGCTTCTT CACCGGCTGC CTCTTCCACG TTGAGGGCGA GGGCATCTCC CGCGTGCAGG CGAACCTGAC GGGCGGAGCC CTGTACCGCG TGACGTTCGA AGACGGACCG ACCGACCCCG ACGACCCGCG CATGGGCGAG CTGGCAAGCT GGAAGCCCAC GGCGCGCGGT ACCGGCGAGT ACTACGGCGG CTACGATTTC GTCGGAAGCT CCATGCGGAA CGGAGAGAGT AAGCTGAGCC TCGCGAAGCT CATGGGTTCC ACCATCGACG TCTCCGCAAG CGACGACCCC GGCATCGCGG ACGGCACGAC GAGTTTCGGC CTCTGGACGA ACGAGGGCGA GCCTCCTGAA AACATCATGG GCGACCTGCA ATCCCCCGTC ATCGACCTGT TCGAAGGACA GACGCTCACC GTCACCGTGA CGTTCGAGGA CGGGCGCACG TCCACCCAGG CCATCGAGCT GCACGCGGCC AACTTCGAGA CCGAAATGGT CGACGGCACC CCCCGCCTGA CCACCCGTCT CGCCGCAGAC GACGCCGAAG CCCCCTCGGC AGCGAAATCG CTCTACGGCA TCGTGGTGAA AGCGGGAAGC GGCCCGTTCC CCTTCCCGCT CGACGACGCG AACGACCGCG CCGACGAAGT GCTGCCCGCG TCGACCATCG AGCGCCAGGA CGATACCTGG CGGGCAACCG TCGAGGAGAA CGGCGCGCGC GTCGACGCGA CCCTGCCCGA AAACGCGCTC ACACCGTCCG ACGGCGAAGT CGCGTTCGAC TTCGGATACG AAAGCACCGG CTCCTCTCAC CCAGACTCCG AAAGCTCGCA ACAGCCGACC GCGCGCTTGG CCATGAGCTC CCCTTCCATC TCGCTCTCCG ACACTCTTCC GGGCGGAAAG GCGCTCGACG ACTGCCTCTT TGTCGTGGAC GGATGGCTGG GCAACGCGCG CTACATGGAC AAATGCTCGC GCGAGGTGTG GGGCTACGGC TACAACGACG ACGGCACGCT TACGAGCGAC GACTACCGTT ACGCGTCCAC GACGGTGACG CTGCGCAACC TTGAAGATAC GGCCGTCCCC GTTTGGACGC CCGTGCTCTA CGATTTCGCC CTGCGCAACG ATGACGGAAC GCTCGACATG GTGCGGACGG GTTACGATCT GGACTTCGAG GCAACGGGCG ACACCGTGCC CTCCGACGAC CCCCAGCACG TCGTCATCGC GCCGGGCGGC ACAGTGCAGC TGACCGTGGT GCGCGTCCTG CCCACGTACG TCCTGGAGAG CGGAAACCTG GTGCTCGTGC CGACCGACGA CGGCAGCCCG TTCTCCCAGG CCTTCTCCCT CGGCGGGCAG ATCTAG
|
Protein sequence | MNENDFRKEY ERMQHQVRAS SDLKERTLAA AERAADRFAS SAQPVATASA KRPHRRAGSR SGGVAVARRW GLPAAACLVA AAIVAGGVPM VMGAMDADGH TAISLSDAQQ ASGFAVRAYA SDGSAPLAPG EGGTVAFDRD LGYRFSGGDD YKVSGFFTGC LFHVEGEGIS RVQANLTGGA LYRVTFEDGP TDPDDPRMGE LASWKPTARG TGEYYGGYDF VGSSMRNGES KLSLAKLMGS TIDVSASDDP GIADGTTSFG LWTNEGEPPE NIMGDLQSPV IDLFEGQTLT VTVTFEDGRT STQAIELHAA NFETEMVDGT PRLTTRLAAD DAEAPSAAKS LYGIVVKAGS GPFPFPLDDA NDRADEVLPA STIERQDDTW RATVEENGAR VDATLPENAL TPSDGEVAFD FGYESTGSSH PDSESSQQPT ARLAMSSPSI SLSDTLPGGK ALDDCLFVVD GWLGNARYMD KCSREVWGYG YNDDGTLTSD DYRYASTTVT LRNLEDTAVP VWTPVLYDFA LRNDDGTLDM VRTGYDLDFE ATGDTVPSDD PQHVVIAPGG TVQLTVVRVL PTYVLESGNL VLVPTDDGSP FSQAFSLGGQ I
|
| |