Gene Information |
Locus tag | Elen_2601 |
Symbol | |
ID | 8416926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3036680 |
End bp | 3037735 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025580 |
Product | hypothetical protein |
Protein accession | YP_003182942 |
Protein GI | 257792336 |
COG category | |
COG ID | |
TIGRFAM ID | |
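The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon (both assumptions are consistent with the reported 1056 bp and 351 aa); the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3036680, 3037735)
print(length)                  # 1056
print(protein_length(length))  # 351
```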
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000831419 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000147354 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCTAGGG CGAAAGCGGC CGATACGGTC GAGGTCGTCG AGGTGGAAGC CGAGATCATC GACGGCGCAG AAGATCAGCA GAAGGCGGAG CTGTCCGTCA CCAACAAGCC CGGGAGCATC GAGGCGAACT TCGACGCGTT GGAGGAATAC GTCGATGGCA TCCTCGAAGA TTACGCGGAC TGGGAGCCGT CGGCGGACAA CGCCGAGGAC GTGAAGCAGT GCGACCGCGA GAAGAAGTAC CTCAACGGCC TCGCCTCCCA GCTCGACACG CGGCGCAAGG CCGTCAAGTC GGAATACCTC AAGCCCCTCG ACGCCTTCGA GGCGCGCGCC AACTCGATCC GCGACAAGAT CAAGGCGACG GCCAAGCGCC TCGACGACGT GAAGAAGCAG GCGGATCAGG CGGAGAAGGA CGCCAAGTAT TCCGCGCTCG AAGCCCATTA CTGCGAGTTC GCCGAACTGC TCGCGCCCGT GGTGCCTTAC GCGAGGCTCC ATGATCCCAA GTGGCTCAAC AAGCGCCCCA CCCTGCCGCA GGCCATCAAG GAGCTTGACG CGAAGGTGGA AAAGGTCGCG AACGACTGGG ACAGCCTCAA GAAGCGCAAC CTCGAATTCC ACGATGCTGC AGAGGCGTTC TTCTTCGAGC ACTTGGACCT CGGCGCAGCC TGCACGTATA ACGACAAGCT CGTGGAGGAC CACAGGCGCA TCGAAGAGCT TAAGCGCGAG ATGGCGCAGG AGGAAGAGGC ACAGGAGGCC GCGCCAGCGC AAGAGCGACC GGAGCCGGCA CCTGCGCCGA TGCCCGCGCC GGTGCCCGTG AGCGCGCCGC AGCCCGCCCC CGCGCCCGTC GTCGCGTGCG CGCCGCAGCC CATGCCCGCG CCTATGCCGG AGCCGGTCCC GCCCGACGCA GGCCCTTACG TCATGGTCAT CAACTTGGCG ACGCTCGACC AGATTCAGCA GATCGGCCGG TTCTCCGGGA GCATCGGGGT TACCGGCGTT TTCAAACGCG GCACGCTGCA AGAGGCGTAT ATGCGCGAGG CGGGGGGTGT CGGATATGAC CGGTAG
|
Protein sequence | MPRAKAADTV EVVEVEAEII DGAEDQQKAE LSVTNKPGSI EANFDALEEY VDGILEDYAD WEPSADNAED VKQCDREKKY LNGLASQLDT RRKAVKSEYL KPLDAFEARA NSIRDKIKAT AKRLDDVKKQ ADQAEKDAKY SALEAHYCEF AELLAPVVPY ARLHDPKWLN KRPTLPQAIK ELDAKVEKVA NDWDSLKKRN LEFHDAAEAF FFEHLDLGAA CTYNDKLVED HRRIEELKRE MAQEEEAQEA APAQERPEPA PAPMPAPVPV SAPQPAPAPV VACAPQPMPA PMPEPVPPDA GPYVMVINLA TLDQIQQIGR FSGSIGVTGV FKRGTLQEAY MREAGGVGYD R
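The sequences above are printed in space-separated blocks of 10 residues, so the spaces have to be stripped before any downstream use. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the example input is only the first 20 bases of the gene sequence, so its result illustrates the calculation and is not the record's whole-gene GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGCCTAGGG CGAAAGCGGC")
print(len(prefix))         # 20
print(gc_content(prefix))  # 0.65
```

Applying `clean_sequence` to the full gene sequence and checking that its length matches the reported gene length is a quick way to detect copy-and-paste truncation.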