Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1623 |
Symbol | |
ID | 8415922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1922175 |
End bp | 1923311 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024592 |
Product | hypothetical protein |
Protein accession | YP_003181980 |
Protein GI | 257791374 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0072792 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000117284 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACATTCG GAACCATATT GAGAGAGGCG CGCGAGCGCA AGGGCTACGA TCTCGCGACG GCTGCGCGAC GGCTGCGCAT CCGACCCGAC ATCCTGCGCG CTATCGAGGA AGACGATTTC TCCCGCATGC CGCCGCGCGG CTACGCGCGC AACATGGTGA ACGCCTACGC GCGCCTCGTG GGGCTCAACC CCACCGAGAT GACGCGCATG TACCTGGACG AGGCCTACGC CTACCAGGTG GGACGCGCGC GCAACGGCGC GCAGCCCTCG GGTTTCGATC GAGGAGGATC TTCGCGCACG GGACGCTCGA CCTCGCGGCA GGGCGCTCGC CCCTCGCAGC AAGTCGACGA GCGCCCGCCG CGGCAGAATG CGTTCGGGCG TACGATGTAC GACGACCGGC GCGACTACGG TCGCGATTAC GGCGCGCGCG GGGGTTCGGA GCGCCTCTAT TCCGAGGGAC GCACGCATCC CAGTCGCCAT GCGGCGCTGC CGAACGCCGA GTACACGAAC TTCTACGCAG GGCCGAAGGC CTCGAGCGTC GTGCAGTCGA AGCTGCCTTT CGTCATCGCG GGCGGCGTGA TCCTCGTGCT GCTCATCGTG GTGCTGGTGC TCGTGTTCGG CAACAACGGC GGGAGCTCGA ACGAGGATGT GACGAAGCTG CCGGTGACGG GGCTCGCCGA CCCGACGCAG GACGGCAGCG GCACGGAAGG CGGCGAAACC CCCGCGCAAC CGCAGGCCGA GCCTGTTGAA ACTGCTCCGA CCAGCATCAA GGTGACCTAC ACGATCGCCA AGGACACCCC GGTGTACGCG GTCATCACGA AGGACGGGAC GTCCGAAGAC CAGATGTTCT CGGGCGGCGA GGAGGACACC GTTGAGCTGG CCGAGGGCGA CGTGTGGACG TTCGCCGCTT GGGCGAGCGA CGGCGTGACG ATCAAGGTGG ACGGCGAGGC GGTCAAGTTC GACGGCTCCG ATCCGGCTAC CGGCATGCCC ATGGCCACGG TCGATTTCGA CGCCTACCTG GAGAAGTGGT ACGAGGATCA TCCGGATGCC AAGAAGAAGG GGTCGGCCGA CGCGGACGCC GCCGACAAAG CAGCCGAGGA TGGCGCGAAA ACCGGGGATG GAACATCCGC CGCTTAG
|
Protein sequence | MTFGTILREA RERKGYDLAT AARRLRIRPD ILRAIEEDDF SRMPPRGYAR NMVNAYARLV GLNPTEMTRM YLDEAYAYQV GRARNGAQPS GFDRGGSSRT GRSTSRQGAR PSQQVDERPP RQNAFGRTMY DDRRDYGRDY GARGGSERLY SEGRTHPSRH AALPNAEYTN FYAGPKASSV VQSKLPFVIA GGVILVLLIV VLVLVFGNNG GSSNEDVTKL PVTGLADPTQ DGSGTEGGET PAQPQAEPVE TAPTSIKVTY TIAKDTPVYA VITKDGTSED QMFSGGEEDT VELAEGDVWT FAAWASDGVT IKVDGEAVKF DGSDPATGMP MATVDFDAYL EKWYEDHPDA KKKGSADADA ADKAAEDGAK TGDGTSAA
|
| |