Gene Information |
Locus tag | Elen_1166 |
Symbol | |
ID | 8415457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1401020 |
End bp | 1402048 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024129 |
Product | hypothetical protein |
Protein accession | YP_003181525 |
Protein GI | 257790919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.321304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.665917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCG CGCCCGTTGG AACCCCCGCG CCGCCCTCCC GCGCCCTGCC GCGCATGGCG ATCGCCCTGC TCGCCTGCCT TGCTCTATAC GGCATCGACC TCGCCTGCAT GGCAACCGTC CCCGCAACCG GATTCGACTT CGCGTCGGGC TTTGCGGTGC CGTTCGACCT CATGGTCTTC GTCCCGGCGG TCTTCTACCT CCTCGTGGTG CGACGCTACA GGCTGTCGCC CTTGCTGGTG CTGCCGGTCA TCTGGCTGGG CGGCCTCGTG TCGCTGCAGT TCGCCGCGCC CGGCAAGCCG TCGCTGCTGG CGCCGCTCGG CCTGTGCGCC GTCGGAGTGG AGCTGGGCAT CGCCGCGCGC GAGGTCCTCC GCCTCGTCCG CCGCTTCCGC ACGGCGAAAG CGTCCTCGGA CAACCCGCTC GACTGGTTCT CGGACGCGTT TTCCGCTCTC GCTCGCAACG AGCGGGTCGC GCGCATGGCG GCGCTCGAAT GCGTCATGTG GTACTACGCG TCGGCTTCGT GGCGCCGCGC GCCGCACGTG CCGCACGGCT ACCGGGCGTT CTCGTCCCAC CGGCAAAGCG GCTACGTCGC CGCGGTCGGC GTCATGCTGG TTCTCATCGC CGTCGAGACG GTCGCCGCGC ACCTGTTGGC AGCACGGTTC AGCGTCGCGG CCGCATGCGT GTTGACCGCA CTCTCCCTCT ACACGATCCT CTGGATGATC GCCGAAGCCC GCGCCGTCGT GCTGAACCCG CTTCTGGTGG ACGATGTCGA GCTGGTGGCG CGCTGGGGCA TGCTCGTCTG CGAGCGCATC CCCCTCGACC GCATCGCGCG CGTCGGTTCC CAGGATCCCG CTGTCCCGAA GAGGGAGCTT CTGAACCTGG CGGCCATGGG CGGGCAGGCG TTGTGGATCG AGCTGGCGGA ACCCCTCGAA GTGCGCGGCC TCACCGGAAA GCCCCGCCTC GTGCGCGCGA TCAAGACCAC GCCCGACGAT GCCGCCGCGT TCAAGGACGC GCTTCGCCCG CGAAGCTGA
|
Protein sequence | MPFAPVGTPA PPSRALPRMA IALLACLALY GIDLACMATV PATGFDFASG FAVPFDLMVF VPAVFYLLVV RRYRLSPLLV LPVIWLGGLV SLQFAAPGKP SLLAPLGLCA VGVELGIAAR EVLRLVRRFR TAKASSDNPL DWFSDAFSAL ARNERVARMA ALECVMWYYA SASWRRAPHV PHGYRAFSSH RQSGYVAAVG VMLVLIAVET VAAHLLAARF SVAAACVLTA LSLYTILWMI AEARAVVLNP LLVDDVELVA RWGMLVCERI PLDRIARVGS QDPAVPKREL LNLAAMGGQA LWIELAEPLE VRGLTGKPRL VRAIKTTPDD AAAFKDALRP RS
|
| |
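The coordinate, length, and GC content fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive (which agrees with the stated gene length of 1029 bp) and that the CDS includes its terminal stop codon (which agrees with the stated protein length of 342 aa). All function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Strip the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Values copied from the Gene Information table above.
    length = gene_length(1401020, 1402048)
    print("gene length (bp):", length)
    print("expected protein length (aa):", expected_protein_length(length))

    # Only the first two displayed blocks of the gene sequence are used here;
    # paste the full Gene sequence field to check the whole-gene GC content.
    sample = clean_sequence("ATGCCGTTCG CGCCCGTTGG")
    print("GC fraction of sample:", gc_fraction(sample))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field; the record appears to round that figure to a whole percent.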