Gene Information |
Locus tag | Elen_1696 |
Symbol | |
ID | 8415995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2000919 |
End bp | 2002079 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645024663 |
Product | protein of unknown function DUF1113 |
Protein accession | YP_003182051 |
Protein GI | 257791445 |
COG category | [S] Function unknown |
COG ID | [COG4905] Predicted membrane protein |
TIGRFAM ID | |
|
|
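The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the stop codon; neither is stated in the record, but both fit the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Elen_1696.
length = gene_length_bp(2000919, 2002079)
print(length, protein_length_aa(length))  # 1161 386
```

Both results match the Gene Length (1161 bp) and Protein Length (386 aa) fields.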
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.176656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.89414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA CGGTGAACGA GGAAGGTCGC CTCCCGGTGC CGCTCAAGGT ATTCGGCGTT CTCAGCATAG TAGGAGGACT CGCTTCCCTC GGCGACCTGG CGCCGGTGAT CTTCTCCTTC GTGCAAGGCG TCGGCGGCGG CACGTACCGA ACGGCGTCCA TCGTCATCTT CGCCGGTCTC ATGGCGGTGC TCACGTCATC GGCCGTGCTG TTCGTGTTCT TGGGCGTGCG GCTGCTGCGC AATCGCCGAG CCCACGCGGC GCAGGCGGCG AACACCCTCG CGGCGCTCAC CGTGCTGGCC GTCATCGGCA CGCTGATGCT GTTCGGCTTG TCTGCGCACC ACATCGGCTA CCTCGTTCAG ATCGTCATTC TCGTCGCGGT GACCAGCTAT CTCGACCCGC AGTTGCACGA GGAGCGCCGG CTACAGCGCA AGCTCAAGAA GATGGACAAG GAGCAGCGCT CGGAAGAGAG GAGGTTGGCG CGCGAGCGCA AGCCGAAAAA AGGGTTCATC ACGCTGAACT TCTTCAACCT GTTCTGGATC TTCGTGGTGT GCTGCGTGCT GGGGCTCATC ATCGAGGTGC TGTTCCATTT CGCGCTATAC CATGAGTATC AGGATCGCGC TGGGCTTCTG TTCGGGCCGT TCTCTCCCAT CTACGGTTTC GGCGCTCTGC TCATGACCAT CGCGCTCAAC CGTTTCCACG ACAAGCCCGT GTGGGTGGTC TTCCTCGTGA GCGCCGTCAT CGGCGGTGCG TTCGAGTATT TCACGAGCTG GATCATGGAG TTCTCGTTCG GCATCCGCGC ATGGGACTAT TCGGGCACGT TCTTGTCCAT CGACGGACGC ACGAACTTCG TGTTCATGGT GATGTGGGGC GTGCTGGGCG TGGCGTGGAT CAAGCTTCTG CTGCCGCGGC TTTTGAAGTT GATCAACCTC ATTCCGTGGA ACTGGCGCTA CGCGGTGACG GCCGTATGCG CGGGGCTCAT GCTGGTGGAC GGCGTCATGA CCGTGCAGTC CATCGACTGC TGGTACGCGC GATCGGCGGG CAAGGCGCCC GACACGCCCA TCGAGGAGTT TTACGCGAAG CATTTCGACA ACGCCTACAT GGAGCACCGC TTCCAGACCA TGACCATGGA TGTGAACGAC GCCGCTCGAG CCGATCGGTA A
|
Protein sequence | MNDTVNEEGR LPVPLKVFGV LSIVGGLASL GDLAPVIFSF VQGVGGGTYR TASIVIFAGL MAVLTSSAVL FVFLGVRLLR NRRAHAAQAA NTLAALTVLA VIGTLMLFGL SAHHIGYLVQ IVILVAVTSY LDPQLHEERR LQRKLKKMDK EQRSEERRLA RERKPKKGFI TLNFFNLFWI FVVCCVLGLI IEVLFHFALY HEYQDRAGLL FGPFSPIYGF GALLMTIALN RFHDKPVWVV FLVSAVIGGA FEYFTSWIME FSFGIRAWDY SGTFLSIDGR TNFVFMVMWG VLGVAWIKLL LPRLLKLINL IPWNWRYAVT AVCAGLMLVD GVMTVQSIDC WYARSAGKAP DTPIEEFYAK HFDNAYMEHR FQTMTMDVND AARADR
|
| |
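The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch using only the standard library, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). It also assumes the gene sequence is listed in coding orientation, which its ATG start and TAA end suggest even though the gene lies on the minus strand. The names `translate` and `gc_percent` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last groups of the gene sequence listed in the record.
print(translate("ATGAACGACA CGGTGAACGA GGAAGGTCGC"))  # MNDTVNEEGR
print(translate("GCCGCTCGAG CCGATCGGTA A"))           # AARADR
```

The two fragments reproduce the first ten and last six residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.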