Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1946 |
Symbol | |
ID | 8416253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2282284 |
End bp | 2283894 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645024919 |
Product | C4-dicarboxylate anaerobic carrier |
Protein accession | YP_003182299 |
Protein GI | 257791693 |
COG category | [S] Function unknown |
COG ID | [COG1288] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.185929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.387022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA AGGCAAAAGA AAAGAGCAAG AAAAAGCGAT CTATATCATC GTTTACCATC CTGCTGATCA TCCTGATCGT GCTGGCGCTG GTCACGGTAG TGATGTCGCT GGCCGGTGTG GAAGGGGTCC AAGGCGCCAC GGTCGCCAAT GTGGCCACGG CTCCCGTCAA GGGCTTTACC GACGCCCTGC CTGTTTGTCT GTTCGTGTTG ATCCTGGGCG GTTTCCTGGG TATCGTCACG GAAACGGGCG CGCTGGACGC CGGTATCGCA GCGCTGGTGA AGAAGCTCAA GGGCAATGAG CTCATCCTCA TTCCCATTCT GATGTTCATC TTCTCCATCG GCGGTACGAC GTACGGTATG TGCGAGGAAA CGGTTCCGTT CTACCTGCTG CTCGCGGCCA CCATGGTCGC CGCAGGCTTC GACAGCGTTG TCGGTGCCGC GGTCGTGCTG TTGGGTGCCG GTTGCGGCGT GCTCGGTTCG ACGGTCAACC CGTTTGCCGT CGGTGCTGCC GTGGACTCTT TGAGCTCTTC CGGCATCGTG ATCAACCAGG GCACCATCAT CCTGCTGGGC GTGGTGCTGT GGCTCGTGAC GCTGGCGATC TCCATCGTCT TCGTCATGCG CTACGCGAAG AAGGTCAAGG CCAACAAGGG TTCCACCATC CTGTCCTTGC AGGAACAGGA AACCATGAAG GCCGAGTTCG GCGAGGCTCA GCAGGAAGCT GAAACCGCTG AGGCGAACCC GAACGAGAAG CTTATGACGG GTCGTCAGAA GTGGACGCTC ATCGTGTTCG CCCTGACGTT CGTAGTCATG ATCGTCGGCT TCATCCCTTG GGGCGACTTC GGCGTCGAGG TGTTCGATGC CGGTGCGGCG ACGGAAGAGG TCACCACGCA GGTTAGCGGC GACGACATCT CCGCGGCTTG GACCGACAAG AAGGTTGGTG GCGAGATTAC GTTCGACGGC GATGTGACCG GCACGGTCAC GGCCGAAGAA GAGATCTCCC AGGGTTGGTC CGCGTTCCTG ACGGGTCTGC CGTTGGGTCA ATGGTACTTC GATGAGGCTT CCACCTGGTT CCTCATCATG GCTATCATCA TCGGTATCGT GGGTGGCGTG TCCGAGAGCC GTTTCGTCAA GGCATTCATC AACGGCACCG CCGATATGAT GAGCGTCGTG CTGATCATCG CCATGGCTCG TTCTATCACC GTGCTTATGG GCGAGACCGG TCTCGACATG TGGATCCTGA ACAACGCGGC GAACGCTCTG AACGGTTTGT CGGCGGTCAT CTTCGCGCCG ATGTCGTTCT TGCTGTACAT CGTGCTGTCG TTCTTGATCC CGTCGTCGTC CGGCATGGCC ACGGTGTCCA TGCCCATCAT GGGCCCGCTG GCGAACTCGC TGGGCTTCTC GACCGACGTC ATGATCATGA TCTTCAGCGC CGGCAACGGC CTGGTGAACC TGTTCACCCC GACGAGCGGT GCTATCATGG GCGGTTTGGC GCTGGCCAAG GTGGAATACT CCACATGGCT GAAGTTCGGC GGCAAGCTGT TCGTGGTGCT GGGCGTCGCC TGCGTGATCA TCTTGACGGT TGCGATGATG GTTATCCCGG GCACCGCGTA A
|
Protein sequence | MTEKAKEKSK KKRSISSFTI LLIILIVLAL VTVVMSLAGV EGVQGATVAN VATAPVKGFT DALPVCLFVL ILGGFLGIVT ETGALDAGIA ALVKKLKGNE LILIPILMFI FSIGGTTYGM CEETVPFYLL LAATMVAAGF DSVVGAAVVL LGAGCGVLGS TVNPFAVGAA VDSLSSSGIV INQGTIILLG VVLWLVTLAI SIVFVMRYAK KVKANKGSTI LSLQEQETMK AEFGEAQQEA ETAEANPNEK LMTGRQKWTL IVFALTFVVM IVGFIPWGDF GVEVFDAGAA TEEVTTQVSG DDISAAWTDK KVGGEITFDG DVTGTVTAEE EISQGWSAFL TGLPLGQWYF DEASTWFLIM AIIIGIVGGV SESRFVKAFI NGTADMMSVV LIIAMARSIT VLMGETGLDM WILNNAANAL NGLSAVIFAP MSFLLYIVLS FLIPSSSGMA TVSMPIMGPL ANSLGFSTDV MIMIFSAGNG LVNLFTPTSG AIMGGLALAK VEYSTWLKFG GKLFVVLGVA CVIILTVAMM VIPGTA
|
| |