Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1568 |
Symbol | |
ID | 8415867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1863664 |
End bp | 1865250 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645024537 |
Product | C4-dicarboxylate anaerobic carrier |
Protein accession | YP_003181925 |
Protein GI | 257791319 |
COG category | [S] Function unknown |
COG ID | [COG1288] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.313123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0213036 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACGG AAGCCGTACT GCCCACCAGC ACCGCCTCGC CCAAGGAAGG CCCGCCTAAG AAGAAGAGGA AGCTCAGCTT CCCCACCGCC TTCACCATCC TGTTCGCGCT CACCATCGTC GCGGTGGCGG CTACATGGTT CGTGCCGGCG GGCCAGTACG CGAAGCTCGC GTACAACGCC GACGCGGGCA CGCTTCAGAT CACGAGCCCC CAGGGCGCGG TGAGCGAAGA GCCCGCCACG CAGGAGACGC TCGATGCTAT CGGCGTGAAC ATCGGGATCG ACCAGTTCAC CTCGGGCGCG CTGTCCAAGC CCATCTCGGT TCCCAACACG TACGAGCGCC TGGAGCAGCA GCCCAAAGGC ATCGCCGACA TCACGGTGAG CATGGTGTCG GGCACGGTGG AGGCCGCCGA CATCATGGTG TTCATCCTTG TGCTGGGCGG CCTTATCGGC GTGGTGAACG CGAGCGGCGC GTTCGAGTCG GGCCTCATGG CCCTCACGAA GAAGACGAAG GGCCATGAGT TCCTACTCGT GTTCCTCGTG AGCGCCCTCA TGGTCCTCGG CGGAACCACG TGCGGTCTCG AAGAAGAGGC CGTCGCCTTC TATCCCATCC TCGTGCCTAT ATTCCTGGCG CTCGGCTACG ATTCCATCAT CTGCGTCGGC GCCATTTTCC TGGCAGGTTC CATGGGCACG ACGTTCTCCA CCATCAATCC GTTCTCGGTG GTCATCGCCT CGAACGCCGC GGGCGTGAAC TTCACGCAGG GCATCGAATG GCGTATCGCG GGATGCGTCG TGGGCGCCAT CGTGGTCATC GCGTATTTGT ACTGGTACAG CCGCAAGATC AAGGCGAACC CGGCGTTCTC CTACACCTAC GAGGATCGCG AGAAGTTCGC CAAACTGTAC AACGTGGAGG CGGGGGAGAC GAAGGAGGCG CGCGCGACCG GCTTCACGCT TAAGAAGAAG GCAATCCTCG TGCTGTTCGT GGCGGCGTTC CCCATCATGG TGTGGGGCGT CGTGAGCCAG GGCTGGTGGT TCCCGCAGAT GGCAGCCTCG TTCCTGGCCA TCGCGATCAT CATCATGTTC CTGTCGGGCA TCGCCGAGAA GAAGGTGGTG GATGCGTTCA TCCACGGAGC GTCAAGCCTG GTGGGCGTGT CGCTCATTAT CGGGCTCGCG CGCGGCATCA ACCTGATCAT GGAGCAGGGG CTCATCTCCG ACACGCTGCT GTTCTGGTCG TCGGGGCTCG TGCATGGCAT GACCGGACCT GTCTTCATCC TGGTCATGAT GCTGATCTTC TTCCTGCTGG GCTTCGTGGT GCCGTCGTCG TCGGGTTTGG CCGTGCTGTC CATGCCTATC ATGGCGCCGC TGGCCGACAC CGTGGGCATC CCGCGCTCGG TGGTGGTGTG CGCCTACCAG TGGGGCCAAT ACGCCATGCT GTACCTCGCG CCCACCGGCC TCGTGCTGGC CACGCTCACG ATGCTGGACA TGAAGTACTC CAAGTGGCTC AAGTTCGTGT GGCCTATGGT GCTGTTCGTG CTCATCTTCG GCGGCATCCT GCTGGTCGCC CAAGTCCTCG TCTACGGCGC CGCCTGA
|
Protein sequence | MSTEAVLPTS TASPKEGPPK KKRKLSFPTA FTILFALTIV AVAATWFVPA GQYAKLAYNA DAGTLQITSP QGAVSEEPAT QETLDAIGVN IGIDQFTSGA LSKPISVPNT YERLEQQPKG IADITVSMVS GTVEAADIMV FILVLGGLIG VVNASGAFES GLMALTKKTK GHEFLLVFLV SALMVLGGTT CGLEEEAVAF YPILVPIFLA LGYDSIICVG AIFLAGSMGT TFSTINPFSV VIASNAAGVN FTQGIEWRIA GCVVGAIVVI AYLYWYSRKI KANPAFSYTY EDREKFAKLY NVEAGETKEA RATGFTLKKK AILVLFVAAF PIMVWGVVSQ GWWFPQMAAS FLAIAIIIMF LSGIAEKKVV DAFIHGASSL VGVSLIIGLA RGINLIMEQG LISDTLLFWS SGLVHGMTGP VFILVMMLIF FLLGFVVPSS SGLAVLSMPI MAPLADTVGI PRSVVVCAYQ WGQYAMLYLA PTGLVLATLT MLDMKYSKWL KFVWPMVLFV LIFGGILLVA QVLVYGAA
|
| |