Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1242 |
Symbol | |
ID | 8415534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1488767 |
End bp | 1489705 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024206 |
Product | protein of unknown function DUF6 transmembrane |
Protein accession | YP_003181601 |
Protein GI | 257790995 |
COG category | [R] General function prediction only |
COG ID | [COG5006] Predicted permease, DMT superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0369938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCA TGCAGAAGAA CACCTTGAAA TACTCCGCCG TCGTGTTCCT CGGCGGCGCC AGCTACGGCG TGATGGCCGC CACGATCAAA TGCGCGCTCG CCGAGGGGTT CTCGTGGACC CAAACCGCCG CCAGCCAGGC GTTCTTCGGC GCGCTGCTGT TCGCCGTCGC CTTGGCCGCG CTGACCGTCC TCGGCAAGCG CCCTGTACCG CTGTCGCCCA AGCGCGTGCT GTCCTTGCTG GGGCTGGGCC TCGCCACATG CACCACCTGC GTGCTGTACA ACTTCGCGCT CACCATGCTG CCCGTGTCCG TGGCCATCAC GCTGCTGTTC CAATTCACCT GGATCGGCAT CGTATTCCAG GTGGTCGCCA CCCGTCGCAA ACCCCGCCTC GCTGAAATCG TCGCGGCGGC CGTCATCCTC GGAGGCACCC TCTTGGCGAG CGGCCTGTTC TCGAACACGG TGGGTCATCT CGACCCGCTC GGCATCCTCT GCGCGCTGCT GTCGGCCGTC AGCTGCGCGA CGTTCATGTT CCTGTCGGCG CGCGTCGGCT GCGACCTGCC GCCAATCGAG CGCGGACTCG TCGTGTGCCT GGGCGCGTGC ATCCTCGGCT TCGCCGTGTG CCCCGACTAC TTCTCAAGCG GCGCGCTGCA GGCCGGCATT TGGAAGTACG GGCTGATCCT GGGCGTATTC GGGCTGTTCG TGCCCGTGGT GCTGTTCGGC ATCGGAACGC CGCACCTCTC GGCGGGGCTG TCCACCATCA TGGCATCGTC GGAGCTGCCA TGCGGAATAG CGATATCGGT GCTCGTCCTG TCCGAACCCG TGGACGCGCT TCAGACGGCC GGAATAGCCG TCATCATGCT GGGCGTGGCG GTATCGCAGC TGCCCAACCT GCTGCCGCAA GGCGCGTCCG CTCGCTTTCG AAAAAAACGC TTGCAATGA
|
Protein sequence | MAAMQKNTLK YSAVVFLGGA SYGVMAATIK CALAEGFSWT QTAASQAFFG ALLFAVALAA LTVLGKRPVP LSPKRVLSLL GLGLATCTTC VLYNFALTML PVSVAITLLF QFTWIGIVFQ VVATRRKPRL AEIVAAAVIL GGTLLASGLF SNTVGHLDPL GILCALLSAV SCATFMFLSA RVGCDLPPIE RGLVVCLGAC ILGFAVCPDY FSSGALQAGI WKYGLILGVF GLFVPVVLFG IGTPHLSAGL STIMASSELP CGIAISVLVL SEPVDALQTA GIAVIMLGVA VSQLPNLLPQ GASARFRKKR LQ
|
| |