Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0414 |
Symbol | |
ID | 8414698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 531168 |
End bp | 532466 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 645023389 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003180792 |
Protein GI | 257790186 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2807] Cyanate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.743421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0000294242 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAGCT CGCATGCTCT GAAGAACTCA GGAAAGTTCG CTTGGATCTT GGTGCTCGGC CTTGGACTCA TGTCGGCGGG CACAACCGGT TCCTATAGCG TCGTCGCAGG CTGTTTCATG ACGCCGGTCT GCGAGGATCT GGGAATCGAC TACAACATGT TCTCCTACTA TTTCACGGCA ACGCTGGTGG GGGTCGCCGT CGGCATGATG TGCGTCGGCA AGATCTTGCC GAAAGTGGTC GGTCGGTGGA CCCATGTCGC GGTGGCAGCG GTTCTACTGG CGGCTGGCGC GGCCATGGCG TTTTACTCGA ATGTCTGGCT GTTCCTGCTT TCGGGGCTCA TCATCGGATT CGGCATGTCC TTCACCACGG GGCTTTGCAT GTCAGCCGTC ATCGATCAAT GGTTCAGGAA GAAGGCGGGC CTCGCCATCG GCTTGGCTTG GACGGTCAAC TCGGTGTACA TGGTCGTCAT GAGCCCGGTC ATCACCGCCG TCATCGAATC TGTGGGTTGG AGGAACGGGT ACCTGGTGCT CGCTGCCGTC TCGGCCCTCG TAGTCATACC GTCTATCGTG TTCATCATCC GGTTCAAGCC CGCGGATAAA GGCATGCTTC CCTACGGTTA CAGCGAAGCC GAATCGATCG GTCAAGATGC AGGGGTGGAA ATCGCGGCCA CGCGAGGCGT GCCGTTCAAG GTTGCGGTGA AGTCGCCCGC GTTCATCGCC TGCGTCTTGT TCCTCTGCCT CGTGCAGATC ACCGTGTGCA TGAACCAGCT TTTCCCAACA TATGCAGTGG AGGTCGGCCT TGGCGCGATG ACGGGCGGAA TCATGGTTTC AGCTGCGTCC ATGTTCGATA TCTTCCTCAA CCCTGCCGTA GGGTCCTCAT GCGACAAGCT CGGCTCGTTC ATAGCCCTGG TGCTGTGGGT AGTCGTGAGC ATCCTGTCGT TCGTGATGCT CATATTCGCA GCTGGCCAAC CGTGGCTCGC CATTCTGGGA GCAGGCGTGA ACGACGTTAT GTACGTGGTG GCCGGAGCTG GCCTTACGTG CCTGCTCATG AGCGTGTTCG GCTCTCGCGA TTACGGTCGG ATCTTCGGTA TCGTCTGCGG CGTGGCTTAT ATCGCCGGGG CGTTCGGCAT GCCCATCATG ACCGCTGTCT ACTCGGCAAC TGGAACGTTC AATGCGGTGT TCGGGCTGTG CATCGTCATG AATGTCGCCA TAATCGGATT GATCTTGGTC ATCAAGGCTA CCAGCAAGAA GCTGCAGTGG GTCGAGGGAG AGGACGAAAC GCCTGTCGCT CTGCATTGA
|
Protein sequence | MESSHALKNS GKFAWILVLG LGLMSAGTTG SYSVVAGCFM TPVCEDLGID YNMFSYYFTA TLVGVAVGMM CVGKILPKVV GRWTHVAVAA VLLAAGAAMA FYSNVWLFLL SGLIIGFGMS FTTGLCMSAV IDQWFRKKAG LAIGLAWTVN SVYMVVMSPV ITAVIESVGW RNGYLVLAAV SALVVIPSIV FIIRFKPADK GMLPYGYSEA ESIGQDAGVE IAATRGVPFK VAVKSPAFIA CVLFLCLVQI TVCMNQLFPT YAVEVGLGAM TGGIMVSAAS MFDIFLNPAV GSSCDKLGSF IALVLWVVVS ILSFVMLIFA AGQPWLAILG AGVNDVMYVV AGAGLTCLLM SVFGSRDYGR IFGIVCGVAY IAGAFGMPIM TAVYSATGTF NAVFGLCIVM NVAIIGLILV IKATSKKLQW VEGEDETPVA LH
|
| |