Gene Information |
Locus tag | Elen_3096 |
Symbol | |
ID | 8417432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3600916 |
End bp | 3602211 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645026076 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003183427 |
Protein GI | 257792821 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | |
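The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the protein length excludes the stop codon. A minimal Python sketch of that check follows. The function names are hypothetical, and the coordinate convention is an assumption inferred from the numbers in this record, not a documented rule of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Elen_3096).
length_bp = gene_length(3600916, 3602211)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Both results match the Gene Length (1296 bp) and Protein Length (431 aa) rows of the record.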
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.00556654 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTACCA CTACAGCCAA AAAGAAGGGG ATCCACTTCG CATGGTTCGT CCTCATCGGC GTTCTGCTGA TGATGGGTCT GTGCCGTGGC GGCATCAACA GCGGCATGGG CTTGTTCTTC CAGCCTATCA GCGCCGACAT GCACATCGGC GTGGGCGAAG TGGCAATGAT GTCGTCCATC TCGGCATTGA TCACTATGTT CTGGAGCCCG TTCGCGGGTC GCCTGCTCGA CAAGTTCGAC ATCCGCATCA TCACCGTAGT CGCCGTCGCC ATCCAGGCGG GCTGCTTCGC CGCGCTGTCG CTGGTCGACG CAGTGTGGGG CTTCTACGCG CTGGCCGGCA TCATGGCGTT CGGCTCGGTG TTCGCCACGC AGCTGGTGGG TCCCATGATG ATCAACCGCT GGTTCAAGGA TAAGAACGGC CTGGCCATGG GCATCATGAT GAGCTTCGTG GCCATCTGCT CCGCCGTTCT CTCGCCGGTC GTCGCCTCCA TCATCGCATC GAACGGCTGG CGCATGGGCT ATATCGTGCT GGGCGTGCTC GCGCTGGTTA TCGTGATCCC GGCCGTGCTC ATCTGGTTCC GCACGCCCGA GCAGAAGGGG CAGCTGCCCC TCGGCGCCAC CGAGGCCGAC ATCGAAGCCG CCAAGAACGC CGATCCGAAA GCCGCCGCCG AGGCCGCGAA GAACCTGCCG GGCCTCACGT CCAAGCAAGC GCTGAAGACC CCCACGTTCT GGTTCTTCTT CATCTTCATG GTGCTGCTCA CCGGAACGCT GGCGTTCGCC TCCATCGTTC CCACCCTGGC TATCGAAGCT GGTTTCGATA CCGTGACCAG CGGATTCGCT ATGACGGCGT ACATGATCGG CACGGCTATC GCCGCCGTCG TGTTCGGCAC GATCTCCGAC AAGCTGGGCC CCCTCAAGGC CACGATGGTG GCGTGCGCGT GCGGTTTCAT CGCCCTCTTG GGCCTCATCT TCTTCCGCAC GAACCTGTAC ATGTTCTTCG GCTCGCTGTT CTTCTACGGC TGCCTGTCCG CCACGCTGGG CGTCATCGGC CCGCTGGTGC TAGGCACGCT GTTCGGCCAG AAGGAATTCG GCTCCATCTA CGGCATCGTC ATGATGGCCA CCGGCATCGG CTCCATGATC CTCATCCCGG CGTACGGCTT CATCTACGAT GCGACGGGCA GCTTCACGCC CGCCCTCATC ATGATCTTCT GCTTCATCGT CGTCTGCCTG ATCTCCATGA TCATGGCCTT CAAGACCGGC AAGAAGGTTC AGGCCATGTG GATGCCGAAG GCGTAA
Protein sequence | MATTTAKKKG IHFAWFVLIG VLLMMGLCRG GINSGMGLFF QPISADMHIG VGEVAMMSSI SALITMFWSP FAGRLLDKFD IRIITVVAVA IQAGCFAALS LVDAVWGFYA LAGIMAFGSV FATQLVGPMM INRWFKDKNG LAMGIMMSFV AICSAVLSPV VASIIASNGW RMGYIVLGVL ALVIVIPAVL IWFRTPEQKG QLPLGATEAD IEAAKNADPK AAAEAAKNLP GLTSKQALKT PTFWFFFIFM VLLTGTLAFA SIVPTLAIEA GFDTVTSGFA MTAYMIGTAI AAVVFGTISD KLGPLKATMV ACACGFIALL GLIFFRTNLY MFFGSLFFYG CLSATLGVIG PLVLGTLFGQ KEFGSIYGIV MMATGIGSMI LIPAYGFIYD ATGSFTPALI MIFCFIVVCL ISMIMAFKTG KKVQAMWMPK A
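The gene sequence above is printed in blocks of ten bases, and the record lists translation table 11. As a rough illustration of how the protein sequence relates to the gene sequence, the Python sketch below strips the spacing, translates codon by codon, and computes a GC fraction. All helper names are hypothetical. The amino acid string is the NCBI codon assignment shared by the standard table and table 11, with bases ordered T, C, A, G. The special handling of alternative start codons in table 11 is not modeled, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... GGG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCTACCA CTACAGCCAA AAAGAAGGGG"))  # MATTTAKKKG
```

The translated prefix matches the first block of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row of the record.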