Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1691 |
Symbol | |
ID | 8415990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1993754 |
End bp | 1994710 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024658 |
Product | formate/nitrite transporter |
Protein accession | YP_003182046 |
Protein GI | 257791440 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2116] Formate/nitrite family of transporters |
TIGRFAM ID | [TIGR00790] formate/nitrite transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.516822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAAT TGTCGAGGAA AACGCTTTTG TTTCTCCCTG CGCCCGCGAG TTGCGGTACC ATACCGTGCG GCTTCGCAGG GGACGCGCGA AGAAGCCGCG ACCAGGCAAG GAAGGAACCA GATGCGATCA TGATGCAGGA CGAGATCCAA ACGCTCAGGC CGGATGCGCT GTCGCCGGCT GAAATCGCGG AGAAGGCCGA GAAGGTCGGC GAGGCCAAGG CGGCCATGCC GAAGGCGAAG TGCTTCGCGT CCGCGATGCT GGCCGGGGCG TTCATCGCGT TCGGGGCGCT GTACTTCTGC GTGTTCCTGG GCGACCCCTC CATGCCGTTC GCGGTGCAGC GCGCGGTGGG CGGACTCTGC TTCTGCCTCG GGCTGGTGCT CGTGCTGTGC TGCGGCGCGG AGCTGTTCAC CGGCAACGTG CTTCTGGTGT GCGCGAAAGC GTCGGGCCGT ATCGGCTGGA GGTCCCTGTT CGGCAACTGG CTGCTCGTGT GGCTCGGCAA CCTCGCCGGC GCGCTCGTCG CGCTCGCGCT CGTCCACTTT GCGAACGTCG CCGCCATGAA CGGAGGCGCG GTGGGCGAGG CCTTCGTCAG CGTGGCCGCG GGCAAGGTGG CGCCCGATTG GACGACGCTG TTCTTCAAGG GCGTCATGTG CAACATCCTC GTGTGCCTGG CCGTGTGGAT CGGGTTCTCG GCGCGCACCG TCGCCGACAA GGTGCTCGGC ATCCTGCTGC CCATCTCTGC GTTCGTTGCG TGCGGGTTCG AGCACTGCGT GGCGAACATG TTCTTCCTCC CCATGGGGCT GTTGCTCAAC TCGATGGGGG TGGGTGCGCC CGACGCGGTG ACGCTGGCCG GCATCGCGAC GAACCTGTCG GCCGCCACGC TGGGCAACGT CGCCGGCGGC GTCGCCGTCG GCATGGCGTA CTGGTTCGTG TACCGGAAGA AGGACGTCGA TCGGTAG
|
Protein sequence | MGKLSRKTLL FLPAPASCGT IPCGFAGDAR RSRDQARKEP DAIMMQDEIQ TLRPDALSPA EIAEKAEKVG EAKAAMPKAK CFASAMLAGA FIAFGALYFC VFLGDPSMPF AVQRAVGGLC FCLGLVLVLC CGAELFTGNV LLVCAKASGR IGWRSLFGNW LLVWLGNLAG ALVALALVHF ANVAAMNGGA VGEAFVSVAA GKVAPDWTTL FFKGVMCNIL VCLAVWIGFS ARTVADKVLG ILLPISAFVA CGFEHCVANM FFLPMGLLLN SMGVGAPDAV TLAGIATNLS AATLGNVAGG VAVGMAYWFV YRKKDVDR
|
| |