Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1932 |
Symbol | |
ID | 8416237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2263684 |
End bp | 2264940 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645024903 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_003182285 |
Protein GI | 257791679 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3633] Na+/serine symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00260891 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0618427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATCA AGAAGGTCGT CCAGGCGTGG AACTCGGTCA GCCTGGTGAA GCGCATCGTC ATCGGACTGG TGATCGGCGG CGTGCTGGGC GTGGCGGTGC CGAACATCGA GGTCGTCGAG CTGCTGGGCA CGCTGTTCGT GTCGGCGCTG AAAGGCGTCG CGCCGATTCT GGTGTTCTTC CTGGTTATCA GCGCGCTTGC GAACGCTCGT GCCGCCGGAT CGATGAAAAC CGTCGTGCTC CTGTACGTGG TGAGCACGTT CGTCGCGGCG CTCGTGGCCG TGGTGGCCAG CTTCCTGTTC CCCATCGAGC TCACGCTGAG CGCTCCGGCG GCCGACCAGA GCTCGCCGTC GGGCATCGGG GAGGTGCTGG GCGCGCTCGT GATGAACGTG GTGTCGAACC CGGTGGACGC GCTTATGAAC GCGAACTACA TCGGCATCCT GGCTTGGGCC GTGGTGCTGG GCATCGCGCT GCGCGCCGCG TCGCAGAGCA CGAAGGACGT GTTCGCCAGC GTGTCCGACG CCGTGTCGCA GGTGGTGCGC TGGGTGATCT CGCTGGCGCC CTTCGGCATC TTGGGCCTCG TGTACACCAC GGTGAGCGCG AACGGCCTGG AGATATTCAC CGAGTACGGC CAGCTTCTGC TGGTGCTGGT GGGCTGCATG CTGTTCATCG CGTTCGTCAC GAACCCGCTG CTGGTGTTCT GGGGCATCCG CAAGAACCCC TACCCCCTCG TGCTGCGCTG CCTGAAGGAC AGCGGCATCA CGGCGTTCTT CACGCGCAGC TCGGCGGCGA ACATCCCGGT GAACATGGAG CTGTGCCGCA AGCTGGGGCT GGACAAGGAC AACTACTCAG TGTCCATCCC GCTGGGAGCC ACCATAAACA TGGCGGGCGC TGCGGTGACC ATCTCGGTCA TGGCCATGGC GGCCGCCCAC ACCATGGGCG TGTCCATCGA TCTGCCCACC GCCGTCATCC TCAGCGCGCT GGCAGCGGTG TCCGCCTGCG GCGCGTCGGG CGTGGCAGGA GGGTCGCTGC TGCTCATCCC GCTGGCATGT TCGCTGTTCG GCATCGGCAA CGACGTGGCC ATGCAGGTGG TGGCCATCGG CTTCATCATC GGCGTGGTTC AGGATTCGTG CGAGACGGCG CTCAACTCGT CGTCCGACGT GCTGTTCACC GCCACGTCGG AGTACCGCGA GTGGCGCAAA GCCGGCAAGG AGATCACGTT CGGCGAGGAT GCGTTCAGCG AGAACGAGCT GGCGTAA
|
Protein sequence | MGIKKVVQAW NSVSLVKRIV IGLVIGGVLG VAVPNIEVVE LLGTLFVSAL KGVAPILVFF LVISALANAR AAGSMKTVVL LYVVSTFVAA LVAVVASFLF PIELTLSAPA ADQSSPSGIG EVLGALVMNV VSNPVDALMN ANYIGILAWA VVLGIALRAA SQSTKDVFAS VSDAVSQVVR WVISLAPFGI LGLVYTTVSA NGLEIFTEYG QLLLVLVGCM LFIAFVTNPL LVFWGIRKNP YPLVLRCLKD SGITAFFTRS SAANIPVNME LCRKLGLDKD NYSVSIPLGA TINMAGAAVT ISVMAMAAAH TMGVSIDLPT AVILSALAAV SACGASGVAG GSLLLIPLAC SLFGIGNDVA MQVVAIGFII GVVQDSCETA LNSSSDVLFT ATSEYREWRK AGKEITFGED AFSENELA
|
| |