Gene Elen_1932 details


Gene Information

Locus tag: Elen_1932
Symbol:
ID: 8416237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2263684
End bp: 2264940
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 65%
IMG OID: 645024903
Product: sodium:dicarboxylate symporter
Protein accession: YP_003182285
Protein GI: 257791679
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3633] Na+/serine symporter
TIGRFAM ID:
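
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: total codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 2263684
END_BP = 2264940

length = gene_length_bp(START_BP, END_BP)
print(length, "bp")                      # 1257 bp
print(protein_length_aa(length), "aa")   # 418 aa
```

Both results match the Gene Length and Protein Length fields of the record.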


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00260891
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0618427
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAATCA AGAAGGTCGT CCAGGCGTGG AACTCGGTCA GCCTGGTGAA GCGCATCGTC 
ATCGGACTGG TGATCGGCGG CGTGCTGGGC GTGGCGGTGC CGAACATCGA GGTCGTCGAG
CTGCTGGGCA CGCTGTTCGT GTCGGCGCTG AAAGGCGTCG CGCCGATTCT GGTGTTCTTC
CTGGTTATCA GCGCGCTTGC GAACGCTCGT GCCGCCGGAT CGATGAAAAC CGTCGTGCTC
CTGTACGTGG TGAGCACGTT CGTCGCGGCG CTCGTGGCCG TGGTGGCCAG CTTCCTGTTC
CCCATCGAGC TCACGCTGAG CGCTCCGGCG GCCGACCAGA GCTCGCCGTC GGGCATCGGG
GAGGTGCTGG GCGCGCTCGT GATGAACGTG GTGTCGAACC CGGTGGACGC GCTTATGAAC
GCGAACTACA TCGGCATCCT GGCTTGGGCC GTGGTGCTGG GCATCGCGCT GCGCGCCGCG
TCGCAGAGCA CGAAGGACGT GTTCGCCAGC GTGTCCGACG CCGTGTCGCA GGTGGTGCGC
TGGGTGATCT CGCTGGCGCC CTTCGGCATC TTGGGCCTCG TGTACACCAC GGTGAGCGCG
AACGGCCTGG AGATATTCAC CGAGTACGGC CAGCTTCTGC TGGTGCTGGT GGGCTGCATG
CTGTTCATCG CGTTCGTCAC GAACCCGCTG CTGGTGTTCT GGGGCATCCG CAAGAACCCC
TACCCCCTCG TGCTGCGCTG CCTGAAGGAC AGCGGCATCA CGGCGTTCTT CACGCGCAGC
TCGGCGGCGA ACATCCCGGT GAACATGGAG CTGTGCCGCA AGCTGGGGCT GGACAAGGAC
AACTACTCAG TGTCCATCCC GCTGGGAGCC ACCATAAACA TGGCGGGCGC TGCGGTGACC
ATCTCGGTCA TGGCCATGGC GGCCGCCCAC ACCATGGGCG TGTCCATCGA TCTGCCCACC
GCCGTCATCC TCAGCGCGCT GGCAGCGGTG TCCGCCTGCG GCGCGTCGGG CGTGGCAGGA
GGGTCGCTGC TGCTCATCCC GCTGGCATGT TCGCTGTTCG GCATCGGCAA CGACGTGGCC
ATGCAGGTGG TGGCCATCGG CTTCATCATC GGCGTGGTTC AGGATTCGTG CGAGACGGCG
CTCAACTCGT CGTCCGACGT GCTGTTCACC GCCACGTCGG AGTACCGCGA GTGGCGCAAA
GCCGGCAAGG AGATCACGTT CGGCGAGGAT GCGTTCAGCG AGAACGAGCT GGCGTAA
 
Protein sequence
MGIKKVVQAW NSVSLVKRIV IGLVIGGVLG VAVPNIEVVE LLGTLFVSAL KGVAPILVFF 
LVISALANAR AAGSMKTVVL LYVVSTFVAA LVAVVASFLF PIELTLSAPA ADQSSPSGIG
EVLGALVMNV VSNPVDALMN ANYIGILAWA VVLGIALRAA SQSTKDVFAS VSDAVSQVVR
WVISLAPFGI LGLVYTTVSA NGLEIFTEYG QLLLVLVGCM LFIAFVTNPL LVFWGIRKNP
YPLVLRCLKD SGITAFFTRS SAANIPVNME LCRKLGLDKD NYSVSIPLGA TINMAGAAVT
ISVMAMAAAH TMGVSIDLPT AVILSALAAV SACGASGVAG GSLLLIPLAC SLFGIGNDVA
MQVVAIGFII GVVQDSCETA LNSSSDVLFT ATSEYREWRK AGKEITFGED AFSENELA
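
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (table 11 differs in its permitted start codons, not in amino acid assignments), and it strips the spaces that separate the 10-character blocks in the listing. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last rows of the gene sequence are included here to keep the example short.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard
# code, which translation table 11 shares (assumption stated in the lead-in).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
FIRST_ROW = "ATGGGAATCA AGAAGGTCGT CCAGGCGTGG AACTCGGTCA GCCTGGTGAA GCGCATCGTC"
LAST_ROW = "GCCGGCAAGG AGATCACGTT CGGCGAGGAT GCGTTCAGCG AGAACGAGCT GGCGTAA"

print(translate(FIRST_ROW))  # MGIKKVVQAWNSVSLVKRIV
print(translate(LAST_ROW))   # AGKEITFGEDAFSENELA
```

The two printed strings correspond to the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.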