Gene Elen_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0404 
Symbol 
ID8414688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp517148 
End bp518539 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content65% 
IMG OID645023379 
Productsodium:neurotransmitter symporter 
Protein accessionYP_003180782 
Protein GI257790176 
COG category[R] General function prediction only 
COG ID[COG0733] Na+-dependent transporters of the SNF family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.630301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.211646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACGTG AGAAATTCGG ATCCCGTTTA GGGTTCATCC TGATCAGCGC GGGATGCGCC 
ATCGGGCTGG GGAACGTGTG GCGCTTCCCT TACATCGTGG GGCAGTACGG GGGCGCGGCA
TTCGTGCTTT TGTATTTGCT GTTCTTGGTG GTGTTCGCGC TGCCCATTCT CGTGATGGAG
TTCGCGGTGG GCCGGGCGAG CCAGAAGGGC GTCGCGCGCA GCTTCGACGA GCTGGAGCCG
GCCGGGTCGA AATGGCATCG GTTCAAGTGG GCGGCCCTTG CGGGCAATTA CCTGCTGATG
ATGTTCTACA CCACGGTTGC CGGATGGATG CTGGCGTTCA TGGCGTTCAG CGGCGCGGGC
ACGTTCGAGG GCCTGGACGC CGGCGCCGTC GAAGGGGTGT TCAACGGGCT TCTGGCCGAC
CCGCTTATGA TGGTCGCGTT CATGCTGGTC GTAGTGCTGA TAGGCGTGTT AGTGACGCGG
GCCGGCTTGC GCAACGGCGT GGAGCGCATT ACGAAGACGA TGATGGCCGC CCTGTTCGCC
GTCCTTGCCG TGCTGGTGGT GCGCGCGGTC ACGCTTCCGG GCGCCGAAGA GGGCCTGTCG
TTCTATCTCA TGCCCGATTT CGCGAAGCTG TTCGAAGGCG GGTGGGGGAC GTTCGTCGAT
GCCGTGTTCG CGGCTATGGG CCAGGCGTTC TTCACGGTGT CGGTGGGCGT GGGGTCCATG
TCCATCTTCG GCAGCTACAT CGATAAACGC TACCGCCTTA CGGGCGAGGC GCTGCGCGTC
GCGGGGCTGG ACACGCTCGT GGCCATCATG GCGGGCCTCA TCATCTTCCC GGCGTGCTTC
GCGTTCGGGG TGGAGCCGGG CAGCGGCCCC GGCCTGGTGT TCATCACGCT TCCCAGCGTG
TTCAGCCAGA TGCCGGTGGG GCAGCTGTGG GGCACGCTGT TCTTCCTGTT CATGAGCTTC
GCCGCGCTGT CCACGGTGGT GGCGGTGTTC GAGAACATCA TGAGCTTCAG CATGGACGAG
TGGGGCTGGT CGCGCAACCG CGCTTGCCTG GTGAACGGCA TCGCGCTGGC GCTGTTGTCG
CTGCCGTGCG TACTGGGCTT CAACGTGTGG GCGGGCGTGG AGGTGCCGGG TATCGGCAAT
ATCCAGGCCA TCGAGGACTT CCTCATGTCG AACAACGTGC TGCCGCTGGG CGCTCTGGTG
TTCCTGCTGT TCTGCACGTC CAAGCGGGGC TGGGGTTGGG ATGCGTTTCT GCGCGAGGCC
GACACGGGCG AGGGCACGCG CTTTCCTCGC TGGGCTCGCG GCTACGTGCG CTTCGCGCTG
CCCGTGCTCA TCCTGGCGGT GTTCGTGGCC GGCTACGTAC CCATCGTGCA AACCTGGCTG
GGGCTGGGGT AG
 
Protein sequence
MVREKFGSRL GFILISAGCA IGLGNVWRFP YIVGQYGGAA FVLLYLLFLV VFALPILVME 
FAVGRASQKG VARSFDELEP AGSKWHRFKW AALAGNYLLM MFYTTVAGWM LAFMAFSGAG
TFEGLDAGAV EGVFNGLLAD PLMMVAFMLV VVLIGVLVTR AGLRNGVERI TKTMMAALFA
VLAVLVVRAV TLPGAEEGLS FYLMPDFAKL FEGGWGTFVD AVFAAMGQAF FTVSVGVGSM
SIFGSYIDKR YRLTGEALRV AGLDTLVAIM AGLIIFPACF AFGVEPGSGP GLVFITLPSV
FSQMPVGQLW GTLFFLFMSF AALSTVVAVF ENIMSFSMDE WGWSRNRACL VNGIALALLS
LPCVLGFNVW AGVEVPGIGN IQAIEDFLMS NNVLPLGALV FLLFCTSKRG WGWDAFLREA
DTGEGTRFPR WARGYVRFAL PVLILAVFVA GYVPIVQTWL GLG