Gene Elen_0068 details

Gene Information

Locus tag: Elen_0068
Symbol:
ID: 8414348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 89977
End bp: 91515
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 64%
IMG OID: 645023044
Product: Na+/solute symporter
Protein accession: YP_003180451
Protein GI: 257789845
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
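
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 89977
END_BP = 91515
GENE_LENGTH_BP = 1539
PROTEIN_LENGTH_AA = 512


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 91515 - 89977 + 1 = 1539 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)  # True
    # 1539 / 3 = 513 codons, minus the stop codon = 512 aa
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions the four fields agree with one another.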


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAAA AGATATGCAT GGCGCTTATT TTCGTCGGCG TGGCGGTGGG CGTCGGCATC 
TACTGCCGAC GACACACCGG AAGCGTCGAC GGGTTCATCC TGGGCGGCCG CAACGTGGGG
CCGTGGTTGA GCGCGTTCGC GTTCGGCACC AGCTACTTCT CCGCCGTCAT CTTCGTGGGC
TACGCCGGCC AGTTCGGCTG GAAGTACGGC GTGGCCGCCA CGTGGATCGG CATCGGCAAC
GCCATCCTGG GAAGCTTGCT GGCCTGGTGG GTGCTGGGCC CGCGCACGCG CGAGATGACG
CACCGCCTGG GCGCGTCCAC GATGCCCGAG TTCTTCGGCG CGCGGTTCCA GTCGAAGGGT
TTGCGCATCG CGGCCGCGGC CATCATCTTC GTGTTCCTCA TCCCGTACAC GGCCAGCGTG
TACAACGGCT TGTCGCGCCT GTTCGGCATG GCGTTCGGCC TGCCCTACGA GGTGTGCGTC
ATCGGTATGG CGGTTATCAC CTGCGTGTAC GTGGTGCTGG GCGGCTACAT GGCCACGGTG
ATGAACGACT TCATCCAGGG CATCGTCATG CTGCTGGGCA TCGTTGCCGT CATCGTGGCG
GTGCTGGTGA ACAACGGGGG CTTCACCGAG GCCCTCACCA CGCTGTCGCT CATTCCCGCC
GAGGGCAGCG AGATGGCCGG GCCGTTCGTC AGCTTCTTCG GGCCGAACCT GCCCGATCTC
ATCGGGGTCA TCGTGCTGAC CAGCTTGGGC ACGTGGGGTC TGCCGCAGAT GGTGCAGAAG
TTCTACGCCA TCAAAAGCGG TCCAGCCATC AAGCAGGGCG CCATCATCTC CACCGTGTTC
GCCATGGTGG TGGCCGGCGG CAGCTACTTC CTGGGAGGGT TCGGCCGCCT GTACGGCGAT
CAGGTGGAGA TGACCGCCGC AGGCACGCCG GTGTACGACT CTATCATGCC CACCATGCTG
TCCACGCTGC CCGACCTGCT CATCGGCATC GTGATCGTGC TGGTGCTGAG CGCGTCCATG
TCCACACTGT CATCGCTGGT GCTGACGTCG TCGTCCACGC TGACGCTCGA TCTGCTGAAA
GACAACGTGG TGAAGAACAT GAGCGAGAAG AAGCAGTTGG GCTACATGCG CGTGCTGATC
GTGGTGTTCA TCGTGATCTC GGCGGTGATC GCGCTGGTGC AGTACAACTC GTCCATCACG
TTCATCGCAC AGCTGATGAG CATCTCGTGG GGCGCGCTGG CCGGTTCGTT CTTGGGGCCC
TTCTTCTGGG GGCTCTACTC GCGTCGCGTG TCGCGTCCGG CTGTGTGGGC CAGCTTCATC
GTGGGCGTGG GTCTGACCAC GGGCAACATG GTCGCCGGGT TCGTGGGCGC GGCGTTCATC
GCGTCGCCCA TCAACTGCGG CGCCATCGCG ATGGTGCTGT CGCTGATCAT CGTGCCGGTG
GTCAGCTTGT TCACGAAGCG CGTTGAGTTC GAGGTCGACC CGCCGCACGT CGAGGGAGCC
ATCGACCGCG AATACGAGCA GGAGCTGGAA GCGGAGTAG
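
The GC content reported in the Gene Information section could be recomputed from the gene sequence above. This is a rough sketch, assuming the sequence is pasted in as a string with its blocks of ten bases still separated by spaces and line breaks; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as sample input.
    first_line = (
        "ATGATCGAAA AGATATGCAT GGCGCTTATT "
        "TTCGTCGGCG TGGCGGTGGG CGTCGGCATC"
    )
    print(len(clean_sequence(first_line)))  # 60
    print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` gives a value to compare with the rounded GC content field.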
 
Protein sequence
MIEKICMALI FVGVAVGVGI YCRRHTGSVD GFILGGRNVG PWLSAFAFGT SYFSAVIFVG 
YAGQFGWKYG VAATWIGIGN AILGSLLAWW VLGPRTREMT HRLGASTMPE FFGARFQSKG
LRIAAAAIIF VFLIPYTASV YNGLSRLFGM AFGLPYEVCV IGMAVITCVY VVLGGYMATV
MNDFIQGIVM LLGIVAVIVA VLVNNGGFTE ALTTLSLIPA EGSEMAGPFV SFFGPNLPDL
IGVIVLTSLG TWGLPQMVQK FYAIKSGPAI KQGAIISTVF AMVVAGGSYF LGGFGRLYGD
QVEMTAAGTP VYDSIMPTML STLPDLLIGI VIVLVLSASM STLSSLVLTS SSTLTLDLLK
DNVVKNMSEK KQLGYMRVLI VVFIVISAVI ALVQYNSSIT FIAQLMSISW GALAGSFLGP
FFWGLYSRRV SRPAVWASFI VGVGLTTGNM VAGFVGAAFI ASPINCGAIA MVLSLIIVPV
VSLFTKRVEF EVDPPHVEGA IDREYEQELE AE
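
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it builds the codon table from the usual TCAG ordering, whose amino acid assignments translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence give the first 10 residues.
    print(translate("ATGATCGAAA AGATATGCAT GGCGCTTATT"))  # MIEKICMALI
    # Last 9 bases: two residues followed by the TAG stop codon.
    print(translate("GCGGAGTAG"))  # AE
```

Both outputs match the start and end of the protein sequence listed above.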