Gene Elen_2153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2153 
Symbol 
ID8416475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2535917 
End bp2537329 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content65% 
IMG OID645025140 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003182505 
Protein GI257791899 
COG category[R] General function prediction only 
COG ID[COG1823] Predicted Na+/dicarboxylate symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGG AGGTGCTGGT GCCCGTCGCA GGTATCTTGG CGGTGTTCGC TGCCATGCTC 
GTCGTGCTCA AGATCATGCG ATCGAAGGGC CTAAGCTTCA CGGCGAGGGT GTTCACCGCG
CTCGGGCTGG GCGTCCTGCT AGGTTTGGGA ATCCAGCTGT TGCTCGGCCG CGACGGGGAT
GCTGCCACGA CGGCGCTCGA CTGGATATCC ATCGTGGGGC AGGGTTATAT CGGCTTGTTG
AAGATGCTCG TCATGCCGCT CGTGTTCGTG GCCATCGTCG GGGCGTTCAC GCGCGCGGAG
GTCACCGAGC ACTTCGGGCG CATCGCCTTC GCGGTGTTGG CCGTGCTGCT GGGCACGGTG
ACCGTTGCGG CCGTGCTAGG CTGGGGTGCG ACGGTTCTGA CCGGGCTTGC GCATGCGGGC
TTCCTCGATG CCGCGACCAC CGATGCCGCC GAGCTTTCGT CGCTCGCCTC CCAGCAGAGC
GAGGCTGCCA GCCTTACGCT GCCGCAGGAA ATCCTCTCGT TCATCCCGAC GAACCCCTTC
GCCGACCTCG CTGGCGCCCG ATCCACCTCC ACCATCGCGG TCGTCATCTT CTCGGCCATC
CTCGGCGTCG CATATATCGG GTTGCGCGAT AAAGACGTTG ACCAGGCAGA CTTCTTCAAG
AGCCTCATCG ACAGCCTGTA CGGCATCGTC ATGCGGATAG TGGCCATGGT GCTAGGGCTC
ACCCCGTACG GTATCCTCGC GCTCATCGCG AAGGTGATGG CCGCGAGCGA TTACCGCGCC
ATCCTCGGAC TGGGAAAGTT CGTGCTCGTG TCTTACGGCG CGTTGCTGGC GGTGCTGTGC
GTGCATTGTC TCATCTTGCT GGCGAACCGT GTGAATCCGG CGACCTACTT CAAGAAGGCG
TTTCCCGTGC TCAGCTTCGC GTTCGTATCG CGTTCGAGCG CGGGCGCGCT GCCGCTCAAC
ATCGAGACGC AGCATAAAGC CCTGGGCGTG GACAGCGCGT CGGCGAACCT CGCGGCAAGC
TTCGGCATGT CTATCGGGCA GAACGGGTGC GCGGGCGTGT ACCCGGCGAT GCTGGCCACC
ATCGTGGCGC CTACCGTAGG CATCGACGTG CTGTCTCCTA CGTTCTTCGT GCCGCTCATC
GCCGTGGTGG CCATCAGCTC GTTCGGTGTG GCAGGAGTGG GCGGGGGAGC GACGTTCGCC
TCGCTCATCG TGTTGGGGAC GATGGGGCTT CCTATCGAGG TGGTGGCCGT CCTCGCATCC
GTGGAGCCGC TCATCGACAT GGGTCGCACC GCCCTCAACG TCAGCGACTC CATGGTGGCG
GGCATTACAG CCTCGCATGT TGCGGGCGGC CTCGACCGCT CGGTTCTGGA CGATCCGGAG
GCGCGTGTCA CCGCCGAGGC GCACGTGAGC TGA
 
Protein sequence
MSMEVLVPVA GILAVFAAML VVLKIMRSKG LSFTARVFTA LGLGVLLGLG IQLLLGRDGD 
AATTALDWIS IVGQGYIGLL KMLVMPLVFV AIVGAFTRAE VTEHFGRIAF AVLAVLLGTV
TVAAVLGWGA TVLTGLAHAG FLDAATTDAA ELSSLASQQS EAASLTLPQE ILSFIPTNPF
ADLAGARSTS TIAVVIFSAI LGVAYIGLRD KDVDQADFFK SLIDSLYGIV MRIVAMVLGL
TPYGILALIA KVMAASDYRA ILGLGKFVLV SYGALLAVLC VHCLILLANR VNPATYFKKA
FPVLSFAFVS RSSAGALPLN IETQHKALGV DSASANLAAS FGMSIGQNGC AGVYPAMLAT
IVAPTVGIDV LSPTFFVPLI AVVAISSFGV AGVGGGATFA SLIVLGTMGL PIEVVAVLAS
VEPLIDMGRT ALNVSDSMVA GITASHVAGG LDRSVLDDPE ARVTAEAHVS