Gene Elen_2137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2137 
Symbol 
ID8416459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2512117 
End bp2513703 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content64% 
IMG OID645025124 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003182489 
Protein GI257791883 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00565072 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.610314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTTCCA ACGATTTTTG GGTCGTATTC GCGATGCTCC TGTATTTCGT CGCGGTGCTC 
ACCATCGGGT TCGTGTACGC GAAGCGCTCG AACTCGTCGA CGGCCGAGTA CTTCCTCGGC
GGCCGCGGGG TGGGGCCGTG GCTCACGGCG CTGTCGGCGG AGGCGTCCGA CATGTCGGGC
TGGTTGCTCA TGGGCCTGCC GGGCGTGGCC TACTTCACCG GTGCGTCCGA CGCCATGTGG
ACCGCCATCG GCCTTGCCAT CGGCACGTAT CTCAACTGGA AGTTCGTGGC GAAGCGTTTG
CGGAAGTATT CCGTGGTGGC GGGCGATTCC ATCACGCTGC CGGAATTCTA CAGCAAGCGC
TTCCATGACC GTAAGAACAT CGTGTCCACG GTGGCCGCGC TCATCATCAT GGTGTTCTTC
TGCGTGTACG TGGGCAGCTG TTTCGTCACG TGCGGAAAGC TGTTCGCCAC GCTGTTCGGC
CTCGACTACG CCACGATGAT GGTGCTCGGC GCCATCATCG TGTTTTTGTA CACGCTGGTC
GGCGGCTACC TTTCGGTGGT CACCACCGAC TTCGTGCAGG GCGTGCTCAT GTTCTTCGCG
CTGGCCACCG TGTTCGTGGG CTCGGTCGCC TGGGCGGGCG GCGTCGACAA CACCGTCGCG
TTCCTGCAGA ACATCCCCGG CTTCCTCAGC GGCACCCAGA TAGCGGTGCC CCTCGTGGAC
GACGCGGGCC GGCAGCTCGT GGAGAACGGG ACGCCCCTGT TCGGCGACGC GGCCGAGTAC
CCGCTCATCA CCATCGCCTC GATGCTGGCG TGGGGCCTCG GCTACTTCGG TATGCCGCAG
GTGCTCGTGC GATTCATGGG CATCCGCTCG GCTGACGAGG TGAAGAAGTC GCGCGTGATC
GCGGTCGTAT GGTGCGTCGT GTCGCTGGCC TGCGGCATCT GCATCGGCTT GGTGGGCCGC
GTCATCATTC CCGTCGACTT CGCCACGCAG GCTCAGGCCG AAAACGTGTT CATCGTGCTG
TCCCAGATGA TCTTGCCGCC GTTCATGTGC GGCGTAGTGG TGTCGGGCAT CTTCGCGGCG
TCGATGAGCT CCTCGTCCTC GTACCTGCTG ATCGCCGGCT CCTCGGTGGC TGAGAACATC
TTCCGCGGGG TCGTCAAGAA AGACGCCACC GACCGCCAGG TCATGATCGT GGCGCGCCTC
ACGCTCATCG CGGTGTTCAT GTTCGGCATC ATCGTGGCGT ACGACGAGAA CTCTTCTATC
TTCGGGGTGG TGTCCTACGC TTGGGCGGGC CTCGGCGCCT CGTTCGGCCC GCTCACCCTG
TGCGCCTTGT ATTGGCGCCG CGCCAACATG CAGGGTGCGC TGGCCGGCAT GATCACTGGT
ACGGTCACGG TGCTCGTCTG GCACAACTTC ATCAAGCCTC TCGGCGGCGT GTTCGGCATC
TACGAGCTGC TCCCCGCGTT CGTCCTGTCG TTCGCCGCCA TCATCATCGT GTCGCTGCTC
ACGCCCGCCC CGTCGGAGGC GGTCGCGAAC GAGTTCGACC ATTACATGGA CGAAGGGGGA
GCCGCCGTCG AAAAGGTCGT GCAGTAG
 
Protein sequence
MVSNDFWVVF AMLLYFVAVL TIGFVYAKRS NSSTAEYFLG GRGVGPWLTA LSAEASDMSG 
WLLMGLPGVA YFTGASDAMW TAIGLAIGTY LNWKFVAKRL RKYSVVAGDS ITLPEFYSKR
FHDRKNIVST VAALIIMVFF CVYVGSCFVT CGKLFATLFG LDYATMMVLG AIIVFLYTLV
GGYLSVVTTD FVQGVLMFFA LATVFVGSVA WAGGVDNTVA FLQNIPGFLS GTQIAVPLVD
DAGRQLVENG TPLFGDAAEY PLITIASMLA WGLGYFGMPQ VLVRFMGIRS ADEVKKSRVI
AVVWCVVSLA CGICIGLVGR VIIPVDFATQ AQAENVFIVL SQMILPPFMC GVVVSGIFAA
SMSSSSSYLL IAGSSVAENI FRGVVKKDAT DRQVMIVARL TLIAVFMFGI IVAYDENSSI
FGVVSYAWAG LGASFGPLTL CALYWRRANM QGALAGMITG TVTVLVWHNF IKPLGGVFGI
YELLPAFVLS FAAIIIVSLL TPAPSEAVAN EFDHYMDEGG AAVEKVVQ