Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2137 |
Symbol | |
ID | 8416459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2512117 |
End bp | 2513703 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645025124 |
Product | SSS sodium solute transporter superfamily |
Protein accession | YP_003182489 |
Protein GI | 257791883 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00565072 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.610314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTTCCA ACGATTTTTG GGTCGTATTC GCGATGCTCC TGTATTTCGT CGCGGTGCTC ACCATCGGGT TCGTGTACGC GAAGCGCTCG AACTCGTCGA CGGCCGAGTA CTTCCTCGGC GGCCGCGGGG TGGGGCCGTG GCTCACGGCG CTGTCGGCGG AGGCGTCCGA CATGTCGGGC TGGTTGCTCA TGGGCCTGCC GGGCGTGGCC TACTTCACCG GTGCGTCCGA CGCCATGTGG ACCGCCATCG GCCTTGCCAT CGGCACGTAT CTCAACTGGA AGTTCGTGGC GAAGCGTTTG CGGAAGTATT CCGTGGTGGC GGGCGATTCC ATCACGCTGC CGGAATTCTA CAGCAAGCGC TTCCATGACC GTAAGAACAT CGTGTCCACG GTGGCCGCGC TCATCATCAT GGTGTTCTTC TGCGTGTACG TGGGCAGCTG TTTCGTCACG TGCGGAAAGC TGTTCGCCAC GCTGTTCGGC CTCGACTACG CCACGATGAT GGTGCTCGGC GCCATCATCG TGTTTTTGTA CACGCTGGTC GGCGGCTACC TTTCGGTGGT CACCACCGAC TTCGTGCAGG GCGTGCTCAT GTTCTTCGCG CTGGCCACCG TGTTCGTGGG CTCGGTCGCC TGGGCGGGCG GCGTCGACAA CACCGTCGCG TTCCTGCAGA ACATCCCCGG CTTCCTCAGC GGCACCCAGA TAGCGGTGCC CCTCGTGGAC GACGCGGGCC GGCAGCTCGT GGAGAACGGG ACGCCCCTGT TCGGCGACGC GGCCGAGTAC CCGCTCATCA CCATCGCCTC GATGCTGGCG TGGGGCCTCG GCTACTTCGG TATGCCGCAG GTGCTCGTGC GATTCATGGG CATCCGCTCG GCTGACGAGG TGAAGAAGTC GCGCGTGATC GCGGTCGTAT GGTGCGTCGT GTCGCTGGCC TGCGGCATCT GCATCGGCTT GGTGGGCCGC GTCATCATTC CCGTCGACTT CGCCACGCAG GCTCAGGCCG AAAACGTGTT CATCGTGCTG TCCCAGATGA TCTTGCCGCC GTTCATGTGC GGCGTAGTGG TGTCGGGCAT CTTCGCGGCG TCGATGAGCT CCTCGTCCTC GTACCTGCTG ATCGCCGGCT CCTCGGTGGC TGAGAACATC TTCCGCGGGG TCGTCAAGAA AGACGCCACC GACCGCCAGG TCATGATCGT GGCGCGCCTC ACGCTCATCG CGGTGTTCAT GTTCGGCATC ATCGTGGCGT ACGACGAGAA CTCTTCTATC TTCGGGGTGG TGTCCTACGC TTGGGCGGGC CTCGGCGCCT CGTTCGGCCC GCTCACCCTG TGCGCCTTGT ATTGGCGCCG CGCCAACATG CAGGGTGCGC TGGCCGGCAT GATCACTGGT ACGGTCACGG TGCTCGTCTG GCACAACTTC ATCAAGCCTC TCGGCGGCGT GTTCGGCATC TACGAGCTGC TCCCCGCGTT CGTCCTGTCG TTCGCCGCCA TCATCATCGT GTCGCTGCTC ACGCCCGCCC CGTCGGAGGC GGTCGCGAAC GAGTTCGACC ATTACATGGA CGAAGGGGGA GCCGCCGTCG AAAAGGTCGT GCAGTAG
|
Protein sequence | MVSNDFWVVF AMLLYFVAVL TIGFVYAKRS NSSTAEYFLG GRGVGPWLTA LSAEASDMSG WLLMGLPGVA YFTGASDAMW TAIGLAIGTY LNWKFVAKRL RKYSVVAGDS ITLPEFYSKR FHDRKNIVST VAALIIMVFF CVYVGSCFVT CGKLFATLFG LDYATMMVLG AIIVFLYTLV GGYLSVVTTD FVQGVLMFFA LATVFVGSVA WAGGVDNTVA FLQNIPGFLS GTQIAVPLVD DAGRQLVENG TPLFGDAAEY PLITIASMLA WGLGYFGMPQ VLVRFMGIRS ADEVKKSRVI AVVWCVVSLA CGICIGLVGR VIIPVDFATQ AQAENVFIVL SQMILPPFMC GVVVSGIFAA SMSSSSSYLL IAGSSVAENI FRGVVKKDAT DRQVMIVARL TLIAVFMFGI IVAYDENSSI FGVVSYAWAG LGASFGPLTL CALYWRRANM QGALAGMITG TVTVLVWHNF IKPLGGVFGI YELLPAFVLS FAAIIIVSLL TPAPSEAVAN EFDHYMDEGG AAVEKVVQ
|
| |