Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1016 |
Symbol | |
ID | 8415306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1232386 |
End bp | 1233801 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023980 |
Product | amino acid carrier protein |
Protein accession | YP_003181377 |
Protein GI | 257790771 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1115] Na+/alanine symporter |
TIGRFAM ID | [TIGR00835] amino acid carrier protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.190269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.164712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATCG TCCAGATGAT CAGCGATATC GACGCCTTCG TATGGGGCCC GCCGATGATC GTGCTGCTGT TGGGTTCGCA TCTGTACCTG ACGATCCGCA CCCGGTTCAT CCAGCGCAAG CTGCCGACGG CCATCAAGCT GTCGGTGACG AAGGACCCGG ATGCGCCGGG CGACATCAGC CAGTTCGGCG CGCTGACCAC GGCGCTGTCG GCCACCATCG GCACCGGCAA CATCGTGGGC GTGGGCACCG CCATCCTGGC CGGCGGCCCG GGCGCGGTGC TGTGGATGTG GCTCACCGGC GTGTTCGGCA TGGCCACGAA GTACTCCGAG ACGTTCGCCG CCGTGAAGTA CCGCGTGAAG GACCACAACG GCAACATGCT GGGCGGCGCG ATGTACGCAT GGCGACGCGC GTTCGAGAAG GACGGCAAGA CGCCGTGGTG GGGCTTGCTG GGGGCCGGGG CGTTCGCCCT GTTCGCCGCC GTCGCCTCGT TCGGCATCGG CTCGGCCGTG CAGTCCAGCG CCATGACCGG GATCATCACG TCCAACGCTC CCGGCGTGCC CACCTGGGGC ATCGGCCTGG CCATCGTCAT CATGGTGTCC ATCGTCATCT TCGGCGGCAT CAAGATCATC TCGAAGGTGT GCGAGAAGCT CGTGCCGTTC ATGGCCATCG CCTACGCGTG GGGCTGCATC GTGATCATCG GCATGAACTG GGAGTACGTG TGGCCCGCCA TCAGCCTCAT CTTCGAGTGC GCGTTCACGC CGAAGGCGGC GTTCGGCGGC GCGGTGGGCT CGGGGCTGAT GATGGCGCTG CAGTTCGGCT GCGCGCGCGG CCTGTTCTCG AACGAGTCGG GCCTGGGCTC GGCGCCCATC GTGGCCTCGG CGGCCTCCAC GCGCAACCCG GCGCGCCAGG CCCTCGTGTC CATGACCGGC ACCTTCTGGG ACACCGTCGT CATCTGCGCG CTCACGGGCA TCGTGCTCGT GTCCACGATG ATCGCGAACC CGGGCATCAT GGAGAGCGGC CAGGTTTCGG CCGGCGCCGA TCTGACGAGC GCGGCCTTCG CGTCGATCCC CTACATCGGC ACGCCCATCC TGGTCATCGG CATGATCCTG TTCGCCTACA CCACCATCCT CGGCTGGTCG TACTACGGCA ACCGCTGCGT CACCTACCTG TTCGGCAAGC GCGCCATCCG CCCCTATCAG GTGCTGTACG TGGTGGTGGC GTTCCTGGGG GCCATCGGCA TCGGCGATTT GGTGTGGACC ATCTCCGACA TCACGAACGC GCTCATGGCC ATCCCGAACA TCATCGTGGT GCTGCTGCTT TCGGGCCTCA TCGCGCGCGA GACGAAGCAT TACGTGTGGG ACAAGAACCT GGACGAGACG GACGACACGC CCATCCCCGT GCTTGAGTCG AAGTAG
|
Protein sequence | MDIVQMISDI DAFVWGPPMI VLLLGSHLYL TIRTRFIQRK LPTAIKLSVT KDPDAPGDIS QFGALTTALS ATIGTGNIVG VGTAILAGGP GAVLWMWLTG VFGMATKYSE TFAAVKYRVK DHNGNMLGGA MYAWRRAFEK DGKTPWWGLL GAGAFALFAA VASFGIGSAV QSSAMTGIIT SNAPGVPTWG IGLAIVIMVS IVIFGGIKII SKVCEKLVPF MAIAYAWGCI VIIGMNWEYV WPAISLIFEC AFTPKAAFGG AVGSGLMMAL QFGCARGLFS NESGLGSAPI VASAASTRNP ARQALVSMTG TFWDTVVICA LTGIVLVSTM IANPGIMESG QVSAGADLTS AAFASIPYIG TPILVIGMIL FAYTTILGWS YYGNRCVTYL FGKRAIRPYQ VLYVVVAFLG AIGIGDLVWT ISDITNALMA IPNIIVVLLL SGLIARETKH YVWDKNLDET DDTPIPVLES K
|
| |