Gene Elen_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1016 
Symbol 
ID8415306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1232386 
End bp1233801 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content66% 
IMG OID645023980 
Productamino acid carrier protein 
Protein accessionYP_003181377 
Protein GI257790771 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID[TIGR00835] amino acid carrier protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.190269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.164712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCG TCCAGATGAT CAGCGATATC GACGCCTTCG TATGGGGCCC GCCGATGATC 
GTGCTGCTGT TGGGTTCGCA TCTGTACCTG ACGATCCGCA CCCGGTTCAT CCAGCGCAAG
CTGCCGACGG CCATCAAGCT GTCGGTGACG AAGGACCCGG ATGCGCCGGG CGACATCAGC
CAGTTCGGCG CGCTGACCAC GGCGCTGTCG GCCACCATCG GCACCGGCAA CATCGTGGGC
GTGGGCACCG CCATCCTGGC CGGCGGCCCG GGCGCGGTGC TGTGGATGTG GCTCACCGGC
GTGTTCGGCA TGGCCACGAA GTACTCCGAG ACGTTCGCCG CCGTGAAGTA CCGCGTGAAG
GACCACAACG GCAACATGCT GGGCGGCGCG ATGTACGCAT GGCGACGCGC GTTCGAGAAG
GACGGCAAGA CGCCGTGGTG GGGCTTGCTG GGGGCCGGGG CGTTCGCCCT GTTCGCCGCC
GTCGCCTCGT TCGGCATCGG CTCGGCCGTG CAGTCCAGCG CCATGACCGG GATCATCACG
TCCAACGCTC CCGGCGTGCC CACCTGGGGC ATCGGCCTGG CCATCGTCAT CATGGTGTCC
ATCGTCATCT TCGGCGGCAT CAAGATCATC TCGAAGGTGT GCGAGAAGCT CGTGCCGTTC
ATGGCCATCG CCTACGCGTG GGGCTGCATC GTGATCATCG GCATGAACTG GGAGTACGTG
TGGCCCGCCA TCAGCCTCAT CTTCGAGTGC GCGTTCACGC CGAAGGCGGC GTTCGGCGGC
GCGGTGGGCT CGGGGCTGAT GATGGCGCTG CAGTTCGGCT GCGCGCGCGG CCTGTTCTCG
AACGAGTCGG GCCTGGGCTC GGCGCCCATC GTGGCCTCGG CGGCCTCCAC GCGCAACCCG
GCGCGCCAGG CCCTCGTGTC CATGACCGGC ACCTTCTGGG ACACCGTCGT CATCTGCGCG
CTCACGGGCA TCGTGCTCGT GTCCACGATG ATCGCGAACC CGGGCATCAT GGAGAGCGGC
CAGGTTTCGG CCGGCGCCGA TCTGACGAGC GCGGCCTTCG CGTCGATCCC CTACATCGGC
ACGCCCATCC TGGTCATCGG CATGATCCTG TTCGCCTACA CCACCATCCT CGGCTGGTCG
TACTACGGCA ACCGCTGCGT CACCTACCTG TTCGGCAAGC GCGCCATCCG CCCCTATCAG
GTGCTGTACG TGGTGGTGGC GTTCCTGGGG GCCATCGGCA TCGGCGATTT GGTGTGGACC
ATCTCCGACA TCACGAACGC GCTCATGGCC ATCCCGAACA TCATCGTGGT GCTGCTGCTT
TCGGGCCTCA TCGCGCGCGA GACGAAGCAT TACGTGTGGG ACAAGAACCT GGACGAGACG
GACGACACGC CCATCCCCGT GCTTGAGTCG AAGTAG
 
Protein sequence
MDIVQMISDI DAFVWGPPMI VLLLGSHLYL TIRTRFIQRK LPTAIKLSVT KDPDAPGDIS 
QFGALTTALS ATIGTGNIVG VGTAILAGGP GAVLWMWLTG VFGMATKYSE TFAAVKYRVK
DHNGNMLGGA MYAWRRAFEK DGKTPWWGLL GAGAFALFAA VASFGIGSAV QSSAMTGIIT
SNAPGVPTWG IGLAIVIMVS IVIFGGIKII SKVCEKLVPF MAIAYAWGCI VIIGMNWEYV
WPAISLIFEC AFTPKAAFGG AVGSGLMMAL QFGCARGLFS NESGLGSAPI VASAASTRNP
ARQALVSMTG TFWDTVVICA LTGIVLVSTM IANPGIMESG QVSAGADLTS AAFASIPYIG
TPILVIGMIL FAYTTILGWS YYGNRCVTYL FGKRAIRPYQ VLYVVVAFLG AIGIGDLVWT
ISDITNALMA IPNIIVVLLL SGLIARETKH YVWDKNLDET DDTPIPVLES K