Gene Elen_2201 details


Gene Information

Locus tag: Elen_2201
Symbol:
ID: 8416523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2585112
End bp: 2586173
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 68%
IMG OID: 645025187
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003182552
Protein GI: 257791946
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
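
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is the stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2585112
end_bp = 2586173
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# One codon per amino acid, plus one codon for the stop.
codons = gene_length_bp // 3

print(gene_length_bp)                   # 1062
print(codons - 1 == protein_length_aa)  # True
```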


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.26861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG AGCGCGTTGA AACCTGCTTA ACCTGCGAGT CCGCCGTCGG GTTCGGCAGT 
TCCCAGTCGC TCGGCAGTCC TGGCTCGCAC GAAACGCCCA CAGGGCGTTT CGGCTCGATG
GGTGCGGCGC CGCGGAGCGA AGCGCAGCAA GTCCCCGAAG GGGAAACTCG CTTGGTGGGA
ACTGCCGAGC CCAACGGCTC CAAGGATGCC GCGCCGCCCC TTCTGCATGC TCCTCGCGCG
GCTCGGGTGG GGAATAGGAG GTTGACGCTG GCGGCGTTTT TGGGTGCTTT GGCGGCGCTG
GCGGCTGTGG TGGTCGCGGG GCTCGTCGTC GCGGATGCGG CTACGGCAAC CGACTTCTCG
ATGAAGAACC TCGCTCCCAG CTTCGCCCAT CCTTTCGGCA CCGACTGGAT GGGGCGCGAC
ATGCTGCTGC GCACGCTGGC GGGGCTGTCC ACCAGCGTGC TGGTGGGTCT TCTGGCCGCG
GGCGTGTCGT CCATCATCGC GCTGGTCATG GGCGCGGTGG CGGCGCTGGG CGGGAAGAAG
GCCGATGCCG CGGTCACCTG GCTCATCGAC CTCATGCTGG GCATCCCCCA TATCGTGCTG
CTCATCCTCA TCTCGTTCGC GTTGGGGAAG GGCTTCTGGG GCGTCACCAT CGGCGTGGCC
GTGACGCACT GGCCCAGCCT CGCGCGCGTC GTGCGCGCCG AGATCCTGCA GTGTAAGCAG
TCGACGTTCG TGGCTGCGGC GCGGCGGCTG GGGCAGACTC CTCTGCGCAT CGCGGCGAAG
CACATGGTTC CTTACGTGCT GCCCCAGTTC ATCGTGGGGT TGATTCTGCT GTTCCCGCAC
GCCATCCTGC ACGAGGCCGC CGTGACGTTC CTCGGCTTCG GTCTGCCGCC CGAGCAACCC
GCCATCGGCG TGATCCTCAG CGAGTCGATG GCGTACCTGT CGGCCGGCAT GTGGTGGCTC
GCCGTGTTCC CGGGTCTCGC GCTCATCGCG ACCGTGCTGC TGTTCGACCT GGCGGGATCC
AGCCTGCGCA AGCTCGTCGA CCCGCACAGC GCGCAGGAAT AG
 
Protein sequence
MADERVETCL TCESAVGFGS SQSLGSPGSH ETPTGRFGSM GAAPRSEAQQ VPEGETRLVG 
TAEPNGSKDA APPLLHAPRA ARVGNRRLTL AAFLGALAAL AAVVVAGLVV ADAATATDFS
MKNLAPSFAH PFGTDWMGRD MLLRTLAGLS TSVLVGLLAA GVSSIIALVM GAVAALGGKK
ADAAVTWLID LMLGIPHIVL LILISFALGK GFWGVTIGVA VTHWPSLARV VRAEILQCKQ
STFVAAARRL GQTPLRIAAK HMVPYVLPQF IVGLILLFPH AILHEAAVTF LGFGLPPEQP
AIGVILSESM AYLSAGMWWL AVFPGLALIA TVLLFDLAGS SLRKLVDPHS AQE
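
The sequences above are shown in blocks of ten characters for readability. As a rough sketch of how the gene sequence relates to the GC content and protein sequence fields, the Python below strips the display spacing, computes a GC percentage, and translates codons up to the first stop. The function names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical helpers, not part of any database tool. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 differs from the standard code only in its permitted start codons, and this gene begins with ATG.

```python
# Codon table built in the conventional TCAG order. NCBI translation
# table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    seq = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGGCTGACG AGCGCGTTGA AACCTGCTTA ACCTGCGAGT CCGCCGTCGG GTTCGGCAGT"
print(translate(first_line))  # MADERVETCLTCESAVGFGS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the protein sequence and GC content reported in the record.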