Gene Elen_0351 details


Gene Information

Locus tag: Elen_0351
Symbol:
ID: 8414635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 452037
End bp: 453338
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 59%
IMG OID: 645023328
Product: General substrate transporter
Protein accession: YP_003180731
Protein GI: 257790125
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
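
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(452037, 453338)
print(length_bp)                  # 1302
print(protein_length(length_bp))  # 433
```

Both results match the reported Gene Length (1302 bp) and Protein Length (433 aa).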


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0092051
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.434998
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGCAG CTACAAAGAA AAAAACTTCG TTAGCCAGGG CGGTGTTCTC GGGGAGCCTT 
GGGAATATGC TGGAATGGTT CGACTACGGG TTGTACGGGT ACTTCGCAGC GATCATTTCC
GCAGATTTCT TCGTTGCCGA CGACCCGATC GTCGGGCTCT TGCTCAGCTT CCTGGTCTTC
GGAACGGGCT TTCTCGTACG CCCCATCGGC GGTATTCTGA TCGGCGCGTA CGCCGACAAG
CACGGAAGAA TCAAGGCTTT GACGTTGACG ATCCTGTGCA TGGGCATCTG CACCATGCTC
ATGGGATGCT TGCCAACGTA TTCCCAGGTG GGACTTTTGG CACCCATCCT GCTGACGGTG
CTGCGCCTGC TGCAGGGCCT GGCAACGGGC GGCGAGTTCG GCTCTTCGCT GACGTTCATC
TCGGAGTACG GAACGCCGAA CAACAGGGCC TTCCTGTGCA GCTGGCAGCC GTTCAGCGTC
GGTTGCGGCC TGCTCCTGGG GTCCACCGCG GGGCTGCTCG TCACCACGCT TCTGCCTGAA
GCGGCGCTGT ACGAGTGGGG ATGGCGCGTG CCGTTCCTCT GCGGAATCCT GATCGCCTTC
TACGGCGTGC ACATGCGCAA GAACGTCCCC GATTCCCCTG AGTTCCTGAA GGCGAAGGCG
GAAGTGAAGG AAGAGGACCA TACGCCGGTC AAAGACCTGT TCCTGCGCTA CAAGAAGTCC
ATCATCACGG TGATCGGGCT GCTCGTCGGC TCCAGCGCGA CCTACTACAT CCTGATCACC
TATATGCCGA CGTACATCTC GCAGTTCATG GGGACGTCGT TCTCGAGCGC GTTCGTCGTC
AATACGTCGG TCATCGCGAT CAACTTGCTG CTCTGTCCCA TCGTGGGCCT GCTCATCGAC
AAGGTGGGAA GGCGGAAATG CCTGATCATC GGCTGTCTCG GGTTCCTTAT CCTGTCTTAT
CCGGTGTTCT ACGTGCTGAT CCAGCAGACC AACGCTCTCT TGATGATCGG CTTGCTGGGA
GTGCTGATCG TGTTCCAGAC GATTCTCGCC GTCGCCATCG TGGTGGTTTC GGCCGAGGTG
TTCCCCACCA AGCTCCGCAA CAGCGGCATC GGCTTCTCCT ACAATATCGC GGCGGCGGTC
TTCGGCGGCT TGGCGCCCCT TGCGGCAACG GCGCTCATCG CGGCCACGGG CGATCGCCTG
TCCATCACCT ACTTGATGAT AGGTTCCGTC CTCATAACGC TCTTGACCAC GATCTTCCTT
TTGAAGGGGT ATTACGTCAA GGGAAAGAAC GCGTCGAGCT AG
 
Protein sequence
MVAATKKKTS LARAVFSGSL GNMLEWFDYG LYGYFAAIIS ADFFVADDPI VGLLLSFLVF 
GTGFLVRPIG GILIGAYADK HGRIKALTLT ILCMGICTML MGCLPTYSQV GLLAPILLTV
LRLLQGLATG GEFGSSLTFI SEYGTPNNRA FLCSWQPFSV GCGLLLGSTA GLLVTTLLPE
AALYEWGWRV PFLCGILIAF YGVHMRKNVP DSPEFLKAKA EVKEEDHTPV KDLFLRYKKS
IITVIGLLVG SSATYYILIT YMPTYISQFM GTSFSSAFVV NTSVIAINLL LCPIVGLLID
KVGRRKCLII GCLGFLILSY PVFYVLIQQT NALLMIGLLG VLIVFQTILA VAIVVVSAEV
FPTKLRNSGI GFSYNIAAAV FGGLAPLAAT ALIAATGDRL SITYLMIGSV LITLLTTIFL
LKGYYVKGKN ASS
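
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to recompute the GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same 10-character block layout (not the gene above).
example = "ATGCGGCCAT GC\n"
print(clean_sequence(example))  # ATGCGGCCATGC
print(len(clean_sequence(example)))
```

Applied to the full gene sequence listed above, `len(clean_sequence(...))` should agree with the reported Gene Length, and `gc_percent(...)` can be compared against the reported GC content.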