Gene Elen_1829 details

Gene Information

Locus tag: Elen_1829
Symbol:
ID: 8416133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2149863
End bp: 2151305
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 66%
IMG OID: 645024799
Product: General substrate transporter
Protein accession: YP_003182182
Protein GI: 257791576
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
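
The coordinate and length fields in this record are related by simple arithmetic. The sketch below cross-checks them, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2149863, 2151305)
print(length)                  # 1443
print(protein_length(length))  # 480
```

Under these assumptions the computed values agree with the Gene Length (1443 bp) and Protein Length (480 aa) fields.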


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.392144
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.18053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG TAAAGCATAT GACCCAGGTA ACGAGCCCTC CCATGGGCGA GGTCGCCGAC 
GGCGCCAAGC AGATCGGCGG GGACGTGGTC CACAAGGTGA AGAAGTCGGC CAAGAAGACC
ATGGACGACG TGGCCATGAC GCCCTTCTTG CGCAAGATCA CGTTCTTCTC GAGCGGCGGC
TCGTTTTTGG ACGGGTACGT GCTCTCGCTC ATCGGCGTGG CGCTCACGCA GATCACACCG
CTGTTCAACC TCGACGAGGC GTGGAGCGCG GCCATCGGGG CGTCGGTTTT GCTGGGCATC
TTCGTGGGCA CGATCGCGGG CGGCTACCTC ACCGACCGCA TCGGGCGCAA GAAGATGTTC
ATCGTCGACA TCGTGGCCAT CGGAACCTTC TCCATCCTGA GCGTGTTCTG CGCCGACCCG
CTCCAGCTCG TGGCGGCGCG CTTCTTCATC GGCGTGTTCG TAGGCGCCGA CTACCCGATA
GCCACCTCGC TCATCGCCGA GTTCACGCCC AAGCAGCACC GCTCCATCTC CATGGGCATG
GTGTCGGCCG CCTGGTACCT CGGCGCCACG GTGGCGGCGT TCGTGGGGTA CTTCCTGTAC
AGCGTGCCCA ACGGATGGCA GTGGATGCTC GGCTCGGCCG TCATCCCCTG CATCATCCTG
CTGGTCGGGC GGCACGACAT CCCCGAGTCG CCCATGTGGC TGGCTCAGAA AGGCCGCACC
GAGGAGGCCG ACGCCGTCAT GCGGCGCGTG TTCGGCGAAG GCGTGGAGCT CGAGCTGGAG
GATCCGGGCG AGAAGACCAG CCTGCGCAAG GTGTTCGCGG GCGGCTACGC CAAGCGCATC
GTGTTCTTGG GCATCCTGAC GCTGTGCCAG GTGGTGCCGA TGTACGCCAT CTACACGTTC
GGCCCCGAGA TCATGACGGC CTTCGGGCTG GGGGAGGGCC ACGAGGCCAT CCTGGGCGAG
AGCGTGGTCA GCCTGTTCTT CCTGATCGGT TCCATCCCGG CCATGTTCTG GCTGAACTCG
ATGGGCCGCC GTCCGCTGCT CATCCGCTCG CTCGCCCTCA TGGCGGTGGG CCTGGTCATC
CTGGGCGTGT TCCCGGACGC CCCCATCTAC GTGATCATCC TGGGGTTCGG CCTGTACGCG
TTCTTCAGCG GCGGCCCGGG CATCCTGCAA TGGCTGTACC CCAACGAGCT GTTTCCCACC
GAGGTGCGCG CCTCGGCGGT GGGCATCGCC ATCGCGTTCT CGCGCATCGG CACTATCATC
GCCACGTACG GCACGCCGCT GTTCCTGGCC GCCTACGGCA TCGGCCCCAC CATGCTGATC
GCGGCGGGCC TCGTGATCCT GGGCCTCGTG CTGTCGGCGT TCATGGCGCC CGAGACGAAG
GGCAAGTCGC TTCTGGAGAC GAGCTCGCTC GACGAGGGGG ACGCGCACCC GCGCGGGGCG
TAG
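
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any computation. The sketch below shows one way a GC content value such as the one in the Gene Information section could be recomputed from the block-formatted text. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as sample input.
first_line = "ATGAGCAACG TAAAGCATAT GACCCAGGTA ACGAGCCCTC CCATGGGCGA GGTCGCCGAC"
seq = clean_sequence(first_line)
print(len(seq))                       # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this sample line
```

Passing the full gene sequence through the same two functions gives a value to compare with the reported GC content.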
 
Protein sequence
MSNVKHMTQV TSPPMGEVAD GAKQIGGDVV HKVKKSAKKT MDDVAMTPFL RKITFFSSGG 
SFLDGYVLSL IGVALTQITP LFNLDEAWSA AIGASVLLGI FVGTIAGGYL TDRIGRKKMF
IVDIVAIGTF SILSVFCADP LQLVAARFFI GVFVGADYPI ATSLIAEFTP KQHRSISMGM
VSAAWYLGAT VAAFVGYFLY SVPNGWQWML GSAVIPCIIL LVGRHDIPES PMWLAQKGRT
EEADAVMRRV FGEGVELELE DPGEKTSLRK VFAGGYAKRI VFLGILTLCQ VVPMYAIYTF
GPEIMTAFGL GEGHEAILGE SVVSLFFLIG SIPAMFWLNS MGRRPLLIRS LALMAVGLVI
LGVFPDAPIY VIILGFGLYA FFSGGPGILQ WLYPNELFPT EVRASAVGIA IAFSRIGTII
ATYGTPLFLA AYGIGPTMLI AAGLVILGLV LSAFMAPETK GKSLLETSSL DEGDAHPRGA
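
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a minimal, dependency-free translator. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The table layout and the function name are illustrative, and codons containing ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCAACG TAAAGCATAT GACCCAGGTA"))  # MSNVKHMTQV
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above.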