Gene Elen_1423 details

Gene Information

Locus tag: Elen_1423
Symbol:
ID: 8415721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1695907
End bp: 1697469
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 66%
IMG OID: 645024392
Product: protein of unknown function DUF6 transmembrane
Protein accession: YP_003181781
Protein GI: 257791175
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
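
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 1695907
END_BP = 1697469
GENE_LENGTH_BP = 1563
PROTEIN_LENGTH_AA = 520


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1697469 - 1695907 + 1 gives 1563 bp, and 1563 / 3 - 1 gives 520 aa, matching the listed lengths.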


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000166292
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGGGAAG AGGGTTCGAT TCAAGTGGCG GAAACGGTGG CGGGGAACCG TCATGTTGTG 
CGCGGCATCG TATGCGCGCT TGCGGGCGGC ATCTGCTGGG GCTTCTCGGG CACGTGCGCC
CAGCTGCTCA TGAACGACTA CGGCGCTCCC GCCACGTGGA TCACCTGCGT GCGCATGGTC
ATCGCCGCCG TGTTCTTCCT GTTCCTTACC GCGGTGCGCA ACTGGCGCGA TCTCGTAGCG
GTGTTCCGAG ACTGGCGCTC GCTCGCGCAG ATCGCGGCGT TCGCCATATT CGGCGTGCTG
TTGACCCAGC TCAGCTACTT GAACGCCATC TCGTACACGA GTGCCGGCGT GGGCACTACC
ATCGAGCAGG TGGGGCTCGT GCTCATCATG CTGTACGTGT GCGTTCGCGC CAAGCGCCTG
CCGCGTGCTC GCGAGGCGGC AGGCCTCGTG TTCGCGCTGG GCGGCATGCT GATCATCGCC
ACGCAAGGCG AGATCGACCA GCTGGCCATC CCCGCCGAGG GATTGGCCTG GGGCCTCGTG
TCGGCCGTGG CTTTGACGTT CTACACCCTC ATGCCCGTGC GCGTGCTGAA GAAGTGGGGC
TCCATGCTGG TGACGGGCCT TGCCATGCTG TTCGGCGGAT CGGCCGCCTC GGTGGTGGTG
CAGCCGTGGA CCATGCCTGT GAACCTGCCG CTCGGCGGCA TCGCGGCGCT GGTTGCCATC
GTGATCGTGG GCACCTTGGG AGCCTACATG CTGTATCTGC AGGGCGTGAA CGACGCAGGC
CCCGTCAAGG CGAGCCTTCT GTGCTGCGTC GAGCCCGTTT CGGCCATGAT CCTCGCGCTC
GCGTGGCTCC ATACGCCGGT GAGCGGCTGG GACCTCGCAG GATGCGCGCT CATCGTGATC
ATGATCTTCC TCGTCACCGA GCGAGAGCCG AAAACGGAGC AGGCCGCCGA GGGCGAGGGC
GCGCTCGCCG ACGCCTACGA CGACCCGCCG CTGTTCGCAG GCCGCGCTTC GGTGCTGGGC
TACTACACCA GCCGTCCGGC CACGCGCGAT GATTTCGAGC GTGCCACGGC GCTGCTCGAC
GTCGGGCATC AGACGTTCGC GGAGCTCGGC ATAGACGAGG GTCGGAGCAA GAAGTACCCA
TCGGCGCGTC GTCTCATGCA CAGCATCAAG AACGGCACGA CGCACGTCAT CGAGGATGCC
CACGGCCGCA TGATCGCGAT GTTCGCCGTG TCCTTCTCGC CTGACAAGAA CTACGAGCGC
CCCATCGACG GCGCTTGGCT CACCGACACG TCGGCCGAAC CGCAGCCCTA TGCGGAGCTG
CATTGGGTGG CCGTCGACTA TCCGGCTCGC CGCCGCGGCG TCGGCATGTT CATCCTCGAC
AAGGCCGACC AGATCGCCCG TGCGGGCGGC CGGTCCAGCA TTCGCGCCGA CGTCTACGAG
CTGAACGGGC CCATGCAGAA CCTGCTTGAG AAGCACGGAT ACGAACGCTG CGGAACCATC
ACGATCAAAG ACGTGTTCGG GCGTGTGAAA CATCGCGTGG GCTACGAGAG AATGTTGCGT
TGA
 
Protein sequence
MREEGSIQVA ETVAGNRHVV RGIVCALAGG ICWGFSGTCA QLLMNDYGAP ATWITCVRMV 
IAAVFFLFLT AVRNWRDLVA VFRDWRSLAQ IAAFAIFGVL LTQLSYLNAI SYTSAGVGTT
IEQVGLVLIM LYVCVRAKRL PRAREAAGLV FALGGMLIIA TQGEIDQLAI PAEGLAWGLV
SAVALTFYTL MPVRVLKKWG SMLVTGLAML FGGSAASVVV QPWTMPVNLP LGGIAALVAI
VIVGTLGAYM LYLQGVNDAG PVKASLLCCV EPVSAMILAL AWLHTPVSGW DLAGCALIVI
MIFLVTEREP KTEQAAEGEG ALADAYDDPP LFAGRASVLG YYTSRPATRD DFERATALLD
VGHQTFAELG IDEGRSKKYP SARRLMHSIK NGTTHVIEDA HGRMIAMFAV SFSPDKNYER
PIDGAWLTDT SAEPQPYAEL HWVAVDYPAR RRGVGMFILD KADQIARAGG RSSIRADVYE
LNGPMQNLLE KHGYERCGTI TIKDVFGRVK HRVGYERMLR
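
The protein sequence relates to the gene sequence through the translation table listed in the record (table 11). The following is a minimal sketch of that translation, not the pipeline that produced this page. It assumes the gene sequence is given 5' to 3' in the coding frame, builds the codon table from the standard NCBI amino acid ordering for table 11, and does not handle alternative start codons (not needed here, since the gene starts with ATG). The names `translate`, `GENE_START`, and `GENE_END` are hypothetical; the two fragments are copied from the first 30 and last 18 bases of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


# Fragments copied from the gene sequence in this record.
GENE_START = "ATGCGGGAAG AGGGTTCGAT TCAAGTGGCG"  # first 30 bases
GENE_END = "GAGAG AATGTTGCGT TGA"                # last 18 bases, in frame

print(translate(GENE_START))  # MREEGSIQVA
print(translate(GENE_END))    # ERMLR
```

The two outputs match the first ten and last five residues of the protein sequence listed above, and the final TGA codon is the stop.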