Gene Elen_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2047 
Symbol 
ID8416358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2398272 
End bp2399486 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content48% 
IMG OID645025024 
Producthypothetical protein 
Protein accessionYP_003182400 
Protein GI257791794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.531982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCT ACATCGCCGC CCACAAGCAC ATGAAGCTGC ACACGAACCA GCCATATCAG 
CCGCTTTTTG TCGGAGCGTT CCGGTGCCCT GAAGCCAACC GACGCGACGG TTGGCAATAC
GACGATACGG GAGTCGACGA TATCTCGTAC AAAAATGCTA CCTTCTGCGA GTTGACAGGT
GTCAACTGGA TACTGCATAA CAATGAGAGC GACATTACAG GACTTGTTCA CTATCGAAGG
TATTTCCGAT CTTCCACCGA ATGCAACGAA CCTTTATCAG AACAAGAGAT AAGGACCGCT
CTCTCCAAGC ACGACTGCAT CGTTGCACAA CGGACATTCT GCACAAGCAA ACTTGATGGA
TATCTCTGCT CTGCAGCTGA GCAGTACAGA ACATGTCACT CTTCGACCGA TCTCACTCAG
CTCGACAGGG TAATCAAACG GTATTTTCGG TCGTACCATC CTGCTTTCAG GCTCTGCATG
AAACGCGATT ATCTCCATCC CTTCAATATG CTCATATGCA GAAAAGAACT ATTCGATGAG
TACTGCCGCT GGCTTTTTGA GGTCGAAAGC CGACTCGAAG AACGCATCGA CCCTTATCTT
GATCGAGACG ACTATCAAAA AAGAGTATTT GGATTCCTTG CAGAGCGCCT TATGAACGTG
TATCTCGAGG CCAAAGGTAT CGACGTGGTC GAATATCCTA TTTTTGACCC CATCCATCCC
GACGATAGCA GCGTGTTGCC CTTAAAGAAA CCTCCACTTG TCCGATCTGA CGTTGGATTG
TCGTACCCCA CAATTCAGCC TGTCTACGAA GGAATCGACT ACTCGAAGGT GTTCGAATAT
CGCTTTTACC TTACGCATAA TGAAGACTTG GCAAAAGCCT ACTCAGACAA TCCCCAAGAA
TCGCTACAGC ACTTCATTGT GCATGGTGCG CGCGAAAAAC GCATGGCGCA TCCTTGCTTT
TCGGTTGCTT CCTACATGCA AGGACACCCT GAACTTAAGC CCGAATACGG AGACGATCCC
CTCGCATATG TTTCACATTA CCTCTCTACT CCCTCAGAGC GTAATCATGC AACAGGATAC
GAGAATCTGC AGACACCTTC GTTAGAAAAA CGAGAGGCGC TTTCCTCTGA GCGCACTTGC
ACGGGAAAGC GCATCAATAA AAAACGGCTT TCGCGATATA TCGCGAAAGC CGAAAAACTA
CCCGTGCTGG ATTAG
 
Protein sequence
MKIYIAAHKH MKLHTNQPYQ PLFVGAFRCP EANRRDGWQY DDTGVDDISY KNATFCELTG 
VNWILHNNES DITGLVHYRR YFRSSTECNE PLSEQEIRTA LSKHDCIVAQ RTFCTSKLDG
YLCSAAEQYR TCHSSTDLTQ LDRVIKRYFR SYHPAFRLCM KRDYLHPFNM LICRKELFDE
YCRWLFEVES RLEERIDPYL DRDDYQKRVF GFLAERLMNV YLEAKGIDVV EYPIFDPIHP
DDSSVLPLKK PPLVRSDVGL SYPTIQPVYE GIDYSKVFEY RFYLTHNEDL AKAYSDNPQE
SLQHFIVHGA REKRMAHPCF SVASYMQGHP ELKPEYGDDP LAYVSHYLST PSERNHATGY
ENLQTPSLEK REALSSERTC TGKRINKKRL SRYIAKAEKL PVLD