Gene Elen_1953 details

Gene Information

Locus tag: Elen_1953
Symbol:
ID: 8416263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2292478
End bp: 2293617
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 60%
IMG OID: 645024929
Product: hypothetical protein
Protein accession: YP_003182306
Protein GI: 257791700
COG category:
COG ID:
TIGRFAM ID:
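
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive start and end positions and a single terminal stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2292478, 2293617)
print(length, "bp")                      # 1140 bp
print(protein_length_aa(length), "aa")   # 379 aa
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.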


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000104092
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGGTA TCATCACGCA TTCGAATTCG AGCATGAGCA GGCAAGCTCT TCGACAGCAG 
GCGGAAAAGC ATCGTCTCGT GCGCCTGTTC CGTGGCGCGT ACATGGACGC GCAGGAGTAC
GCGGCGCTCG ATATCTCTGG AAGATACCGG GCTCGTGCCC AGGCGTTTCT TGCGACGCAT
GCGAAGCTTC GAGCGTGGGG GATCACCGCA GCCGCTCTTG AGGGCGCGCC GGTCCTCGGC
GGAGCGCCTT TGCATTTCGG CGGCGCGCGA AGCCACGCCA AGAGCAAGCA GGACGGCTGC
GCTTTTCACG AGGCTTCGCT TGAGACGCCA TCGAACCCGG TAGCGCAAAC GCTTTTCGAG
TGCGCCTCGA CCTCTCCTTT GCCGGATGCG CTTTTGGCTG CGAATTATCT GTTGCGTCGT
TCTTCCGCGA AAGCGCAGGG CGGTCTTGTG GCATGCAGGG ATATTGACGA GAGTACGACC
GAAGCGCTCG TATGGGAGCC TGCAACCTCT GGGAGCGGGG AAGCGAGCAT CCGCTCTGCG
TTCGATACCC GCATGCTCGA CATGGAAGAA CCTGAATTCC TTGCTAACTA CAGCGCTGCG
CGCGTGACCG GATTCGTTTC GCCGGAAGCC GAGCTTCTGT GGCTCGCCTT CGCGCAGCTC
TGCTTCGCCA ATGGAGGAAA ACGCGGAATA CGCAGCGCGT TGAAGGCGGG GCTGTACTTT
ACCGACCAGG TCGAATCGCC GGCCGAGTCG TTTCTGATCG CCCGTTGCGT CGAACTTGGC
TTCGAAATTC CCTATCTGCA GGTCAACATT CTCGACCCTT CGAACGGGAG GCATCTTGGT
CGCGTCGACG GGCTTTGGCC TTCTGAAGCC GTACAGAAGA GCCTCTATCG AAGCGATAGC
AGGTTCGGGC GCTTTCTCCA ATGCAGGCGG CTTGGAGACA ACGGCTCCAT CGTCATCGAC
TTCGACGGCA AGCTGAAGTA CCGGCAGGAT TATGCCGAAA TTTTGGAAAG AGAGCGACAG
CGGCAAAATG CCATAGGGAA TCTCGGGTTT CGGTTCGTGC GCATCGGCTG GGACGATCTC
ATGCGGCCCG AGCGCTTGCG TTCGATCCTC GAAGCGGCTC GCGTCCCGCG TTGCAGGTGA
 
Protein sequence
MPGIITHSNS SMSRQALRQQ AEKHRLVRLF RGAYMDAQEY AALDISGRYR ARAQAFLATH 
AKLRAWGITA AALEGAPVLG GAPLHFGGAR SHAKSKQDGC AFHEASLETP SNPVAQTLFE
CASTSPLPDA LLAANYLLRR SSAKAQGGLV ACRDIDESTT EALVWEPATS GSGEASIRSA
FDTRMLDMEE PEFLANYSAA RVTGFVSPEA ELLWLAFAQL CFANGGKRGI RSALKAGLYF
TDQVESPAES FLIARCVELG FEIPYLQVNI LDPSNGRHLG RVDGLWPSEA VQKSLYRSDS
RFGRFLQCRR LGDNGSIVID FDGKLKYRQD YAEILERERQ RQNAIGNLGF RFVRIGWDDL
MRPERLRSIL EAARVPRCR
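
The gene sequence begins with GTG, while the protein sequence begins with M. A minimal sketch of how the two can be related under translation table 11 is given below. It assumes that the first codon is an initiator and is read as methionine regardless of its usual meaning (GTG is one of the alternative start codons in table 11), and that translation ends at the first stop codon. The function name `translate_cds` is hypothetical and is not part of the source database.

```python
from itertools import product

# Codon table in the NCBI base order T, C, A, G. Table 11 assigns the same
# amino acids as the standard code; it differs in the start codons it allows.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M (assumed initiator)."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    if not codons:
        return ""
    residues = ["M"]  # assumption: initiator codon always yields methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence above
print(translate_cds("GTGCCCGGTA TCATCACGCA T"))  # MPGIITH
```

Applied to the first 21 bases of the gene sequence, the sketch reproduces the first seven residues of the protein sequence.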