Gene Elen_1936 details


Gene Information

Locus tag: Elen_1936
Symbol:
ID: 8416243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2268830
End bp: 2270635
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 68%
IMG OID: 645024909
Product: hypothetical protein
Protein accession: YP_003182289
Protein GI: 257791683
COG category:
COG ID:
TIGRFAM ID:
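
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2268830
end_bp = 2270635

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1806 601
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.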


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000210591
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.10336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAA ACGATTTCCG CAAGGAGTAC GAGCGCATGC AGCACCAGGT GCGCGCGTCG 
TCCGATCTCA AAGAACGCAC GCTAGCCGCC GCCGAGCGGG CAGCCGACCG CTTCGCCTCC
TCCGCTCAGC CGGTCGCGAC CGCGTCAGCC AAGCGCCCAC ATCGGCGCGC CGGATCGCGC
AGCGGCGGCG TCGCAGTTGC GCGTCGCTGG GGTCTGCCCG CCGCAGCCTG CCTCGTCGCC
GCGGCCATCG TCGCCGGCGG CGTGCCCATG GTCATGGGCG CGATGGACGC GGACGGCCAT
ACGGCCATCT CCCTGAGCGA CGCCCAACAG GCAAGCGGCT TCGCCGTGCG CGCCTACGCC
TCCGACGGCA GCGCGCCGCT CGCGCCCGGC GAGGGGGGCA CCGTTGCGTT CGACCGCGAC
TTGGGCTACC GCTTCTCAGG AGGCGACGAC TACAAAGTGA GCGGCTTCTT CACCGGCTGC
CTCTTCCACG TTGAGGGCGA GGGCATCTCC CGCGTGCAGG CGAACCTGAC GGGCGGAGCC
CTGTACCGCG TGACGTTCGA AGACGGACCG ACCGACCCCG ACGACCCGCG CATGGGCGAG
CTGGCAAGCT GGAAGCCCAC GGCGCGCGGT ACCGGCGAGT ACTACGGCGG CTACGATTTC
GTCGGAAGCT CCATGCGGAA CGGAGAGAGT AAGCTGAGCC TCGCGAAGCT CATGGGTTCC
ACCATCGACG TCTCCGCAAG CGACGACCCC GGCATCGCGG ACGGCACGAC GAGTTTCGGC
CTCTGGACGA ACGAGGGCGA GCCTCCTGAA AACATCATGG GCGACCTGCA ATCCCCCGTC
ATCGACCTGT TCGAAGGACA GACGCTCACC GTCACCGTGA CGTTCGAGGA CGGGCGCACG
TCCACCCAGG CCATCGAGCT GCACGCGGCC AACTTCGAGA CCGAAATGGT CGACGGCACC
CCCCGCCTGA CCACCCGTCT CGCCGCAGAC GACGCCGAAG CCCCCTCGGC AGCGAAATCG
CTCTACGGCA TCGTGGTGAA AGCGGGAAGC GGCCCGTTCC CCTTCCCGCT CGACGACGCG
AACGACCGCG CCGACGAAGT GCTGCCCGCG TCGACCATCG AGCGCCAGGA CGATACCTGG
CGGGCAACCG TCGAGGAGAA CGGCGCGCGC GTCGACGCGA CCCTGCCCGA AAACGCGCTC
ACACCGTCCG ACGGCGAAGT CGCGTTCGAC TTCGGATACG AAAGCACCGG CTCCTCTCAC
CCAGACTCCG AAAGCTCGCA ACAGCCGACC GCGCGCTTGG CCATGAGCTC CCCTTCCATC
TCGCTCTCCG ACACTCTTCC GGGCGGAAAG GCGCTCGACG ACTGCCTCTT TGTCGTGGAC
GGATGGCTGG GCAACGCGCG CTACATGGAC AAATGCTCGC GCGAGGTGTG GGGCTACGGC
TACAACGACG ACGGCACGCT TACGAGCGAC GACTACCGTT ACGCGTCCAC GACGGTGACG
CTGCGCAACC TTGAAGATAC GGCCGTCCCC GTTTGGACGC CCGTGCTCTA CGATTTCGCC
CTGCGCAACG ATGACGGAAC GCTCGACATG GTGCGGACGG GTTACGATCT GGACTTCGAG
GCAACGGGCG ACACCGTGCC CTCCGACGAC CCCCAGCACG TCGTCATCGC GCCGGGCGGC
ACAGTGCAGC TGACCGTGGT GCGCGTCCTG CCCACGTACG TCCTGGAGAG CGGAAACCTG
GTGCTCGTGC CGACCGACGA CGGCAGCCCG TTCTCCCAGG CCTTCTCCCT CGGCGGGCAG
ATCTAG
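
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated blocks as it appears in this record. `gc_fraction` is a hypothetical helper written for this example, not part of the tool that generated the page, and only the first sequence line is used as demonstration input.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content field of the record.
first_line = "ATGAACGAAA ACGATTTCCG CAAGGAGTAC GAGCGCATGC AGCACCAGGT GCGCGCGTCG"
print(f"{gc_fraction(first_line):.1%}")
```

The value for a single line will differ from the gene-wide figure, since GC content varies along the sequence.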
 
Protein sequence
MNENDFRKEY ERMQHQVRAS SDLKERTLAA AERAADRFAS SAQPVATASA KRPHRRAGSR 
SGGVAVARRW GLPAAACLVA AAIVAGGVPM VMGAMDADGH TAISLSDAQQ ASGFAVRAYA
SDGSAPLAPG EGGTVAFDRD LGYRFSGGDD YKVSGFFTGC LFHVEGEGIS RVQANLTGGA
LYRVTFEDGP TDPDDPRMGE LASWKPTARG TGEYYGGYDF VGSSMRNGES KLSLAKLMGS
TIDVSASDDP GIADGTTSFG LWTNEGEPPE NIMGDLQSPV IDLFEGQTLT VTVTFEDGRT
STQAIELHAA NFETEMVDGT PRLTTRLAAD DAEAPSAAKS LYGIVVKAGS GPFPFPLDDA
NDRADEVLPA STIERQDDTW RATVEENGAR VDATLPENAL TPSDGEVAFD FGYESTGSSH
PDSESSQQPT ARLAMSSPSI SLSDTLPGGK ALDDCLFVVD GWLGNARYMD KCSREVWGYG
YNDDGTLTSD DYRYASTTVT LRNLEDTAVP VWTPVLYDFA LRNDDGTLDM VRTGYDLDFE
ATGDTVPSDD PQHVVIAPGG TVQLTVVRVL PTYVLESGNL VLVPTDDGSP FSQAFSLGGQ
I
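
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to produce this record. It builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch does not model). `translate` is a hypothetical helper, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAACGAAA ACGATTTCCG CAAGGAGTAC GAGCGCATGC AGCACCAGGT GCGCGCGTCG"
print(translate(first_line))  # MNENDFRKEYERMQHQVRAS
```

The output matches the first 20 residues of the protein sequence in this record. The final six bases of the gene, ATCTAG, translate to I followed by a stop, consistent with the protein ending in I.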