Gene Elen_0655 details


Gene Information

Locus tag: Elen_0655
Symbol:
ID: 8414945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 832878
End bp: 834497
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 61%
IMG OID: 645023630
Product: hypothetical protein
Protein accession: YP_003181027
Protein GI: 257790421
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID:
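
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name `check_gene_record` is hypothetical, and it assumes that the coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
def check_gene_record(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> dict:
    """Cross-check reported lengths against the replicon coordinates.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    (both are assumptions, not statements made by the record).
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = span // 3                # whole codons, including the stop
    expected_protein = codons - 1     # drop the assumed stop codon
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": span % 3 == 0,
        "expected_protein_aa": expected_protein,
        "protein_length_matches": expected_protein == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
report = check_gene_record(832878, 834497, 1620, 539)
print(report)
```

Under these assumptions the span is 834497 - 832878 + 1 = 1620 bp, which is 540 codons, or 539 residues once the stop codon is excluded, in agreement with the reported gene and protein lengths.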


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.507478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0435014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAGA AGCTGATGGT GGTGACCGCG ATGCTGATGG CGGGATCGCT GTTCGCCGCC 
GGTTGCGCGC AGCAAGGAGC CGCGGAAGGG GAGAGCTCTT CGACCGAGGT TCAATATCAG
AACAGCATCG AGGAGGCCGA AGCGTATCTT GACGGGGTGA ACGAGCGTCA GGACGAGCTG
AACAAGGAGT ACGCTCCCGA AATCCGCACC TTGGAAGACG GCACGAAGGT TCAGCGCACG
CCGACCGAGT ACCAGTGCTA CCACTGGAAC CGTCCGTACG AAGGCGGCAC GTCGTACAAC
AACTACTATC TTGATGCCGA CAACCGCGGT TGCGGGGCTT GCCATGAAGA TCTCGGCGAC
GCCCTGGCCA ACATGGAGTA CAGCCATCCG ACCGTCTGGA ACGACGCCCT CGGCAGTAAG
ATCACGGTCG ACCAGTGCAT GCTCTGCCAT AGCGCAGATG ACGGCTACGA GATGGGCACC
GTCATGCACG GCGTCCACTA CGGCGAACGC AACGGCGAGA ACTTCGAGGA GCGTGGCGGC
AAATGCATCA GCTGCCACAA CATGACGGAG AACGAGCAGG GCGCCGAGCT TTGGGACGTG
GTCAAGTACG ACCATCTTGA CGGCATCGTC AAGGTTCCCG ACGTCCAGGG CGAGTTCGTG
TTCGATCAGG ACACCGTTGT GGCCCCCGAG GACATGTTCA CCTACGACTG GATCCATTCC
CGCTACGACA GCCTGATCCA CATCATGGGC AAGAACGTGC TGGATACGGA GCTTCCGAAG
GACTTCGTCG ACAATTTCGA GATCACCATG GACGGCCTGG TGAACAAGCC GTTCACGGCG
AAGCTGGCCG ACCTGATCTC CGAAGCCGAG GCGGCCGGGG TCACCGTGAC GAAGCTTTCG
AAGATCCACT GCGTTGACAA CATGCCCGGT GGCGGCGGCA TCTCGAACGT CGAGATCACG
GGCATCCCGC TGACCTGGCT TATCGAGAAG GCGGGCGGCG CGAGCAAGGA CATCACCGGC
GTGCTGTTCG ATCGTCAGGA GTTCCGCACC AACGGCGCCC AGACGCACAG CAACCGCGGC
GTCCCCGAGG CTTCGTTCGA AGACGTGTAC CTCGTTTACG AGATCGGCGG CAAGCCGCTC
GATCCCAGCC AGGGCAGCCC CTGCATCAAC TGGGTCGAGA AGTGCGACGC CCAGTCGTAC
GTCAAGCAGT GCGTGGGCTA CAAGCTGACC GACGAGGAAA AGCCGTGGGA AGACTGGATG
ATGAACGGTT TCAATTCGTA CGGCGAGGGC CCCTACATCA ACAAGCCGAA TGCGACGGTG
CTCAAGGTTC CCGAGGGCAT GATCATCGAG ACCGGCAAGC CCTTCACCTT CGAGGGATAT
GCGGACGCCT ACGACGAGGC GGTGGTCAAG CTGGAGTTCT CCATGGACAA CGGCAAGACC
TGGACGCCGT ACGATCTGGG CCAGACCGAT CCTAGCAAGT GGGTGTACTG GCACTACACG
TGGACTCCTG AAAGCGACGG CAGCTACGTG CTGATGGTGC GGGGCACCAC CGATACCGGC
CTCGTGGGAA CCAACATCCA AAAGGTGATG GTGACCGCCA AGAGCAATGT GGAGGGATAG
 
Protein sequence
MRKKLMVVTA MLMAGSLFAA GCAQQGAAEG ESSSTEVQYQ NSIEEAEAYL DGVNERQDEL 
NKEYAPEIRT LEDGTKVQRT PTEYQCYHWN RPYEGGTSYN NYYLDADNRG CGACHEDLGD
ALANMEYSHP TVWNDALGSK ITVDQCMLCH SADDGYEMGT VMHGVHYGER NGENFEERGG
KCISCHNMTE NEQGAELWDV VKYDHLDGIV KVPDVQGEFV FDQDTVVAPE DMFTYDWIHS
RYDSLIHIMG KNVLDTELPK DFVDNFEITM DGLVNKPFTA KLADLISEAE AAGVTVTKLS
KIHCVDNMPG GGGISNVEIT GIPLTWLIEK AGGASKDITG VLFDRQEFRT NGAQTHSNRG
VPEASFEDVY LVYEIGGKPL DPSQGSPCIN WVEKCDAQSY VKQCVGYKLT DEEKPWEDWM
MNGFNSYGEG PYINKPNATV LKVPEGMIIE TGKPFTFEGY ADAYDEAVVK LEFSMDNGKT
WTPYDLGQTD PSKWVYWHYT WTPESDGSYV LMVRGTTDTG LVGTNIQKVM VTAKSNVEG
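
The gene and protein sequences above can be related with a short translation routine. This is a minimal sketch under stated assumptions, not the database's own tooling: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built from the amino-acid string that NCBI lists for translation table 11 in TCAG codon order, and only the first and last lines of the gene sequence are used as input. Table 11 also permits alternative start codons, which this sketch does not handle; this gene begins with ATG, so that case does not arise here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last 60-base lines of the gene sequence shown above.
first_line = "ATGAGAAAGA AGCTGATGGT GGTGACCGCG ATGCTGATGG CGGGATCGCT GTTCGCCGCC"
last_line = "CTCGTGGGAA CCAACATCCA AAAGGTGATG GTGACCGCCA AGAGCAATGT GGAGGGATAG"

print(translate(first_line))  # MRKKLMVVTAMLMAGSLFAA
print(translate(last_line))   # LVGTNIQKVMVTAKSNVEG
```

Because each displayed line holds 60 bases, line boundaries coincide with codon boundaries, so individual lines can be translated on their own. The two outputs match the start and end of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.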