Gene Elen_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1014 
Symbol 
ID8415304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1229446 
End bp1230519 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content70% 
IMG OID645023978 
Productoxidoreductase molybdopterin binding 
Protein accessionYP_003181375 
Protein GI257790769 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0972537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.212982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGC CAGCTCAGAA GATCATGGCC GGCGTGGCGG GCGCGGTGCT GCTCGCATCC 
GGCGCCGGCG CGGCCCTTGC CGCCCAGCAG CCCGCGGCTG CGGACGGGGG CTCCCGGCCC
GTCGGCGCCA CCCACACGGT GGCCGACCAC GACATCCGCG CCCAGTGGCT GGGCGAGGAA
TCCGACTACG TGCGCGTCGC GGACGTGCAG GGGTCGTTCA CGTTCAATCA GGAGGGCGTC
ACGCCCAACG ACGAGCTGTT CAACGTGTTC GGAACCGCCA TCCTGTCGAT GTGCTCCAAG
CCCGCGCCCG AGCTTGCCGC CGGGCAGGAC GGCGTGGCCA CCTACTTCGT GAACGTGGGC
GGGCACGTGA AGGAGAGCTT CACGGTGGAC CTGTCCGAGC TCGACGACGA GGAGCAGGAG
GCGCTCATGG GCTGCTCGTG CGCCACGGGG TCGCCCTTCG GCCAGGCGGC CGTCATCGGC
GTGCCGCTGG CGTCGGTGGT GGGCATGGCC GACCTCGAGG ACGGCGTGAA CACCGTGACG
GCCTACGGCG CGGACGGCTA CGGCGAGCCG CTGCCGCTGC AGTACGCGCT CGACAAGAAC
GCGCTGCTCG TGTACCAGGT GAACGGCGAG GAGCTGAAGG CGTCGGAGGG CTCGAGCCTG
CAGCTGTGGA TGCCCGAGAC GGTGGCGCGC TACTTCACGC GCAACATCGC CAGCATCGAG
CTCACGCGCG AGGACGCGGT GCCCGAGGTG GCCTCGGTCG ATCCCATGTA CCGCAACAAG
ATCGAGATCA AGAACTCCGC CGACGGCTGC GCGTTCGAGG CGGGCGACGA GATCACGTTC
GAGGGCGTGG CCGACGACTG CGGAAGCCCC ATCGCCGCCA TCGAGTTCTC CTTCGACGGC
GGGCGCACCT GGACGGCGTG CGACACCGAC GGCGCCACGG CCGACAAGTG GGTGAACTGG
CAGTTCACCG CCTCGTTCGA GGAGAAGGGC GACTACGAGA TGACCGTGCG CGCCCGCACG
GCCGACGACG TGGTGTCGCC GCTGTCCGCC AGCCTTGCCT TCGCGGTGCG GTAG
 
Protein sequence
MNKPAQKIMA GVAGAVLLAS GAGAALAAQQ PAAADGGSRP VGATHTVADH DIRAQWLGEE 
SDYVRVADVQ GSFTFNQEGV TPNDELFNVF GTAILSMCSK PAPELAAGQD GVATYFVNVG
GHVKESFTVD LSELDDEEQE ALMGCSCATG SPFGQAAVIG VPLASVVGMA DLEDGVNTVT
AYGADGYGEP LPLQYALDKN ALLVYQVNGE ELKASEGSSL QLWMPETVAR YFTRNIASIE
LTREDAVPEV ASVDPMYRNK IEIKNSADGC AFEAGDEITF EGVADDCGSP IAAIEFSFDG
GRTWTACDTD GATADKWVNW QFTASFEEKG DYEMTVRART ADDVVSPLSA SLAFAVR