Gene Elen_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1118 
Symbol 
ID8415408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1347629 
End bp1348843 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID645024080 
Productmolybdenum cofactor synthesis domain protein 
Protein accessionYP_003181477 
Protein GI257790871 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000621722 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000046645 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGAGAGA AGATCATGGA CGGTTTCCCG TCGCGCGAAG AGGCGCTGGC CGACTTCTTC 
GCAGCATGGG AGCCGGCGAG GAGCGTCGAG TACGTGGCGC TCGACGACGC GGTGGGACGC
GTGCTCGCCT GCGACCTGGC GTCGACGAAC ACGCTGCCGG TGGTGCGCGC CTCGTCGTTC
GACAGCATCG CGGTGAAGTC GGCAGCGTTC GCGAACGGCA TGCCCGACAC AAGCAGCTGG
AAGCCCGGCG TGGATTACGT GCGCGCCGAC ACCGGAGACG ACTTCCCCGA CGCGTTCGAC
GCCGTGGTGA TGATCGAGAA GGCGGTCGTT CGGGAGGACG GATCGGTAAC GTTCGACGAC
GACGTGACCG TCGAGCCCGG TTCGGGCGTG CGGCCCGCCG GCTCCACGCT GCGCGCGGGC
GAGCCGCTCA TGAGCGCCGG CAGCATTATC CGACCCACCG ACCTGGCCGC TCTCGCCATG
GGCGGCGCCA CGATGGTGCC CGTGCGCGTC AAACCGCGCG TGGCGTTCAT TCCCACGGGC
AGCGAGCTCG TACCCGCAGG CATCAAGCCC CGACGAGGTC AAAACGTGGA CACGAACAGC
CTCATGTGCA AGCACCTCCT CATCGAGTAC GGTGCCGAAC CCGTGGTGTT CCCCCTCGTG
CACGACGATC CCGTCGAGCT CGAACGCGCC TTCGAGGCGG CGCTCGCCAC CGCCGACGTC
GTGGTGGTCA ACGGGGGATC GGCCCTCGGC GAGGAGGATT TCAACGTGAA GCTGATCGAA
CGCCGCGGGC AGGTGGTGCA CCATTACATC GCCGCCGTGC CGGGACGGCC GCTCATGCTG
GCCGTAGCCG ACGGCAAACC GGTCGTCGAT CTGCCCGGCC CCACCATGGC CGCCTACTTC
GGCTCCGAAT GGTGCCTGCA AGCGATCACG GCGCGCATCC TGGGAATTCC GCTGCGCCGC
CGCCCCGTCG TGCAGGCGCG GGCGGATGCC GCGAAGACGA GCATCCCCAA GATGGCGAAC
ATAGCCCGCG TACACGTGAC GCGCGACGAC GAGGGCTACG CGGCACACTT CCTCGATTTC
AAAGCCGGGG AGCTGGCCGC GTGCATGACG TCGAACGCGC AGCGCGTCTC GCCCCTCGGC
GAAGCGGGAT GGGCCGAAGG CGACCTTTTG GACGTGGAGT TGCTGCGCGG CGAGGAGTTC
GTCGATCAAG GCTAG
 
Protein sequence
MGEKIMDGFP SREEALADFF AAWEPARSVE YVALDDAVGR VLACDLASTN TLPVVRASSF 
DSIAVKSAAF ANGMPDTSSW KPGVDYVRAD TGDDFPDAFD AVVMIEKAVV REDGSVTFDD
DVTVEPGSGV RPAGSTLRAG EPLMSAGSII RPTDLAALAM GGATMVPVRV KPRVAFIPTG
SELVPAGIKP RRGQNVDTNS LMCKHLLIEY GAEPVVFPLV HDDPVELERA FEAALATADV
VVVNGGSALG EEDFNVKLIE RRGQVVHHYI AAVPGRPLML AVADGKPVVD LPGPTMAAYF
GSEWCLQAIT ARILGIPLRR RPVVQARADA AKTSIPKMAN IARVHVTRDD EGYAAHFLDF
KAGELAACMT SNAQRVSPLG EAGWAEGDLL DVELLRGEEF VDQG