Gene Elen_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2067 
Symbol 
ID8416384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2433757 
End bp2434749 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content69% 
IMG OID645025049 
ProductPorphobilinogen synthase 
Protein accessionYP_003182419 
Protein GI257791813 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.355719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00258041 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGATTTC CCGCTTACCG TCCGCGCCGC ATGCGTGCGA ACCCCGCGGT GCGCGCGTTC 
GTGCGCGAGA CGCGCGTGGA ACCGGGCGAT CTGGTATACC CGGTGTTCGT GAAGCCGGGG
GCCGGCGTGC GCGACGAGGT GGCGTCCATG CCGGGCGTGT TCCAGCTTTC GATCGACCAG
CTGGCGGCCG AGGTGGACGA GCTTCGAAGC TGCCGCGTGA ACTCGCTCAT GCTGTTCGGC
CTTCCTGCGC GCAAGGACGA GCGGGGCAGC GAAGCTTACG ACGACCGCGG CGTGGTGCAG
CAGGCCGTGC GCGCCATCAA GGAACACGCG CCCGACTTCC ACGTGATCAC CGACGTGTGC
TTGTGCGAGT ACACGAGCCA CGGCCATTGC GGCGTGCTCG ACGAGCGCGG GGGCGTGGAC
AACGACGAGA CGCTCGGGCT CCTGGCGGCC GAGGCGGTGA GCCATGCGCG CGCCGGAGCC
GACATGGTGG CCCCGTCCGA CATGATGGAC GGGCGCGTGG GCGCGCTGCG CTCGGCGCTC
GACGAGGCGG GCTTTTCGCA CGTGCCCATC ATGGCGTATG CGGCGAAGTA CGCGTCGGGC
TACTACGGGC CGTTCCGCGA TGCGGCCGAT TCGGCGCCGG CGTTCGGCGA CCGCTCGGCG
TACCAGATGG ATCCCGCGAA CAGCGTCGAG GCGCTGCGCG AGGTGCGCCT CGACATCGAG
GAGGGGGCCG ACCTTGTCAT CGTGAAGCCG GCGCTATCCT ATCTGGACGT GGTGCGGCGC
GTGAAGGACG CCTTCGCGTT TCCCACCGTG GCCTACAACG TGTCGGGCGA GTACGCCATG
GTGAAGGCCG CCGCCGCGCA AGGGTGGATC GACGAGCGCC GCGTGGTGCT GGAGACGCTG
CTTTCCATGA AGCGCGCCGG CGCCGACGCA ATCATCACCT ACCATGCGAA GGACGCTGCG
CGCTGGATCA TCGGAGGCCG TCATGGCCGC TGA
 
Protein sequence
MGFPAYRPRR MRANPAVRAF VRETRVEPGD LVYPVFVKPG AGVRDEVASM PGVFQLSIDQ 
LAAEVDELRS CRVNSLMLFG LPARKDERGS EAYDDRGVVQ QAVRAIKEHA PDFHVITDVC
LCEYTSHGHC GVLDERGGVD NDETLGLLAA EAVSHARAGA DMVAPSDMMD GRVGALRSAL
DEAGFSHVPI MAYAAKYASG YYGPFRDAAD SAPAFGDRSA YQMDPANSVE ALREVRLDIE
EGADLVIVKP ALSYLDVVRR VKDAFAFPTV AYNVSGEYAM VKAAAAQGWI DERRVVLETL
LSMKRAGADA IITYHAKDAA RWIIGGRHGR