Gene Acid345_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4300 
Symbol 
ID4071873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5111600 
End bp5112631 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content59% 
IMG OID637986333 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_593374 
Protein GI94971326 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTC CCATCAGCCG TCCCCGTCGC CTCCGCAAGA ATGAAGCTTT CCGTTCGCTG 
GTTCGCGAGA CCCGCCTATC GCCGGCCGGG TTCGTTTACC CGCTATTCGT GTGCCCGGGT
GAGGGCGTGC GCAAAGAAGT CCGCTCCATG CCGGGAGTCT TCAATCTGTC GGTAGACGAG
GCAGTGAAGG AAGCACAAGA GGTCAAATCG CTCGGCATTC CGTCGGTCAT TTTGTTCGGA
CTTCCGGAGA GCAAGGACGA GCAGGCAACC GGCGCATGGG CTGAAGATGG CATCGTGCAG
CAGGCGGCGC GTGTAATTAA GCGCGAAGTG CCAGGCTTGC TGCTGATGGG CGACGTTTGT
CTCTGCGAAT ACATGTCGCA CGGGCATTGC GGTATTGTGC AGAAGACCGC CACCAACCGC
TCCGTCGGCG CGGCGTCCAC TGCGCAGATG AGCGGCGTGG ATGAATACGA AATTCTTAAC
GACGAATCAC TCGATATCCT GGCCAAGACC GCCGTCTCGC AGGCTCGCGC GGGCATGGAT
ATCATTGCCC CCAGCGACAT GATGGACGGC CGGGTCGCCG CCATTCGCGA CGCTCTCGAC
GATGAAGGCT TCGAGAACAT CCCGATCTTG GCCTATGCGG CGAAGTTTGC TTCCGGCTTC
TACGGGCCAT TCCGAGAAGC CGCGGACTCA GCCCCTGCCT TCGGCGATCG CCGCTCTTAC
CAAATGGATG GCGCTAACCT CCGCGAAGCC ATGATCGAAA TCGAACTCGA CCTTGAAGAG
GGCGCAGACA TGATTATGGT GAAGCCGGCG ATGCCCTATC TTGACGTCAT CTCGGAAGCG
CGCCGACGTT ACGACGTGCC GCTCGCCGCT TACCAGGTCA GCGGCGAATA CGCCATGATC
AAGGCCGCCG CGCAGAACAA CTGGATCGAT CACGATCGCG TAATGCTGGA ATCGTTGCAA
AGCATTCAGC GCGCCGGGGC GTCGATCATC TTGACTTACT TTGCGAAAGA TGTGGCGAAG
ATCCTCGGTT AG
 
Protein sequence
MSFPISRPRR LRKNEAFRSL VRETRLSPAG FVYPLFVCPG EGVRKEVRSM PGVFNLSVDE 
AVKEAQEVKS LGIPSVILFG LPESKDEQAT GAWAEDGIVQ QAARVIKREV PGLLLMGDVC
LCEYMSHGHC GIVQKTATNR SVGAASTAQM SGVDEYEILN DESLDILAKT AVSQARAGMD
IIAPSDMMDG RVAAIRDALD DEGFENIPIL AYAAKFASGF YGPFREAADS APAFGDRRSY
QMDGANLREA MIEIELDLEE GADMIMVKPA MPYLDVISEA RRRYDVPLAA YQVSGEYAMI
KAAAQNNWID HDRVMLESLQ SIQRAGASII LTYFAKDVAK ILG