Gene Daci_5235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_5235 
Symbol 
ID5750847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp5801145 
End bp5802611 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content67% 
IMG OID641300360 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001566249 
Protein GI160900667 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.71536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCAA AAATTTCCCC ACGACACCAA CCATTCGAGA ACGCCATGGG ACGCACCCTC 
TACGACAAGA TCTTCGACGA GCACGTCATC CACACCGAGG ATGACGGCAC CGCCGTCCTC
TACATCGACC GCCACCTGGT GCACGAAGTC ACCAGCCCCC AGGCCTTCGA AGGCCTTCGC
ATGGCCGGCC GCAAGCTGTG GCGCATCAGC TCCGTGGTCG CCACGGCCGA CCACAACACC
CCCACCGACG GCTGGGAGCG CGGCTATGAC GGCATTGCCG ACCCCATCAG CAAGGAACAG
ATCACCACGC TGAACACCAA CATCGGCGAG TTCGGCTCTG CGGCCTTCTT CCCCTTCATG
GACAAGGGTC AGGGCATCGT CCACGTCATG GGCCCCGAGC AGGGCGCCAC CCTGCCCGGC
ATGACCGTGG TCTGCGGCGA CAGCCACACG TCCACGCACG GCGCCTTCGG CGCACTGGCC
CACGGCATCG GCACGTCCGA GGTCGAGCAC GTGATGGCCA CCCAGACCCT GCTGGCCAAG
AAGGCCAAGA ACATGCTGAT CAACGTCAGC GGCAAGGCGG CCCCCGGCAT CACGGCCAAG
GACATCGTGC TGGCCATCAT CGGCAAGATC GGCACGGCCG GCGGCACGGG CTACACCATC
GAGTTCGCCG GCGAAGCCAT CCGTGACCTG AGCATGGAAG GCCGCATGAC GGTCTGCAAC
ATGGCCATCG AAGGCGGCGC GCGCGCAGGC CTGGTGGCCG TGGACGACAA GACCATCCAG
TACGTCAAGG GCCGCCCGCT GGCCCCCACT GGCGCCGAAT GGGATGCCGC CGTCTCCTAC
TGGAAGACGC TGCACTCCGA TGCCGATGCG AAGTTCGACC GCGTGGTCGA GCTGCAGGCC
AGCGAGATCG TGCCGCAGGT CACCTGGGGC ACCTCGCCCG AGATGGTGCT GGGCGTGGAT
GCCCGCGTGC CGGACCCCGA CAAGGAAAAG GATGCCAGCA AGCGCGGCGC CATCGAACGC
GCGCTGACCT ACATGGCGCT GGAGCCCGGC AAGGCGATCG ACGACATCTT CGTGGACAAG
GTCTTCATCG GCTCGTGCAC CAACAGCCGC ATCGAAGACA TGCGCGAGGC CGCCGCCGTG
GTCAAGAAGC TGGGCCAGAA GGTGGCGCGC AACATCAAGC TGGCCATGGT CGTGCCCGGC
TCCGGACTGG TCAAGGAACA GGCCGAGCGC GAAGGCCTGG ACCAGATCTT CAAGGCGGCA
GGCTTTGAAT GGCGCGAGCC CGGCTGCTCC ATGTGCCTGG CCATGAATGC CGACCGCCTG
GAGCCCGGCG AGCGCTGCGC CTCCACCAGC AACCGCAACT TCGAAGGCCG CCAGGGCGCG
GGCGGACGCA CCCACCTGGT GAGCCCCGCC ATGGCTGCAG CCGCCGCCAT CCACGGCCAC
TTCGTGGACA TCCGCAGGTT CTCCTGA
 
Protein sequence
MPPKISPRHQ PFENAMGRTL YDKIFDEHVI HTEDDGTAVL YIDRHLVHEV TSPQAFEGLR 
MAGRKLWRIS SVVATADHNT PTDGWERGYD GIADPISKEQ ITTLNTNIGE FGSAAFFPFM
DKGQGIVHVM GPEQGATLPG MTVVCGDSHT STHGAFGALA HGIGTSEVEH VMATQTLLAK
KAKNMLINVS GKAAPGITAK DIVLAIIGKI GTAGGTGYTI EFAGEAIRDL SMEGRMTVCN
MAIEGGARAG LVAVDDKTIQ YVKGRPLAPT GAEWDAAVSY WKTLHSDADA KFDRVVELQA
SEIVPQVTWG TSPEMVLGVD ARVPDPDKEK DASKRGAIER ALTYMALEPG KAIDDIFVDK
VFIGSCTNSR IEDMREAAAV VKKLGQKVAR NIKLAMVVPG SGLVKEQAER EGLDQIFKAA
GFEWREPGCS MCLAMNADRL EPGERCASTS NRNFEGRQGA GGRTHLVSPA MAAAAAIHGH
FVDIRRFS