Gene Caci_8049 details


Gene Information

Locus tag: Caci_8049
Symbol:
ID: 8339427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 9341223
End bp: 9342629
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 72%
IMG OID: 644961134
Product: 3-isopropylmalate dehydratase, large subunit
Protein accession: YP_003118713
Protein GI: 256397149
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
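
The start and end coordinates, the gene length, and the protein length listed in this record agree with one another, and the arithmetic can be checked directly. The following is a minimal sketch. It assumes that the start and end positions are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Coordinates copied from the Gene Information fields of this record.
START_BP = 9341223
END_BP = 9342629


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming a single terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")                  # 1407 bp
    print(protein_length(n_bp), "aa")  # 468 aa
```

Under these assumptions the computed values match the listed gene length of 1407 bp and protein length of 468 aa.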


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.200463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCACA CCCTGGCCGA GAAGGTGTGG GCGGACCATG TGGTCCGCTC GGCCGACGGC 
GAGCCCGACC TGCTCTACAT CGACCTCCAC CTGCTGCACG AGGTGACCTC GCCGCAGGCC
TTCGACGGAC TCCGCCAGGC CGGCCGGACC GTGCGCCGCC CCGACCTCAC GATCGCCACC
GAGGACCACA ACACCCCGAC GCTGGAGATC GACAAGCCGA TCGCCGACCC GGTGTCCCGG
CTGCAGATCG AGACGCTGCG CAAGAACTGC GCCGACTTCG GCGTGCGCAT CCACTCCCTG
GGCGACGCCC AGCAGGGCAT CGTGCACGTG GTCGGGCCGC AGCTGGGCCT GACCCAGCCC
GGAATGACCG TGGTGTGCGG CGACTCACAC ACCTCCACCC ACGGCGCCTT CGGCGCGCTG
GCGTTCGGCA TCGGCACCAG CGAGGTCGAG CACGTGCTGG CGACCCAGAC GCTGCCGCTG
AAGCCGTTCA AGACGATGGC CGTCACGGTC ACCGGCGCGC TCGGCGAGGG CGTCACCGCC
AAGGACGTGG TGCTCGCCGT CATCGCGCAG ATCGGCACCG GCGGCGGGCA GGGCTACGTC
ATCGAGTACC GCGGCGAGGC CGTCGAGAAG CTGTCGATGG AGGGCCGGAT GACGGTCTGC
AACATGTCCA TCGAGGCCGG CGCCCGCGCC GGGATGATCG CCCCGGACGA GACCACGTTC
GCGTATCTGA AGGGCCGCGC GCACGCGCCC TCCGGCGCCG ACTGGGACGC CGCGGTGGAG
TACTGGAAGA CGCTGCGCAC CGAAGACGGC GCGACCTTCG ACGCCGAGGT GGTCATCGAC
GGCGACGCGC TGACCCCGTA CGTCACCTGG GGCACCAACC CCGGCCAAGG ACTGCCGCTG
TCGGCGTCGG TCCCGGACCC GGACAAGGAC TTCGGCGCCG AGGTCGACAA GGTCGCGGCG
CGCAAGGCGT TGGAATACAT GGGGCTGCAG GCCGGCACGC CGCTGCGCGA GATCAAGGTG
GACACGGTCT TCCTCGGCTC GTGCACCAAC GGCCGGCTGG AGGACCTGCG CGCCGCCGCC
GCGGTCATCA AGGGCCGCCA GGTCGCCGAC GGCGTCCGGA TGCTGGTGGT GCCGGGCTCG
GCGCGGGTGC GGCTGGAGGC CGAGGCCGAG GGTCTGCACG AGGTGTTCAC CGCCGCCGGC
GCCGAGTGGC GGTTCGCGGG CTGCTCGATG TGCCTGGGCA TGAACCCCGA CCAGCTGGCT
CCCGGCGAGC GCTCGGCGTC CACGTCCAAC CGCAACTTCG AGGGGCGGCA GGGCAAGGGC
GGGCGCACGC ACCTGGTGTC GCCGCTGGTG GCCGCCGCGA CGGCGGTCCG CGGAACCCTG
TCGTCCCCGT CGGATCTGGC CGCCTGA
 
Protein sequence
MPHTLAEKVW ADHVVRSADG EPDLLYIDLH LLHEVTSPQA FDGLRQAGRT VRRPDLTIAT 
EDHNTPTLEI DKPIADPVSR LQIETLRKNC ADFGVRIHSL GDAQQGIVHV VGPQLGLTQP
GMTVVCGDSH TSTHGAFGAL AFGIGTSEVE HVLATQTLPL KPFKTMAVTV TGALGEGVTA
KDVVLAVIAQ IGTGGGQGYV IEYRGEAVEK LSMEGRMTVC NMSIEAGARA GMIAPDETTF
AYLKGRAHAP SGADWDAAVE YWKTLRTEDG ATFDAEVVID GDALTPYVTW GTNPGQGLPL
SASVPDPDKD FGAEVDKVAA RKALEYMGLQ AGTPLREIKV DTVFLGSCTN GRLEDLRAAA
AVIKGRQVAD GVRMLVVPGS ARVRLEAEAE GLHEVFTAAG AEWRFAGCSM CLGMNPDQLA
PGERSASTSN RNFEGRQGKG GRTHLVSPLV AAATAVRGTL SSPSDLAA
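
The gene and protein sequences can be cross-checked by translating the nucleotide sequence with translation table 11 and by computing the GC fraction. The following is a hedged sketch rather than the method used to build this record. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The sketch relies on the fact that table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, listed in TCAG order.
# These match the standard code; table 11 differs in its permitted start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = ("ATGCCCCACA CCCTGGCCGA GAAGGTGTGG "
                  "GCGGACCATG TGGTCCGCTC GGCCGACGGC")
    print(translate(first_line))  # MPHTLAEKVWADHVVRSADG
    print(f"{gc_content(first_line):.0%}")
```

The translated first line matches the first twenty residues of the protein sequence. Passing the full gene sequence to the same functions would allow a comparison against the listed protein sequence and GC content.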