Gene Sare_1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1149 
Symbol 
ID5704413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1299333 
End bp1300778 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content71% 
IMG OID641270667 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001536048 
Protein GI159036795 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000597548 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGTGGGAG TCACTTCTGA ACCGAGGACC CTGGCCGAGA AGGTCTGGGC CGCGCACGTC 
GTCCGATCCG CCGAGGGCGA GCCCGATCTG CTCTTCATCG ACCTGCACCT GCTCCACGAG
GTGACGAGCC CGCAAGCCTT CGATGGGCTG CGTCTCGCCG GCCGCCGGGT TCGCCGCCCC
GACCTGACGA TCGCGACTGA GGACCACAAC ACTCCCACCG GGTACGCGGA CCCGTCGTTT
CGGTCCCGGC GCGGTGACCT GCTGACCATC ACCGACCCCA CCTCTCGTAC CCAGATCGAG
ACGCTGCGCC GCAACTGCGC CGAGTTCGGT GTGCGGCTGC ACCCGCTGGG CGACAAGAAC
CAGGGAATCG TCCACGTCAT CGGTCCGCAG CTGGGCCTGA CCCAGCCCGG CATGACGATC
GTCTGCGGTG ACTCGCACAC CGCCACGCAC GGGGCTTTCG GCGCGCTGGC CTTCGGGATC
GGCACCAGCG AGGTCGAGCA CGTGCTCGCC ACCCAGACGT TGCCGCAGGC CCGTCCGAGG
ACGATGGCGG TCAACGTGGT CGGCGACCTC GCGCCGGGCG TCACCGCGAA GGACCTGGTG
CTCGCCCTGA TTGCCCAGGT CGGAACCGGT GGTGGCCGGG GGCACGTGGT GGAGTACCGG
GGTGAGGCGA TCCGGAAGCT GTCCATGGAA GGCCGGATGA CCATCGCCAA CATGTCCATC
GAGTGGGGTG CCAAAGCGGG CATGATCGCC CCCGACGAGA CCACGTTCGA CTACCTCAGG
GGGCGGCCCA ACGCCCCGGC CGGCACCGAC TGGGAGGCGG CGGTCGCGTA CTGGCGGACG
CTGACCACCG ACGCCGACGC GACCTTCGAC GCCGAGGTGA CTCTGGACGC GAGCCGGATC
ACACCGTTTG TGACCTGGGG TACCAACCCG GGGCAGGGTG CGCCGCTGGA CGCGAGCGTG
CCGCACCCGG ACGAGCTCGC CACCGAGCCG GAGCGGGCCG CCGCCCGCCG CGCCCTGGAG
TACATGGACC TCGCTCCGGG CACCGCGCTG CGCGACCTCG CCGTCGACGT GGTCTTCGTC
GGCTCCTGCA CCAACGGTCG GATCGAGGAC CTGCGCGCGG CCGCGGACGT GCTGCGTGGG
CACCGGGTGG CGCAGGGCGT ACGGATGCTC GTCGTGCCGG GCTCCGCCGT GGTGCGGGAA
AGCGCCGAGG CCGAAGGGCT GGACAAGATC TTTACCGAAG CGGGCGCCGA GTGGCGCTTC
GCGGGCTGCT CGATGTGTCT GGGGATGAAC CCCGACACGC TCCTCCCGGG TCAGCGTGCC
GCCTCGACCT CGAACCGTAA CTTCGAGGGC CGCCAGGGTC GGGGCGGGCG TACCCATCTG
GTTTCCCCGC CGGTCGCCGC CGCCACCGCC GTGACGGGCC GACTGGCCTC CCCCGCCGAT
CTGTAG
 
Protein sequence
MVGVTSEPRT LAEKVWAAHV VRSAEGEPDL LFIDLHLLHE VTSPQAFDGL RLAGRRVRRP 
DLTIATEDHN TPTGYADPSF RSRRGDLLTI TDPTSRTQIE TLRRNCAEFG VRLHPLGDKN
QGIVHVIGPQ LGLTQPGMTI VCGDSHTATH GAFGALAFGI GTSEVEHVLA TQTLPQARPR
TMAVNVVGDL APGVTAKDLV LALIAQVGTG GGRGHVVEYR GEAIRKLSME GRMTIANMSI
EWGAKAGMIA PDETTFDYLR GRPNAPAGTD WEAAVAYWRT LTTDADATFD AEVTLDASRI
TPFVTWGTNP GQGAPLDASV PHPDELATEP ERAAARRALE YMDLAPGTAL RDLAVDVVFV
GSCTNGRIED LRAAADVLRG HRVAQGVRML VVPGSAVVRE SAEAEGLDKI FTEAGAEWRF
AGCSMCLGMN PDTLLPGQRA ASTSNRNFEG RQGRGGRTHL VSPPVAAATA VTGRLASPAD
L