Gene Arth_2226 details

Gene Information

Locus tag: Arth_2226
Symbol:
ID: 4445287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2502364
End bp: 2504103
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 64%
IMG OID: 639690035
Product: 2-isopropylmalate synthase
Protein accession: YP_831706
Protein GI: 116670773
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00970] 2-isopropylmalate synthase, yeast type
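
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2502364
end_bp = 2504103

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1740 0 579
```

Both results agree with the listed Gene Length (1740 bp) and Protein Length (579 aa).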


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.389024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAAACG CACAAAAGCC CTCCGGAATG CCCGTCCACC GCTACATGCC GTTCCAGGAC 
CAGATCACCG TTGAACTGCC TGACCGGACG TGGCCGGACA AAGTCATTAC GAAGGCCCCG
CGCTGGTGCG CCGTTGACCT CAGGGACGGC AACCAGGCCC TGATCGACCC CATGAGCCCG
GCCCGCAAGA TGAAGATGTT CGACCTGCTG GTCCGCATGG GTTACAAGGA AATCGAGGTG
GGCTTTCCCT CCGCCTCGCA GACTGACTTC GATTTTGTCC GTCAGCTGAT TGAAGGCAAC
CACATTCCGG ACGACGTCAC CATCCAGGTG CTGACGCAGG CCCGTGAGCA CCTGATCGAG
CGGACCTATG AATCCCTGGT TGGCGCCAAG CAGGCCATCG TGCACCTCTA CAACTCGACG
TCGGTCCTGC AACGCCGCGT GGTGTTCAAC CAGGACGAGG ACGGCATCCT GGACATTGCA
CTCCAGGGCG CCCGGCTGTG CAAGAAGTAC GAGGAAACGC TGGCGGACAC CCACATCACC
TACGAGTACT CACCGGAATC CTTTACCGGC ACGGAACTCG AATACGCTGT CCGCGTCTGC
AACGCAGTGG CCGATGTCTT TGAGGCCTCG GCTGACAGCC AGGTCATCAT CAACCTGCCG
GCCACTGTGG AAATGGCCAC CCCCAATGTC TACGCCGATT CGATCGAGTG GATGAGCCGG
CACCTGCACC CCCGCGAGGG CATCATCCTC TCGCTGCACC CGCACAACGA CCGCGGCACC
GGTGTTGCCG CGGCCGAGCT CGGTTACCTC GCCGGGGCGG ACCGGATTGA GGGCTGCCTG
TTCGGAAACG GCGAGCGGAC CGGAAACGTG GACCTGGTGA CGCTGGGCCT GAACATGTTC
GTCCAGGGCA TCGACCCCAT GATTGATTTC TCCGACATTG ACGACGTCCG CCGCACCGTG
GAGTACTGCA ACCAGCTGCC GGTGGCGGAG CGTTCGCCGT ATGGCGGCGA CCTTGTCTTT
ACGGCGTTCT CCGGCTCGCA CCAGGATGCC ATCAAGAAGG GCTTCGAAGC GCTCGAAAAG
GATGCGGCTG CCGCCGGCAA GGATGTGGCC GACTACACCT GGCAGGTTCC GTACCTGCCG
GTCGACCCCA AGGACCTGGG GCGCAGCTAC GAAGCCGTCA TCCGGGTCAA CTCGCAGTCC
GGTAAGGGCG GCGTGGCCTA CCTGCTGAAG AACGAGCACA GCCTGGACCT GCCGCGCCGC
GCACAGATCG AATTCTCCGG GGTCATCCAG AAGCGGACGG ACACCGTGGG CGGCGAGGTC
AGCGGCGCCC AGCTCTGGCA GATCTTCCAG GACGAATACC TGCCCTCCAG CAAGGAGGAC
GGCCAGTGGG GCCGGTATTC GCTGGGGTCG TTCAGCACGG AAACGGACGA CGACGGCGCC
ATGACCTTGC ATGCGACGGT CACCGTGGAC GGCGTCCAGG TCCGCCGCAC CGGCTCCGGC
AACGGTCCGA TCGCAGCGCT GCTGTCGATC CTCGGCCAGG ATGGCGTGGA CGTACGCGTC
CTGGACTACA GTGAGCACGC GCTCTCTGAA GGCGGCAACG CCCGTGCAGC CGCGTACGTT
GAATGCGCTG TCGGCGAGCG GGTGTTGTGG GGCGTGGGGA TCGACTCCAA CACCACCACC
TCCTCGCTGA AGGCCGTCAT TTCGGCCGTC AACCGTGCCA TCCGGGACGC GCAGGCTTAG
 
Protein sequence
MRNAQKPSGM PVHRYMPFQD QITVELPDRT WPDKVITKAP RWCAVDLRDG NQALIDPMSP 
ARKMKMFDLL VRMGYKEIEV GFPSASQTDF DFVRQLIEGN HIPDDVTIQV LTQAREHLIE
RTYESLVGAK QAIVHLYNST SVLQRRVVFN QDEDGILDIA LQGARLCKKY EETLADTHIT
YEYSPESFTG TELEYAVRVC NAVADVFEAS ADSQVIINLP ATVEMATPNV YADSIEWMSR
HLHPREGIIL SLHPHNDRGT GVAAAELGYL AGADRIEGCL FGNGERTGNV DLVTLGLNMF
VQGIDPMIDF SDIDDVRRTV EYCNQLPVAE RSPYGGDLVF TAFSGSHQDA IKKGFEALEK
DAAAAGKDVA DYTWQVPYLP VDPKDLGRSY EAVIRVNSQS GKGGVAYLLK NEHSLDLPRR
AQIEFSGVIQ KRTDTVGGEV SGAQLWQIFQ DEYLPSSKED GQWGRYSLGS FSTETDDDGA
MTLHATVTVD GVQVRRTGSG NGPIAALLSI LGQDGVDVRV LDYSEHALSE GGNARAAAYV
ECAVGERVLW GVGIDSNTTT SSLKAVISAV NRAIRDAQA
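
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCGAAACG CACAAAAGCC CTCCGGAATG"
print(translate(first_block))  # MRNAQKPSGM
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content reported in the record.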