Gene TM1040_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1154 
Symbol 
ID4075940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1244983 
End bp1246611 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content63% 
IMG OID638006459 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_613149 
Protein GI99080995 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.981726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0493285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGAAC GGCTCTATCT TTATGACACC ACCCTGCGCG ACGGTCAGCA GACCCAAGGG 
GTGCAATTCT CCACGCTTGA AAAAAAGCAG ATCGCCCAGG CCCTTGACGA CCTAGGTGTG
GACTATATCG AAGGCGGCTG GCCGGGGGCC AACCCCACCG ACAGCGCCTT TTTCGATGAG
GCGCCCAAGA CCCGCGCGAC GCTCACCGCC TTTGGCATGA CCAAGCGGGC AGGACGTTCG
GCGGCAAATG ACGATGTGCT GGCGGCTGTG ATGAACGCAG GCACCCGGTC GGTCTGTCTT
GTGGGCAAGT CGCATGATTA TCACGTAACG ACCGCGCTTG GGATCAGCCT TGACGAGAAC
CTCGACAACA TCGCGCAGTC GGTGGCGCAT ATCGTGGCCG AGGGGCGCGA GGCGCTGTTT
GATGCAGAGC ATTTCTTTGA TGGCTACAAG GCGAACCCGG AGTATGCGCT GGCGGCCTGC
CGTGCCGCGC TCGATGCGGG CGCGCGCTGG GTTGTTCTGT GTGATACAAA TGGTGGCACG
CTGCCGCATG AGGTGTCTCG GATCGTCACT GAGGTGATTG CTGCGGGGCT GCCGGGCGAG
AAGCTCGGCA TTCACACCCA CAATGACACC GAAAACGCGG TGGCCTGTTC GCTGGCAGCG
GTGGACGCCG GGGTGCGCCA GATCCAGGGG ACGCTGAATG GGCTGGGGGA GCGCTGCGGC
AACGCCAATC TCACCACGCT GATCCCGACG CTGCTGCTGA AGGAGCCCTA CGCGAGCGGC
TATGAGACAG GCGTGCCAAA AGGCGCGCTG GCCCGGCTCA CCAAGGTCAG CCGGATGCTT
GATGAGATCC TCAACCGGGT GCCGCAGAAG CAGGCCGCCT ATGTGGGGGC CTCGGCCTTT
GCCCATAAGG CGGGCCTGCA CGCCAGCGCG ATCCTCAAGG ACCCGTCAAC CTATGAGCAT
ATCGACCCGG CTGTGGTCGG CAACGGCCGC ATCATTCCGA TGTCCAATCA GGCCGGGCAG
TCGAACCTGC GCCGTCGTCT GGCCGAGGCG GGCCTCACGG TTGAAAAAGG CGACCCTGCG
TTGGGCCGTA TTCTGGAGCG GGTGAAGGAA CGCGAGACCG AGGGTTACTC CTATGACACC
GCGCAGGCCT CGTTCGAGCT GCTCGCGCGC GAAGAACTGG GCCAGATGCC AGAGTTTTTC
GAAGTGAAGC GCTACAAGGT CACCGTGGAG CGGCGCAAGA ACAAGTACAA CAAGATGGTG
AGCCTCTCCG AGGCGGTGGT CGTGGTCAAG GTCGACGGTG AAAAGCTGCT CTCGGTGAGT
GAATCGCTGG ATCCCTCGGG CAGCGACCGT GGACCGGTGA ACGCGCTCGC AAAGGCGTTG
CGCAAGGATT TGGGGCGCTA TTCCAAGGTG TTGGATGATA TGCGTCTGGT CGACTTCAAG
GTGCGGATCA CCCAGGGGGG CACCGAGGCC GTGACCCGCG TGATCATCGA CAGCGAGGAC
GGCAAGGGGC GGCGCTGGTC CACGGTTGGG GTCAGTGCCA ACATATTGGA TGCCTCGTTC
GACGCGCTGC TTGATGCGAT GAACTGGAAG CTTCTGCGGG ATGCATCTGA GGCAGCGGAT
GCAGTTTGA
 
Protein sequence
MTERLYLYDT TLRDGQQTQG VQFSTLEKKQ IAQALDDLGV DYIEGGWPGA NPTDSAFFDE 
APKTRATLTA FGMTKRAGRS AANDDVLAAV MNAGTRSVCL VGKSHDYHVT TALGISLDEN
LDNIAQSVAH IVAEGREALF DAEHFFDGYK ANPEYALAAC RAALDAGARW VVLCDTNGGT
LPHEVSRIVT EVIAAGLPGE KLGIHTHNDT ENAVACSLAA VDAGVRQIQG TLNGLGERCG
NANLTTLIPT LLLKEPYASG YETGVPKGAL ARLTKVSRML DEILNRVPQK QAAYVGASAF
AHKAGLHASA ILKDPSTYEH IDPAVVGNGR IIPMSNQAGQ SNLRRRLAEA GLTVEKGDPA
LGRILERVKE RETEGYSYDT AQASFELLAR EELGQMPEFF EVKRYKVTVE RRKNKYNKMV
SLSEAVVVVK VDGEKLLSVS ESLDPSGSDR GPVNALAKAL RKDLGRYSKV LDDMRLVDFK
VRITQGGTEA VTRVIIDSED GKGRRWSTVG VSANILDASF DALLDAMNWK LLRDASEAAD
AV