Gene Sare_1133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1133 
Symbol 
ID5703764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1283032 
End bp1284615 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content69% 
IMG OID641270648 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001536032 
Protein GI159036779 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.740048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000205142 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGTTCC AGGTATACGA CACGACCCTG CGCGACGGCG CCCAGCGTGA GGGGGTCAGC 
TATTCGGTCG TCGACAAACT CGCGGTGGCC CGGCTTCTCG ACGAGTTCGG CGTCGGTTTC
ATCGAAGGAG GATGGCCGGG CGCAGTACCG AAGGACACCG AGTTCTTCCG TCGCGCGCGC
ACCGACCTCG ACCTGAACCA CGCGGTGCTG GTGGCCTTCG GCGCCACCCG TCGCGCCGGT
CTGGGCGTCG ACAACGACCC GCAGGTGCAC AGCCTGCTCG CCGCCGAGAC CCCCGTCGTC
ACGCTGGTCG CCAAGGCCGA CCTGCGGCAC GTGCAGCGGG CCCTGCGTAC CACCGCGGAC
GAGAACCTGG CGATGATCCA TGACACCGTG ACGTACCTGG TGGCCGAGGG CCGGCGGGTG
TTCGTTGACG GGGAGCACTT CTTCGACGGC TACCGTGACG ATCCCACGTA CACCTCGTCG
GTGGTCGAGG CCGCGCTCGC CGCCGGTGCG GAGCGGTTCG TGCTGTGCGA CACCAATGGT
GGCATGCTTC CGTCCCAGGT CACCGCCGTG ATCGCTGACC TCACCGCCCG ACTCGGCGTG
GCACCCGAGC AACTCGGCAT CCACGGCCAG GACGACACCG CCTGTGCCGT GGCGAACACC
ATCGCCGCGG TGGAGGCGGG CGTACGGCAC GTGCAGGGCA CCGCCAACGG GTACGGCGAG
CGACCCGGTA ACGCCGACCT CTTCGCGGTG GTGGCGAACC TTCAACTCAA GCTCGGGCTG
CCAGTCCTAC CAGCGGGCTG CCTGGAACGG ATGGTGCGGG TGTCCCGCGG CATCGCCGAC
ATCGCCAACA TCGCACCCGA CGACCACCAG GCGTACGTCG GTGCCGCGGC CTTCGCCCAC
AAGGCGGGGC TGCACGCGAG CGCGATCAAG GTGGATCCGT TGTTCTACAA CCATGTGGAC
CCGCAGGCGG TGGGAAACCA CATGCGCATT CTCGTCACCG AGATGGCCGG ACGGGCCAGC
ATCGAACTCA AGAGCCGCGA GCTGGGTCTC GCACTGGCCG ACCACCCGGA GGCCCTGAAC
CGGGTCACCA AGCGGGTCAA GGACCTGGAG GCCGCCGGCT GGTCGTTCGA GGCGGCGGAT
GCCTCCTTCG AACTTCTGGT CCGCTCCGAA CTGCCCGACG GGGCGCCGGC GCGACCGTTC
ACCCTCGAGT CCTACCGCAT CCTGGTCGAG CACCGGGAGG ACAACGCGGT GGTCTCCGAG
GCGACGGTAA AGATCCGGAT AGGCGGTGAG CGGATGATCG CCACCGCGGA GGGCAACGGC
CCGGTGAACG CGCTCGACGA GGCGTTACGC GTCGGCCTCG CCCGGCACTA TCCGGAGCTG
CGTGACTTCG AGTTGGCCGA CTACAAGGTG AGGATTCTGG AGGGCAGCCA CGGCACCGGG
GCGGTGACCA GGGTGCTGGT GGAGACGGCC GATGGGGCCG GCCGGGACTG GACCACGGTG
GGCGTGCATC CCAACGTGGT GGAGGCGAGT TGGGGCGCAC TGGTCGACGC CCTGACCTAC
GGGCTGGATC GGGCCCGGAC CTGA
 
Protein sequence
MTFQVYDTTL RDGAQREGVS YSVVDKLAVA RLLDEFGVGF IEGGWPGAVP KDTEFFRRAR 
TDLDLNHAVL VAFGATRRAG LGVDNDPQVH SLLAAETPVV TLVAKADLRH VQRALRTTAD
ENLAMIHDTV TYLVAEGRRV FVDGEHFFDG YRDDPTYTSS VVEAALAAGA ERFVLCDTNG
GMLPSQVTAV IADLTARLGV APEQLGIHGQ DDTACAVANT IAAVEAGVRH VQGTANGYGE
RPGNADLFAV VANLQLKLGL PVLPAGCLER MVRVSRGIAD IANIAPDDHQ AYVGAAAFAH
KAGLHASAIK VDPLFYNHVD PQAVGNHMRI LVTEMAGRAS IELKSRELGL ALADHPEALN
RVTKRVKDLE AAGWSFEAAD ASFELLVRSE LPDGAPARPF TLESYRILVE HREDNAVVSE
ATVKIRIGGE RMIATAEGNG PVNALDEALR VGLARHYPEL RDFELADYKV RILEGSHGTG
AVTRVLVETA DGAGRDWTTV GVHPNVVEAS WGALVDALTY GLDRART