Gene Strop_3525 details

Gene Information

Locus tag: Strop_3525
Symbol:
ID: 5059999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4041912
End bp: 4042967
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 70%
IMG OID: 640475779
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_001160334
Protein GI: 145596037
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
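
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4041912, 4042967)
print(length, protein_length(length))  # 1056 351
```

Both values match the Gene Length and Protein Length fields of the record.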


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCGC TCTACCTCCA GGATGTGACG CTGCGGGACG GGATGCACGC CATCGCCCAC 
CGCTACACCG TTGACCAGGT GCGGACGATC GCCGCCGCCC TCGACGCCGC CGGAATCGCC
GCCATCGAGG TGGCGCACGG TGACGGGCTG GCCGGGTCGA GTGTCAACTA CGGGCACGGC
GCGGCGAGCG ACGCCGAGTG GATCGCGGCG GCGGCCGAGG TACTGACCAC GGCGCGGCTG
ACCACGCTGC TGGTCCCCGG AATCGGCACC ATCGCCGACC TGAAGGCCGC GCGGGCCCTC
GGCGTGACCA GCGTTCGGAT CGCCACCCAC TGCACCGAGG CCGACATCTC CGCACAACAC
ATCGGCTGGG CTCGGGAGAA CGGGATGGAC GTCTCCGGGT TCCTGATGAT GGCCCACATG
AACGATCCGG TTGACCTGGC AGCGCAGGCC AAGCTCATGG AGTCGTACGG GGCGCACTGC
GTCTACGTCA CCGACTCCGG CGGCCGGCTC CTGATGTCCG ACGTGGCCGA GCGGATCGAC
GCCTACCGTC AGGTACTCGA ACCGGAGACG CAGATCGGCA TCCACGCCCA CCACAACCTC
TCGCTCGGTG TGGCCAACAG CGTGGTCGCG GTGGAGCACG GCCGGATCCT CGGGGACGGG
CCGCTGGGCA CACCGGTTGG CCGAACCGTC CGGGTCGACG CCTCCCTCGC CGGGCAGGGT
GCGGGCGCGG GTAACGCACC GCTGGAGGTC TTCGTCGCGG TCGCCGAGTT GCACGGCTGG
GAGCACGGCT GCGACGTCTT CGCGCTGATG GACGCGGCCG AGGACGTGGT CCGGCCGTTG
CAGGACCGAC CGGTGCGGGT GGACCGGGAG ACGCTCTCGC TGGGCTACGC CGGGGTCTAC
TCCAGCTTCC TGCGGCACGC TGAGCGGGCC GCCGAACGCT ACGGCGTGGA CGTCCGCTCG
ATTCTGGTCG AGCTGGGCCG GCGCCAGATG GTCGGTGGCC AGGAGGACAT GATCGTGGAT
GTGGCGCTGG ACCTGGCTGG CAAGGAGGAG ACGTGA
 
Protein sequence
MTPLYLQDVT LRDGMHAIAH RYTVDQVRTI AAALDAAGIA AIEVAHGDGL AGSSVNYGHG 
AASDAEWIAA AAEVLTTARL TTLLVPGIGT IADLKAARAL GVTSVRIATH CTEADISAQH
IGWARENGMD VSGFLMMAHM NDPVDLAAQA KLMESYGAHC VYVTDSGGRL LMSDVAERID
AYRQVLEPET QIGIHAHHNL SLGVANSVVA VEHGRILGDG PLGTPVGRTV RVDASLAGQG
AGAGNAPLEV FVAVAELHGW EHGCDVFALM DAAEDVVRPL QDRPVRVDRE TLSLGYAGVY
SSFLRHAERA AERYGVDVRS ILVELGRRQM VGGQEDMIVD VALDLAGKEE T
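
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as starts); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the standard ordering of bases T, C, A, G.
# Table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGACCCCGC TCTACCTCCA GGATGTGACG"))  # MTPLYLQDVT
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence would confirm that the two listings agree; `gc_fraction` on the same input can be compared with the GC content field.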