Gene Strop_1039 details


Gene Information

Locus tag: Strop_1039
Symbol:
ID: 5057485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1178731
End bp: 1179693
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 67%
IMG OID: 640473308
Product: branched-chain amino acid aminotransferase
Protein accession: YP_001157891
Protein GI: 145593594
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01122] branched-chain amino acid aminotransferase, group I
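
The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1178731, 1179693)
print(length_bp)                  # 963
print(protein_length(length_bp))  # 320
```

Both values match the Gene Length and Protein Length fields listed above.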


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGCT TGGACCGGGC AGCGTCGCCC GGACTGCGGC TCGGACGCTG GGCCTACCAC 
CGCGGCGAGT TCGTGCCGGC GTCTGATCCG CAGTTGCCGC TCAGCACGCA GGCTTTGCAC
TATGGGATCG GTGTATTCGA GGGGATTCGG GCGTACCGCA GCGCTGACGG CCTGTTTCTG
TTCCGTGCGT ACGACCACTA CGAGCGCATG CTGCGTGGCT GCCGAACGCT GAGGATTCCG
CTGCCGGGGA AGCCCGGCGA TCTGGTCGAC ATCACCGTCG AGCTGCTGCG GCGCAATGCT
CACGATGAAG ACGTGTACGT GCGGCCGGTG GCCTACAAGC TCTCGTTGCT GCCCGGCATG
CCACCTGGCG TATTCCTGAG CGGGCTGTCG GACGCCATGT CGATCATCAG CTACAGCCTG
CCGGTTGAGC GGCTCGGCCA GGGCGTGCGG TGTGGCATCA GCTCCTGGCG GCGCCCGCCA
CGCGACACAC TGCCCGCCCA GGCGAAGATC ACCGGAGGCT ATGTGACGAG CGCTTTCGCT
ACCGACGAGG CACGAGCAGG CGGGCAGGAC GACGCCATCC TGCTTGACCG CTCGGGCAAC
GTGGCCGAGG CGACAACCGC CAACGTCTTC ACGGTCCGCG ACGGTTGCCT GGTCACCCCC
CCGACAACCG GAGACCTGCT GCCCGGCATT ACCAGGGACA CGCTCCTCAC GCTCATCCGC
GAGGTGGGCC TGCCCGTCGC TGAGCGGTCG GTCAGCCCCG CCGAGCTGTT CTCCGCAGAC
GAGGTTTTCC TCTGTTCCAC CGGCAAAGGT GTGGTGCCGG TCATCGCCGT GGCAGGCCGT
GACGTCGGCA CCGGTGCGAT CGGCCCAGTT ACCGCCAAGG TCCGTGCGCT CTACGCGGCC
GCGACGACCA TGCCGGGCGG TATTCACGCG GACTGGCTTA CTCCCGTCAT GGAGACACCG
TGA
 
Protein sequence
MAGLDRAASP GLRLGRWAYH RGEFVPASDP QLPLSTQALH YGIGVFEGIR AYRSADGLFL 
FRAYDHYERM LRGCRTLRIP LPGKPGDLVD ITVELLRRNA HDEDVYVRPV AYKLSLLPGM
PPGVFLSGLS DAMSIISYSL PVERLGQGVR CGISSWRRPP RDTLPAQAKI TGGYVTSAFA
TDEARAGGQD DAILLDRSGN VAEATTANVF TVRDGCLVTP PTTGDLLPGI TRDTLLTLIR
EVGLPVAERS VSPAELFSAD EVFLCSTGKG VVPVIAVAGR DVGTGAIGPV TAKVRALYAA
ATTMPGGIHA DWLTPVMETP
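
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before they can be checked against the other fields. The following Python sketch shows one way to do that, compute a GC fraction, and translate the gene. The function names are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons (this gene starts with ATG, so that case does not arise here).

```python
from itertools import product

# Amino acids for the 64 codons, listed in TCAG order (NCBI layout).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGGCGGGCT TGGACCGGGC AGCGTCGCCC"))  # MAGLDRAASP
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same kind of check against the GC content and Protein Length fields.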