Gene Sare_1130 details


Gene Information

Locus tag: Sare_1130
Symbol:
ID: 5703761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1280273
End bp: 1281370
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 70%
IMG OID: 641270645
Product: branched-chain amino acid aminotransferase
Protein accession: YP_001536029
Protein GI: 159036776
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01123] branched-chain amino acid aminotransferase, group II
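
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1280273
end_bp = 1281370
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                        # 1098 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 365 True
```

Under these assumptions the reported gene length (1098 bp) and protein length (365 aa) agree with the coordinates.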


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.4266
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000123663
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGGTG GTGACAAGCT CGACTTCGAG ATCCGTCCGA ATCCCGCGCC GGTATCCGCC 
ACGGACCGGG CCGCGCTGCT GGCCGACCCG GGCTTCGGGC GGGTCTTCAC CGACCACATG
GTCACCATCC GCTATGCCGC CGGCAAGGGC TGGTACGACG CGCGGGTCGA GGCGCGGGCG
CCGATCCCGA TGGACCCGGC CGCCGCGGTC CTGCACTACG CCCAGGAGAT CTTCGAGGGC
ATGAAGGCGT ACCGGACCGT CAGTGGTGGC GTGACCATGT TCCGGCCGTA CGCCAACGCG
GCCCGGTTCG CCGCGTCCGC CCGGCGGATG GCAATGCCCA CGCTGCCCGA GTCGGTGTTC
GTCGATTCCC TGCGCCGGCT GATCGAGGTC GACCGGGAGT GGATTCCCGA GGGTGAGGAC
GGCAGCCTCT ACCTGCGGCC GTTCATGTTC GCCAGCGAGG TCTTCCTGGG TGTGCGGCCC
GCCAACGAAT ACCTGTACGC GGTGATCGCC TCCCCGGTCG GCGCGTACTT CTCCGGTGGG
GTGAAGCCGG TCACCGTCTG GGCCTCGCCG GACTACACCC GGGCCGCGCC CGGTGGCACC
GGCGCCGCCA AGTGCGGCGG CAACTACGCC AGTTCGTTGG TCGCCCACGC GGAGGCCCTT
GAGCACGGCT GCGACCAGGT CGTCTTCCTG GACGCGGTGG AGCGCCGCTT CGTCGACGAA
CTGGGTGGCA TGAACCTGTT CTTCGTCTAC GACGACGGTA CTCTGGTCAC CCCGCCGCTG
ACCGGCACCA TCCTGCCCGG CATCACCCGG GAGTCGGTGC TCGCGCTCGC CGCCGAGGCC
GGCCACCAGG TGGCGGAGCA GCCGATCGCC TTCACCGACT GGCAGGCCGA CGCGGCGAGC
GGCCGCCTGC GTGAGGTCTT CGCCTGCGGA ACGGCCGCGG TGATCACGCC GGTCGGCGCG
GTCCGTTCCC CCGACGGCGA GTTCCGCATC GGCGGCGGTG AGCCTGGCCG GGTCACCATG
GCGTTGCGTC AGCAGCTCGT CGACATCCAA CGTGGCAAGG CCGCAGATCC ATACAACTGG
GCCCACCACG TGCTCTGA
 
Protein sequence
MSGGDKLDFE IRPNPAPVSA TDRAALLADP GFGRVFTDHM VTIRYAAGKG WYDARVEARA 
PIPMDPAAAV LHYAQEIFEG MKAYRTVSGG VTMFRPYANA ARFAASARRM AMPTLPESVF
VDSLRRLIEV DREWIPEGED GSLYLRPFMF ASEVFLGVRP ANEYLYAVIA SPVGAYFSGG
VKPVTVWASP DYTRAAPGGT GAAKCGGNYA SSLVAHAEAL EHGCDQVVFL DAVERRFVDE
LGGMNLFFVY DDGTLVTPPL TGTILPGITR ESVLALAAEA GHQVAEQPIA FTDWQADAAS
GRLREVFACG TAAVITPVGA VRSPDGEFRI GGGEPGRVTM ALRQQLVDIQ RGKAADPYNW
AHHVL
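
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table in the NCBI TCAG ordering, which for elongation codons is the same in translation table 11 as in the standard code. Table 11 also permits alternative start codons, but this gene starts with ATG, so the sketch does not handle them. The function name `translate` and the two sequence fragments (the first 30 and last 18 bases of the gene sequence) are chosen for illustration only.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAGCGGTG GTGACAAGCT CGACTTCGAG"))  # MSGGDKLDFE
print(translate("GCCCACCACG TGCTCTGA"))               # AHHVL
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to the same function should, under the same assumptions, yield the full 365-residue protein.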