Gene Hmuk_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1109 
Symbol 
ID8410628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1061136 
End bp1062074 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content65% 
IMG OID645019445 
Productbranched-chain amino acid aminotransferase 
Protein accessionYP_003176943 
Protein GI257387170 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID[TIGR01122] branched-chain amino acid aminotransferase, group I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000646452 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0889333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC GCCCGGAGAT GTTCGAAGGG GTCGAGACGA TCTGGCAGGA CGGCGAGTTC 
GTCGACTTCG AGGACGCACA GGTCCACGTC CTGACTCACG CGCTACACTA CGGCACCGGC
GTCTTCGAGG GCGTGCGCTG TTACGACACC GACGACGGCC CGGCGATCTT CCGCTGGGAG
GAACACCTCG ACCGACTGTA CGAGTCGGCG AAGGTCTACG ACATCGAGAT TCCCTACACG
CCGGAAGAAC TCACCGAGGC GACCGTCGAG CTGATCCGTC GGGAGGGGCT GGAGTCGTGT
TACATCCGCC CGATCGCCTT CTACGGCTAC GGCCCGCTGG GGCTCAACCC GAGCGAGTCG
CCAGTCGAGA CCGCCATCGG CGTCTGGCCG TGGGGTGCCT ACCTCGGCGA GGAAGCGCTC
GAAGAGGGTA TCGACACGGC GATCTCCTCG TGGCGCAAGC ACGCCTCCAG CCAGATCCCG
ACCAGCGCGA AGACGACGGG AGCCTACGTC AACAGCGTCC TCGCCTCGCT GGAGGCAAAG
GGCAACGACT ACGGCGAGGC GATCGTCCTC AACAAGGAGG GCAACGTCGC CGAGGGGCCG
GGCGAGAACA TCTTCCTCGT CCGCGACGGC GAGATCTACA CGACCGGGCT GGCCGAGTCG
ATCCTCGACG GCATCACCCG CCGCAGCGTC ATTACCCTCG CCGAGGATCT GGGCTACACC
GTCCACGACG ACGCCACCAT CTCGCGGGGC GAGCTGTACG CCGCCGACGA ACTGTTCTTC
ACCGGCACCG CCGCCGAGGT GACGCCGATC CGCAGCGTCG ACGACAACGA GATCGGTGCC
GGAACGAAGG GGCCGGTCAC CGACGAGATC CAGACGCGGT TTTTCGACAT CCTCGAAGAG
CCACCCGAGC AGTACGACGA CTGGTTCAAG TACGTCTGA
 
Protein sequence
MSERPEMFEG VETIWQDGEF VDFEDAQVHV LTHALHYGTG VFEGVRCYDT DDGPAIFRWE 
EHLDRLYESA KVYDIEIPYT PEELTEATVE LIRREGLESC YIRPIAFYGY GPLGLNPSES
PVETAIGVWP WGAYLGEEAL EEGIDTAISS WRKHASSQIP TSAKTTGAYV NSVLASLEAK
GNDYGEAIVL NKEGNVAEGP GENIFLVRDG EIYTTGLAES ILDGITRRSV ITLAEDLGYT
VHDDATISRG ELYAADELFF TGTAAEVTPI RSVDDNEIGA GTKGPVTDEI QTRFFDILEE
PPEQYDDWFK YV