Gene Amir_4844 details


Gene Information

Locus tag: Amir_4844
Symbol:
ID: 8329042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 5765696
End bp: 5767000
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 76%
IMG OID: 644945285
Product: Sterol 3-beta-glucosyltransferase
Protein accession: YP_003102517
Protein GI: 256378857
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
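
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5765696
END_BP = 5767000
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1305
print(expected_protein_length(GENE_LENGTH_BP))  # 434
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.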


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCATTC TCATCTACAC CTACGGGACC AGGGGCGACG TCCAGCCCTA CGTGGCGCTG 
GCGGTCGCGC TCAACGCGCG CGGGCACCAC TGCGTGCTCT CGGCCCCCGC GCGCTTCGCA
GGGCTCGCCG CCGCGCACGG CGTCGAGTTC GCGGGCCGGG ACGACGAGCT GATCCGGTTC
TACCTGGAAG ACCCCGAGGT GCAGTACAGC CTCGCCCACC AGGGCAGTGC CGAGCCCGGT
TTCCGGGCGC GGGGCCGCCG CGCCAGCACC GCCCTGCGCC GCACCCTGGT CGCCCGGCTG
CCGCACATCC TGCGCGACAC CGCCGCGGCG GCGGAGGGCG GCGCGGACCT GGTCGTCGCC
GGGCACTACC AGTGGGAGCT GGGCCAGCAC ATCGCCGAGC ACCTGAAGGC GCCGCTGGTG
ATGACCTCGC TGTGGCCGAC CTGCCTGCCG TCCAGGCGCC ACCCCAGCGA GGTGGTGCCC
TTCGGCGGCA GCCTCCCGCC GCTGCTCAAC CGGTTGTCCT ACCTGCCCCT GCGCTGGTTC
CAGGTCGGCG GCGCCGAGGT CGACCGGTGG CGCGCTGACC TGGGCCTGCC CAAGCGCAGG
GGCAGGCACG ACCGCTCCCG CACGGCGACG GGCGAGCCGG TCCCCTTCGT CCACGGCATC
AGCCCGCTGG TCGTGCCACC CGCCCCCGAC TGGCCCGCGA ACGCCCACAC CTCCGGATTC
TGGCGGCTGC CGCCCGCGCC GGACTGGAGC CCGCCCCCGT CCCTCGCCGA CTTCCTCGAC
CGCGACCCCA AGCCGGTGTT CATCGGCTTC GGCAGCATCG TCAGCCGCGA CCCGGAGGAC
ACCGCCCGCG TCATCCGCGA GGCCGTCTCC CGAGCAGGGG TGCGCGCCGT GGTGCGGTTG
GAGGCCAACA TCGACGCCGA CGCGCTCGGC CCCGACGTGC TCCCCGCGGG CGAGGCGCCC
TACGACTGGC TGTTCCCCCG CGTTGCCGCG ATCGTGCACG GCGGCGGGGT CGGCACGGTC
AACGACGCCC TCGCCTCGGG CGTGCCCCAG GTGCCCGTCC CGCACACCAG CGAGCAGGAG
GTCTGGTGCC GGATCGCGCA CCGGCTGGGT GTGGCCACCG AGCCGTTCCG GCAGCGCGAC
CTGGACGTCG ACCGGCTCGC CACCGCCCTG CGCGCCGCGA CCGGCGACGA GGGCCTGGCC
CGCGCCGCCC GCTCGGTGGG CGAGCGCGTC CGCGCCGAGG ACGGGGCGGG GACGGCCGCC
GCACTGGTGG AGCGCTACGG CCTCGACCGG GCGGCCACCC GATGA
 
Protein sequence
MRILIYTYGT RGDVQPYVAL AVALNARGHH CVLSAPARFA GLAAAHGVEF AGRDDELIRF 
YLEDPEVQYS LAHQGSAEPG FRARGRRAST ALRRTLVARL PHILRDTAAA AEGGADLVVA
GHYQWELGQH IAEHLKAPLV MTSLWPTCLP SRRHPSEVVP FGGSLPPLLN RLSYLPLRWF
QVGGAEVDRW RADLGLPKRR GRHDRSRTAT GEPVPFVHGI SPLVVPPAPD WPANAHTSGF
WRLPPAPDWS PPPSLADFLD RDPKPVFIGF GSIVSRDPED TARVIREAVS RAGVRAVVRL
EANIDADALG PDVLPAGEAP YDWLFPRVAA IVHGGGVGTV NDALASGVPQ VPVPHTSEQE
VWCRIAHRLG VATEPFRQRD LDVDRLATAL RAATGDEGLA RAARSVGERV RAEDGAGTAA
ALVERYGLDR AATR
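
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method. It assumes NCBI translation table 11 (standard amino acid assignments, with alternative start codons such as GTG read as M at the first position), and the helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical. The example input is the first line of the gene sequence above.

```python
# Amino acid assignments for NCBI table 11, in TCAG order
# (identical to the standard code; only the start codons differ).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for table 11 (assumption: taken from the NCBI table).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


first_line = "GTGCGCATTC TCATCTACAC CTACGGGACC AGGGGCGACG TCCAGCCCTA CGTGGCGCTG"
print(translate_cds(first_line))  # MRILIYTYGTRGDVQPYVAL
```

Passing the full 1305 bp gene sequence to `translate_cds` should give the 434 aa protein listed above, and `gc_fraction` on the same input can be compared with the GC content field.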