Gene Amir_2103 details

Gene Information

Locus tag: Amir_2103
Symbol:
ID: 8326292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2324805
End bp: 2326415
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 78%
IMG OID: 644942653
Product: transcriptional regulator, PucR family
Protein accession: YP_003099894
Protein GI: 256376234
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
              [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
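
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(2324805, 2326415)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both results match the Gene Length (1611 bp) and Protein Length (536 aa) fields.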


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACCCGA CCGTGGCGGA GGCGTTATCC CTGTCCGTGG TGCGACAGGG TCAGCCCAGG 
GTGGTCGCGG GCGCGGAGGG GCTGGGGCGG TCGGTGCGCT GGGTGCACGT GGCCGAGGTC
GCCGACATCG CGCCGCTGCT GCGCGGCGGG GAGCTGGTGC TGACCAGCGG GATCGCCCTG
CCGGAGGAGC CGGAGGCGCT GCGCGAGTAC GTGGCGGGGC TGGCCGGGGT GGGGGCGACC
GGGCTGGTCG TGGAGCTGGT GCGGCGGTGG CGGGGCGGGG TGCCGGGGGC GCTGGTCGCG
GCGGCGCAGG AGCACGGGCT GCCACTGGTC ACGCTGTCGG TGGAGGTGCG GTTCGTCGCG
GTCACCGAGG CGGTGGTGTC GCTGATCGTG GACGCGCAGC TCGCCGAGCT GCGGGCCGCC
GAGCAGGTGC ACGAGGCGTT CACGGCGCTG ACCGTGGCGG GCGCGGAACC GGCCGAGGTG
CTGCGGGAGG TGGCGCGCAC GTCGGGGCTG CCGGTGGTGC TGGAGACCCT GGGGCACGAG
GTGCTGGCGT ACGACGCGGC CGGGCAGGAC CCGGTGGCGC TGCTGGCCGA CTGGGGCGCG
CGCTCGCGCG AGGTGTCCGG GGCCGGGCGG ACCGCCTACC ACGCGGGGCC GGGGTGGCTG
GTGACCGCGG TGGGGGCGCG CGGGTCGGAC TGGGGCAGGC TCGTGCTGGT CAGCGCGGCC
GAGCCGCCGC ACCGGCACGT GGTGGTCGCG GAGCGGGCGG CGTCCGCGCT GGCCGTGCAC
CGGCTGGTGG CGCGGGACCG GGAGTCGCTG GAGCGGCAGA CGCACCGCAC GCTGCTCACC
CAGCTGCTGG ACCGGCCGCC CGCCGACCTG CCCGCGCGGG CCGCCGCGCT GGGGGTGCCG
CTGGAGCGGC GGCAGCTGCT CGGGGTGGCG ATCCGGCCGG GCGCGGCGAC CACGCCGGGG
CAGTCGCTGG CCACGCAGGA GGTGCTGCGG GACCTGGCCG AGGCGACGGC GCTGGCGGCC
CGGCGGGTGC CGGTGCCCGC GCTGGTCGGG GTGGTGGACG ACACGAGCGT GCGGGCGCTG
CTGGCGCTGC CGCCGCAGGC CCCGGTGGAG GGGGCGCTGC GGCAGTTCGC CCGCGAGGTG
CACCGGGCGG CGGCGGCCAC GCAGCACGGG CTGCCGGTGG TGGTGGCGGT CGGGACGGTG
GTGTGCGGGG TGCCGGAGGT GCGGCGCAGC CTCGGGGAGG CGGCGCACGT GGCGGGGGCG
GCGCTGCGGT CGCCGGGGGT GCGGCTGTAC CACCGGCTCG ACGACGTGCG GTTGCGCGGG
CTGCTGCACC TGCTGCGCGA GGACGAGCGG GTGCGGGCGT TCGCCGACCG GGAGCTGGGC
GTGCTGCTGG CCCGCGACCG GGCGCACGGG AGCAGGCTGG TGGAGCTGCT GCGGCACCTG
TGCGAGCAGG GCGGGAACAA GTCGGCGGCC GCGGCGGCGG CGCACCTGTC GCGGACGGCG
TACTACCAGC AGCTGGCGCG GGTGCAGCAG GTGCTCGGGG TGTCGCTGGA GGACCCGGAG
TCGTTGCTGT CGCTGCACAT CGCGCTGCTG GTGCGCGAGC TGGACCGCTA G
 
Protein sequence
MYPTVAEALS LSVVRQGQPR VVAGAEGLGR SVRWVHVAEV ADIAPLLRGG ELVLTSGIAL 
PEEPEALREY VAGLAGVGAT GLVVELVRRW RGGVPGALVA AAQEHGLPLV TLSVEVRFVA
VTEAVVSLIV DAQLAELRAA EQVHEAFTAL TVAGAEPAEV LREVARTSGL PVVLETLGHE
VLAYDAAGQD PVALLADWGA RSREVSGAGR TAYHAGPGWL VTAVGARGSD WGRLVLVSAA
EPPHRHVVVA ERAASALAVH RLVARDRESL ERQTHRTLLT QLLDRPPADL PARAAALGVP
LERRQLLGVA IRPGAATTPG QSLATQEVLR DLAEATALAA RRVPVPALVG VVDDTSVRAL
LALPPQAPVE GALRQFAREV HRAAAATQHG LPVVVAVGTV VCGVPEVRRS LGEAAHVAGA
ALRSPGVRLY HRLDDVRLRG LLHLLREDER VRAFADRELG VLLARDRAHG SRLVELLRHL
CEQGGNKSAA AAAAHLSRTA YYQQLARVQQ VLGVSLEDPE SLLSLHIALL VRELDR
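
The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), so an alternative start codon such as GTG at position 1 would not be rendered as M here. That limitation does not affect this gene, which begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last stretches of the sequence are used.

```python
from itertools import product

# Compact codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 21 nt of the gene sequence listed above.
print(translate("ATGTACCCGA CCGTGGCGGA GGCGTTATCC"))  # MYPTVAEALS
print(translate("GTGCGCGAGC TGGACCGCTA G"))           # VRELDR
```

The two outputs agree with the first ten and last six residues of the protein sequence.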