Gene Amir_3984 details


Gene Information

Locus tag: Amir_3984
Symbol:
ID: 8328177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4667269
End bp: 4669086
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 77%
IMG OID: 644944458
Product: PBS lyase HEAT domain protein repeat-containing protein
Protein accession: YP_003101695
Protein GI: 256378035
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
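
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4667269, 4669086)
print(length_bp, protein_length(length_bp))  # prints "1818 605"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.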


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.187514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGAAAGA TCGACTGGCA GTCCCTGGAA TGCCTCCACG GTCCCGCCGG GAACGTGCCG 
GACCTGCTGG AGGAGTGCGC GCACGAGGAC CCGCTCCACG CGTTCGGCGC GATCGCCGAC
CTGGGCGACC TCCTGCGCCC CTCCCGCGCC CGAATCCTCT CCGCAGCCCC GGCGGCGCTC
CCGTTCCTGG TCGACCTGGC CGAGAACGGC CCGCACGCGC GCGAGCGGGT CGTGCGCCTG
ATCCGCCAGA TCGCCGAACG CGGCCACCCC ACCCCCGAGT GGACCGGGGC CCTGGACGCG
GCCCGGCCCG GCCTCCTCGC CCTCCTCACC GACCCGGACC CGCTGGTGCG CCGCCAGGCG
GGCAGGCTCC TGTCCTGCCT GGACCACCCG GAGGCCCTGA CCGCCCTGCG CGAGGGCTGG
GACACCGAGC AGGACCTCCG CGTGCGCTGC GACCTGGTGC GCTCCCTGGG CGGAGCGGAC
CCCGCCTTCG ACCTCACCGC ACTGCTCACC CACGACGACC CGCAGCTGTG CCTGGCAGCG
GCCCACGCCC TGCCCGCCAC CGCCGCGCTC CCCGACGCGA CCGCGCTGGC GAACGCCGTG
GCCTCACCGG ACAGCGCGGT GTGGGTCGAC TCCGCCTGGC TGGAAGATCC CCGCCACGAC
GACGCCCTCA CCGAGCTGGT CATCACCACC GGCGACCTCC TCGCCGAGGA CCCCGCCGCC
CTGACGGGCT ACGTCACCCT GGTGGCCCGC AACGGCATCG CCCCCCGCCG CGCGGCGGTC
CTTGGCTCAG CCCTGCGCCT GCTCTGCACC TGGCGCGACG TGGACCTGGT CCCCCTGCTG
GGAACCCTCC TCCACGACCC CCACCCCGCG GTCCGCTACC GAGCCGCGGC GGTCCTGGCC
TGCCTGGGCC CAGCCGCGCG CCCCCACGCC GACCGCCTGG CGGCCCTGCT CCAGGACCGG
TCGGAGCAGC CCGGCGAGTC GACCACGCAC ACCGCGGGCG ACATGGCCCT GTGGGCGCTG
GCGGCCCAGG GCGACCCGCG CTGCGTCCCC GCGCTGGTCG CCCTGCTGGA GAGCGACCGC
GTCCCGTTCG ACCTGAACAC CCACCGCCCC ACGAACCCCA CCCCCGTGAC CGCCTGCGGC
CCCTGGCTCC ACGAGCCCAC CGCCGAGGAG GTCCTCACCC CGCTGCGCGC GCACGCCGCC
GCCCTGGTCC CCCCGATCGC GGCCCGCCTG GCACGCCCCG ACCAGCACCG CCTGCTGGTG
GCGGCCCTGT GCCGGGTCCT GGCAGCCTGG GGCCCGCTCT CCGGCGAGGC CAAGTCCGCG
CTGGAACCCC TCACCGGCCA CCGCTACTAC GGCCGCTACG CGACGGCGGC CCTCAAGTCG
ATCGACGGCT ACACCGAGGC CGACGTCCCC GCCCTGGCGG GCGAGGCCCG CCGCTCCGGC
GTCACGATCG GCGTTCTGGG CGCGCTGGGC GCCGCGGCAG CGGAGGCCGA GGACACCCTG
CGCCGTCTGG CCACCCCGGA CGAAACGGCC TGGCGCCGCG TGGAGGCGTC CTACGCCCTG
TGGCGCGTCA CCGGCGAGAC CACGACCGCG GTCCCCCTCC TGCTGGAAGC CGCCGCTCCC
CTGGCGACCG GTGACTACAC CCGCCCGCGC GGGGCGGCCC TGCACCACCT GGCGGAGATC
GGCGTGCGCA CCGAGGAGGT CCTCGCCACC GCCCGAGCCG TGGCCACCAC CCGCCGCCGG
GTGGCGAACG TCGGCGACCG CGAGCGGATC GCCGAGGACG AGGCGCTGCG GGCATCAGCG
GCCCAACTGC TCGGATGA
 
Protein sequence
MRKIDWQSLE CLHGPAGNVP DLLEECAHED PLHAFGAIAD LGDLLRPSRA RILSAAPAAL 
PFLVDLAENG PHARERVVRL IRQIAERGHP TPEWTGALDA ARPGLLALLT DPDPLVRRQA
GRLLSCLDHP EALTALREGW DTEQDLRVRC DLVRSLGGAD PAFDLTALLT HDDPQLCLAA
AHALPATAAL PDATALANAV ASPDSAVWVD SAWLEDPRHD DALTELVITT GDLLAEDPAA
LTGYVTLVAR NGIAPRRAAV LGSALRLLCT WRDVDLVPLL GTLLHDPHPA VRYRAAAVLA
CLGPAARPHA DRLAALLQDR SEQPGESTTH TAGDMALWAL AAQGDPRCVP ALVALLESDR
VPFDLNTHRP TNPTPVTACG PWLHEPTAEE VLTPLRAHAA ALVPPIAARL ARPDQHRLLV
AALCRVLAAW GPLSGEAKSA LEPLTGHRYY GRYATAALKS IDGYTEADVP ALAGEARRSG
VTIGVLGALG AAAAEAEDTL RRLATPDETA WRRVEASYAL WRVTGETTTA VPLLLEAAAP
LATGDYTRPR GAALHHLAEI GVRTEEVLAT ARAVATTRRR VANVGDRERI AEDEALRASA
AQLLG