Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_4404 |
Symbol | |
ID | 8328601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 5200156 |
End bp | 5203194 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 644944865 |
Product | hypothetical protein |
Protein accession | YP_003102098 |
Protein GI | 256378438 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGTCC TGGACCTCGA AGCGATCGCG CGGGACGAGG CGGAGCACCC CTGGCTGCGC GCCGAGGTCG CCGAGGAGCT GGCGGGCAGG GCGAGCCCGC TCGTCGCCGC CGCGCGGGTC TGGTGCGCCC GCCAACGCCC GGAACCGCAG GTCGTCGACC TGTGGGCGCT CGACCACGGC CTGGTCTTCG CGGCGGTCGC CGCGCTGCGC ACCTGCCTGC TCACCGAGGG CGCGCGGATG ATCGAGGCCG ACGCGCGCGA CAGCTGGTGG AGCAGGCACA CCCCGCTCCT GCGGCGGGCC CGCGCGCTGC TGGCGGCGGC GTCCGACGCG GACTACGCCG CCGCCGTCAC CGCGCTCGAC GGCGAGCGCG AGGGCGCGCG CACCTCGGTC ATCGCCGCGT ACCTCGTGCC CGACCAGCCC GGTTGGGTGG ACCGGGCGGT CGAGGCGCTG AGCGCGCTCG ACGACCGGAG CGCGTGGCGG CTGCTGCTGC ACTCGGTCGA CTCGCTCGAC CGGCTGCAGG CGGTCCTGCG CCTGTCCGGG CGCGAGGCGC TGCGCTCGCC CGAGGTGCTG CACACGCTCG GCACGGTCAT CGGGCCGGAC CTGGCGCCCC TGATCACGCA CCTGCTCCCG GACACCTTCG GCTACCAGGC CACGGGCGAC CAGGTCAGGG CGATGCTGGT GGAGTTCGCC ACCGCCCCGG CGTTCACGGC GCTCCTGGCC GACCTGCTGG GCGGCGGCGT CCCGATCTCC GCGCTGGAGG TCGCGCACAA GCGGCCCCGG ATGGCGCTGG AGCGGCTCGC GGCGGCCGGG GAGCCCGCGC GGGAGCTGCT GCTGGACCAC CTCCGGGTGC ACCCCGACCT GCTGGACTCC GCGCCCGAGT GGGCGCGCGA GGTCCTCGTG TCGGAGCTGG CCGCGCGGTC GGCGGTGCCC GACGCGGTCG ACGCCGAGCT GCCCGCCGCG CTGGTGACCC CGCCGTGGGA GGCGGCCCGC GAGCTGCCGG AACCGCTGGT GGTGCGGGGG GTTCCGGTGC CCGACGAGCG GTCCGTGGTG TGGCGGGCCG GGCTGCGGGA GGTGTGGGCG GTCAGGAGCC CGTACATCGC GGGGTCGCCC GAGTCCTGCT GGGAGCGCTA CCCCGAGCGG TTCGAGCGGG GGGAGCTGAG CGCGAGCGAG CAGTTCCGCT GGTTCGCTCA CGGGCCGCTC GACCAGGTGC TGCCCAGGGT CGCCCGCTGG ACCCTGCCCG AGCTGAACTG GTACTCCGGC GAGCTGGTCC AGGTGCTGCT GAACCGGTTC GAGGTCGACG CGTTCGGGCT CGCGCTCGAC TACGCCCGCA ACCCGGCGGC GCGCGCCTAC GACGTGGTGC TGCCGCTGGT GTCGGTGGAG GTCGCCCGGC TGGTCGCCGC GCGGCTGGAG GGTCCGAAGG CGGCGCGGCC GGTGTCGGTG CGGTGGGTGC GCGCGCACCC CGAGGCGGCG GCCCGGTTGC TGCTGGCGGA CGCGGTCGGC GAGCCGGGGG AGCGCAGGGC GGCGGCGGAG GCCGTGCTGC GGATGGCGCC GGAGCAGGCG CGGGCCGCCG CCGCTCCGCA CGGCGACGAG GTGGTGGAGG CGGTGGAGCG AGTGCTGTCG GTGGCCCCGG TGGACCTGGT GCCGTCCCCG GTGCCGGATC CGGGCTGGTG GTGCCCGCCG GAGGGGCTGC CCAGGCTGCT GCTGCGCGGC GGCGCTCGGG TGCTGCCGGA CCGGGCGGTG CGGCACGTGA CGACCATGCT GGCCATGACC GTGCCCGGTC GGCACTACGC CGGGGTGGAC GTGGTGCGCG AGGTGTGCGA CCCGGAGTCG CTGGCGCGCT TCGCGGAGGC CCTGCTCGAC CGGTGGATCG CCCACGGCCT GCCCGCGAGC GGGAGGTGGG CGCTGTTCGC GCTGGGCCCG GTCGGCGACG ACGCGGTGGT GCGGCGGCTG GTGCCGCTGA TCACCGCGTG GCCGGGGCAG TCGCAGCACC ACAACGCGGT CGCCGGGCTG GACGTGCTGG CCGGGATCGG CAGCGAGGCC GCGCTGGTGG CGCTGAGCGG CATCGCGCAG CGGGCGAGGT TCAAGGCGCT CAAGCAGAAG GCCGGGCAGA GGGTCGCCGA GGTCGCCGAG GACCTGGGGC TGACCCCCGA GCAGCTCGGC GACCGCCTGG CGCCGACCCT CGGACTCGAC GAGGAGAGCG CGCTGGTCCT GGACTACGGG GCGCGCTCGT TCACGGTCGG CTTCGACGAG AAGCTCGCCC CGTTCGTGCT CGACGGGACG GGCAAGCGGC TGAAGTCCCC GCCCAAGCCC GGCGCGAAGG ACGACCCGGA GCTGGCGCCC GCCGAGCACC GGCGGTTCGC GGCGCTGAGG AAGGAGGTCA GGGTCGTCGC GGCGGACCAG GTGGCGCGCC TGGAGCGGGC GATGGTGGCC GGGCGGGACT GGAGCGCCGG GGAGTTCGCG GCGCTGCTGG CCGGGCACCC GCTGGTGCGG CACCTGGTGC GCGGGCTGGT GTGGGCCGCC GACGGCGCGG CGTTCCGGGT GGCCGAGGAC GGCACGCTCG CCGACGTGAC CGACGAGCCG TTCACCCTGC CAGGGGACGC GCGCGTCACC CTGCCCCACC CGCTGCGCCT GCCGGACCTG CCGGGGTGGT CGGAGCTGTT CGCGGACTAC GAGATCCTCC AACCGTTCCC GCAGCTGGCC AGACCGGTCC ACCGCCTCAC CGGGCCGGAG CTGGCCGGCC GCGACCTGAC CCGCTTCGTC GGCGCGAAGA CCGGGACGGT GCCCACGTCG GTGAAGCGCC ACTGGGAACC GGGCCCACCG GTGGACGCCG GGATGATCCT CGGCGTGCGC CGCCCCCTGC CGGGCGGGGC GTCGGTGTGG GTGGAACTCG ATCCGGGCCT GTTCACCGAC CACTGGCAGG ACTGGTCGGC GCAGCGGGTG GTGGAGGTGC GGCTGAGCGG GGTGGCGGAC TTCGCGGCGG TGGACGAGGT GCTGGTGTCG GAGGTGGTGG GTGAGCTGGA GATCATGACG GCGGGGTGA
|
Protein sequence | MPVLDLEAIA RDEAEHPWLR AEVAEELAGR ASPLVAAARV WCARQRPEPQ VVDLWALDHG LVFAAVAALR TCLLTEGARM IEADARDSWW SRHTPLLRRA RALLAAASDA DYAAAVTALD GEREGARTSV IAAYLVPDQP GWVDRAVEAL SALDDRSAWR LLLHSVDSLD RLQAVLRLSG REALRSPEVL HTLGTVIGPD LAPLITHLLP DTFGYQATGD QVRAMLVEFA TAPAFTALLA DLLGGGVPIS ALEVAHKRPR MALERLAAAG EPARELLLDH LRVHPDLLDS APEWAREVLV SELAARSAVP DAVDAELPAA LVTPPWEAAR ELPEPLVVRG VPVPDERSVV WRAGLREVWA VRSPYIAGSP ESCWERYPER FERGELSASE QFRWFAHGPL DQVLPRVARW TLPELNWYSG ELVQVLLNRF EVDAFGLALD YARNPAARAY DVVLPLVSVE VARLVAARLE GPKAARPVSV RWVRAHPEAA ARLLLADAVG EPGERRAAAE AVLRMAPEQA RAAAAPHGDE VVEAVERVLS VAPVDLVPSP VPDPGWWCPP EGLPRLLLRG GARVLPDRAV RHVTTMLAMT VPGRHYAGVD VVREVCDPES LARFAEALLD RWIAHGLPAS GRWALFALGP VGDDAVVRRL VPLITAWPGQ SQHHNAVAGL DVLAGIGSEA ALVALSGIAQ RARFKALKQK AGQRVAEVAE DLGLTPEQLG DRLAPTLGLD EESALVLDYG ARSFTVGFDE KLAPFVLDGT GKRLKSPPKP GAKDDPELAP AEHRRFAALR KEVRVVAADQ VARLERAMVA GRDWSAGEFA ALLAGHPLVR HLVRGLVWAA DGAAFRVAED GTLADVTDEP FTLPGDARVT LPHPLRLPDL PGWSELFADY EILQPFPQLA RPVHRLTGPE LAGRDLTRFV GAKTGTVPTS VKRHWEPGPP VDAGMILGVR RPLPGGASVW VELDPGLFTD HWQDWSAQRV VEVRLSGVAD FAAVDEVLVS EVVGELEIMT AG
|
| |