Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_2814 |
Symbol | |
ID | 8327003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 3239824 |
End bp | 3242874 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 644943349 |
Product | hypothetical protein |
Protein accession | YP_003100590 |
Protein GI | 256376930 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACGA CCGAACAGCA GATGCAGAAG ATGCACGTGA TGTCCCGACG CCCGGCCCCG ACCAGCGACG GTGGCGCGGG CTCCGGGTTC TCCATGAACC AGTCCGACAT GGAGAACCTG CGCAAGCAGG CCGAGGACCT CGGCAGCAGC TACAAGGAGA TCGCCGCCAC GGTCACCGGC GCCTCCCTGT CCGGCAACGC CTTCGGGCAG ATCGGCGCCG AGGCGGTGGA GCGGTTCAAC GTCTCCTCCA CCAAGGGCGG GATGCACCTC GACAAGATCT CCATGACGAT GGGCAAGGTC GCGGGCGGCA TCCAGGCCAC GGCCCAGGAC CACGCCCAGA CCGACGACAG CAACAACCAG GAGTTCGCGG GCATCAGCAC CGAGACCGAC GCCCAGGCCC CTGCCGGCGC GGACGGCGCC GCTGGGACGG GCGCGACCGG TCAGGGCGCG GGCCCGCAGG TCAACGACCC CGGTTCGGTC AACGGCGACC CCGGCGCCAC GGGCGGCGCC GGCACGGGCG GCGCGGGCGG CGGTCCCGCC GTGAACCAGC CGGGGACCAC CGGCGGCGCG GGCGGTCCGG ACGGCACGGG CGCCGGTGGG ACCGGCGGCA GCGGCACGGG CGGCAGCGGT TCCGCCGGGG GCGCCAACGG CGCGCCGAAG CCCGGCGACT ACCAGATCGA GGACACCGCG AACGTCGGCA CGCCGGGCGG CGGCGACGCC GGGGCGACCG GTGGCGCGGG CGCTGGGACC GGCGCTGGTG CGGGCACTGG CGCTGGTGCC GGCTCGGGTT CCGGCGCGGG CACGGGTTCT GGCTCCGGCT CTGGTTCTGG CTCTGGCTCT GGTTCGGGCA GCGAGACCAA CGGCCAGCAG CAACCGCAGC AGGCCCCCGA GATGCCTCCG ATCCCCGAGG TCCCGAAGGA CCTGTTCTCC CAGGGCGGGG GCGCAGGTGC GGGCGCCGGT GGCGAGGGCG CAGGCGCCGG TGGCGCCGGT GGCGGCGTGC CCGGCGGCTC TGGCTCGACC GGCGGCTCCG GCGCGGGTTC CGGCTCCGGC AGCGGTTCCG GCGCTGGTGC AGGCAGCGGC TCGACCCCGT CCACGCCCAG CCCGAGCGAC TACCAGATCG AGGACACGGC GGACATCGGC TCGCCCGGTG GCTCCGGCAG CGGTTCCACC AGCGGCAGCG GCTCCGGCTC CGGCTCCGGC CAGCAGCAGC AGCCGATGCC GAACATCCCG CCGATCCCGG AGGTCCCGAA GGACCTGTTC TCCCCTGGCG GGGGCGCAGG TGCGGGCGCG GGCACCGGTG GCGAGGGCGC GGGCACCGGC GGCGCGTCCG GCGGCAGCGG CTCCACGGGT GGTTCCGGTT CGACCGGTGG CTCGGGTTCC ACGGGTGGCT CGGGTTCCAC GGGTGGTTCC GGTTCGACCG GTGGTTCGTC GCCGTCGATC CCGAAGCCCG GCGACTACCA GATCGAGGAC ACCGCCAACA TCGGCACGCC GGGCGGCAGC GGCGGCTCGA CCCCGTCGAC GCCCACCATG CCGACCATCC CGCCGATTCC CGAGGTCCCG AAGGACCTGT TCTCCCCCGG CGGCAGCGCG GGCGCGGGCG GCTCCGACAG CTCCGCGGGC GGCAGCGGTT CGACCGGCGG CAGCGGCTCC ACCGGCGGCT CCGGCCTGCC CGGCGGCAGT GGTTCGACCG GTGGCTCGGG TTCCACGGGT GGTTCCGGTT CGACCGGTGG TTCGTCGCCG TCGATCCCGA AGCCCGGCGA CTACCAGATC GAGGACACCG CCAACATCGG CACGCCCGGT GGCAGCGGCG GCTCGACCCC GACGGCCCCG ACCATGCCGA CCATCCCGCC GATCCCGGAG GTCCCGAAGG ACCTGTTCTC CCCTGGCGGC ACGGGCGGCG TCCCGGCGGG CGACAGCAGC GCGGACCCGA TCAAGGGCGG CGACTCGACC GGCAGCAGCG GCTCCGGCTC TGGTTCCGGC AGCGGCTCTG GCAGCGGTTC CGGCACCGGC GGCTTGGGCG GCGGTCTCGG CTCCGGTTCG GGCAGTGGCA GCGGCACCAC CACCGGTTCC GGCGCGACCA CCCCGACCAT GCCCACCATC CCGCCGATCC CCGAGGTCCC GAAGGACCTG TTCGGCACGC CGGGCGTTCC CGCCGGTGAC ACGGACAACT CGGGCGCCGG TTCTGGCTCT GGCTCCGGCT CCGGCGCTGG CTCGGGCGCT GGCGACGGCT CCGGTTCCGG TTCGGGCTCC GACGAGGACC GCGGTCGCCA CTGCGGTAAC GACGGTCGCG GCGAGACCGG CGGCAAGGGC GAGCACGAGG GCCGCGGTGA CCACGGTGGC CGGGGCGACC ACGACGGTCG TGGTGAGCCC GGCGGCAAGG GCGAGCACGA CGGTCGCGGC GAGCACGGTG GCCGGGGCGA GCACGACGGC GACCGCGACG GCGACAAGGG CGACGGCAAG GACGGCGACC GCCCCGACGG CCTCATCGGC GACAAGGACG ACAAGTCCGA GGACCTCGTC GGCGAGCGCG ACGGCGAGAA GGACGGCTTC CCGATCACCT CGGTGGACGG CGAGCCGTTC CTCGACCTGC GCGACCTCCC GGCCGACGAG GCCGAGCGCT GGACCGAGCG CATCCGCGAC ATCATGGAGA GCAAGGGCGA GGGCTCCTTC TACTGGGCCG ACAGCACCAT CGACGGCGAG GGCCAGCGGC ACTCGCTGAT GGACGTGGCC GAGCTGATGA CCGGCCTGGA CAACCGCACC GAGGGCTCGG ACGGGGTCAA GGCGTTCACG AACCTCTCCG ACACCGGCGC GACCGCGACC GGCCAGCAGC CCACGTCCGT CGCGGGCAAG CTGTCCTCCG ACGTCGCGGC CAGCCCGGCG CGCAGCGCCA ACGGCGACGT GTTCGTCCTG GTCGGCCCGA ACCGCGCCGA GGGCGACCCG GTCGCCATGA CCGAGTTCCC GTCGCTCCAG TCCAACCCCA AGGTCGAGCG GGTCTTCGCG ATCGACGTGA CCACCGGCAA GGAGGTCCAG ATCCACCCGA AGCAGGCCTG A
|
Protein sequence | MSTTEQQMQK MHVMSRRPAP TSDGGAGSGF SMNQSDMENL RKQAEDLGSS YKEIAATVTG ASLSGNAFGQ IGAEAVERFN VSSTKGGMHL DKISMTMGKV AGGIQATAQD HAQTDDSNNQ EFAGISTETD AQAPAGADGA AGTGATGQGA GPQVNDPGSV NGDPGATGGA GTGGAGGGPA VNQPGTTGGA GGPDGTGAGG TGGSGTGGSG SAGGANGAPK PGDYQIEDTA NVGTPGGGDA GATGGAGAGT GAGAGTGAGA GSGSGAGTGS GSGSGSGSGS GSGSETNGQQ QPQQAPEMPP IPEVPKDLFS QGGGAGAGAG GEGAGAGGAG GGVPGGSGST GGSGAGSGSG SGSGAGAGSG STPSTPSPSD YQIEDTADIG SPGGSGSGST SGSGSGSGSG QQQQPMPNIP PIPEVPKDLF SPGGGAGAGA GTGGEGAGTG GASGGSGSTG GSGSTGGSGS TGGSGSTGGS GSTGGSSPSI PKPGDYQIED TANIGTPGGS GGSTPSTPTM PTIPPIPEVP KDLFSPGGSA GAGGSDSSAG GSGSTGGSGS TGGSGLPGGS GSTGGSGSTG GSGSTGGSSP SIPKPGDYQI EDTANIGTPG GSGGSTPTAP TMPTIPPIPE VPKDLFSPGG TGGVPAGDSS ADPIKGGDST GSSGSGSGSG SGSGSGSGTG GLGGGLGSGS GSGSGTTTGS GATTPTMPTI PPIPEVPKDL FGTPGVPAGD TDNSGAGSGS GSGSGAGSGA GDGSGSGSGS DEDRGRHCGN DGRGETGGKG EHEGRGDHGG RGDHDGRGEP GGKGEHDGRG EHGGRGEHDG DRDGDKGDGK DGDRPDGLIG DKDDKSEDLV GERDGEKDGF PITSVDGEPF LDLRDLPADE AERWTERIRD IMESKGEGSF YWADSTIDGE GQRHSLMDVA ELMTGLDNRT EGSDGVKAFT NLSDTGATAT GQQPTSVAGK LSSDVAASPA RSANGDVFVL VGPNRAEGDP VAMTEFPSLQ SNPKVERVFA IDVTTGKEVQ IHPKQA
|
| |