Gene Information |
Locus tag | Amir_4196 |
Symbol | |
ID | 8328389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 4945227 |
End bp | 4947227 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644944660 |
Product | conserved repeat domain protein |
Protein accession | YP_003101897 |
Protein GI | 256378237 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
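The coordinate and length fields in this record are related by simple arithmetic. Assuming inclusive, 1-based coordinates (the convention that reproduces the reported 2001 bp), the gene length is End bp - Start bp + 1, and a complete unspliced CDS encodes one residue per codon minus the stop codon. A minimal Python sketch of that consistency check follows; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one codon per 3 bp, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Amir_4196.
length = gene_length(4945227, 4947227)
print(length)                           # 2001
print(expected_protein_length(length))  # 666
```

Both results agree with the Gene Length and Protein Length fields listed above.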
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00838745 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCTG CACGGGGCGC GCACCGCTTC GCGGTGGTCG CGGCGATCGC GCTGCTGGTC CTCGGCGTCC TGTGGTGGGA CGTGCCGGGA GTGCTGCGCG CGGTCGGCAC CGGATCGATC ACCGGACAGG CGTACGTGGA CGCGAACGAC GACGGCGTCC GGCAGGGCGG TGAGGCCGCG CTGCCCGACC TGCCCGTCCG GCTCACCGGC ACGCGCGACA GCTGCCCGGT GGGCGAGCAG CCCGCCTGCG CGGCGTCCGC GACCACGACC ACGTCCGCGG CCGGCGAGTT CACCTTCACC GGCCTGGAGG CGGGCCTGTA CTCGGTCACC GCCGCCCGGC CCGCCGGGTA CGCCGACGGC AAGTCCACGG CGGGCGCGGC GGGCGGCTCG CTGAGCGCCC CGAACCAGAT CAGCGGCATC CGGGTCGACA GCGGCGCGTC CGCCACCGGG TACTCGTTCG GCATGAGGGT CGGGGCCGTC ACCGGCACCG CGTGGGTCGA CAACAACGCC AACGGCACGA TCGAGGACGA CGAGCACGAG TGGCTGGAGG ACGTGACCGT CACGCTCGTG AAGTCCGACG AGACCCCGGT CGCGACCACG ACCACCTCCA CCACCGGCAA CTACAGCTTC GACGCCGTCC TGCCGGGCGA CTACGTCGTC CGGGCGACCC TGCCCGCCGG GTACGGCGCG GCCTCACCCA CCTCGGTGCC GTTCACGCTG GCGTCCGGGC AGGGCAGGCA CGTGGACTTC TCGATGGTCA AGGGCGCGCT GGGCAACTTC GTGTGGCTCG ACGCCGACCG CGACGGGCTC CAGGGGATCG GCGAGGACGG GGTGCCCGGC ATCGCCGTCG AGCTGCACCG CACGCCGGGC GGGGTGGTCG ACAGCCAGCT CACCGACGCC AACGGCGAGT ACTACTTCGT GGGGCTGGAC GTGGGCACGT ACTTCATCCG AGTGATCAAG CCAAGCGGGA CGGTGTTCAC CGAGCGCGAC AAGTCCGCCA CGGTCGGGTC GCACGTCGAC GCGGACGGGT ACTCATCACC GGTCGAGATC AAGGTCGAGA ACAGCGGGAT CACCCAGGAC ATGACCCTGG ACGCGGGCTT CTACGAGCTC GCGCAGGGCG AGACCCCGCC GACCACGACG ACCACCACCA CCACCACCAC CACCACCGCT CCGACGACCA CAACCACGAC GACACCGACC ACCACCACCA CGCCGACGAC GACCACCACG GGCCCGACGA CCACGACCAC GGGTTCGACG ACCACGACCA CGGGCCCGAC GACCACGACC GGTCCGACCA CCACGACGGG CGCGACCACC TCCACCACGC CGCCCACCAC AACCGCCCCC GGCACGACCA CGACCACCAC CACGCCCCCG CCGCGCACCG ACCTGGGTGT GCAGTTCGCC GTGGACAACC CGAAGCCCGC CGTGGGCGAC AAGGTCACGT TCACCACCGT GGTCGTCAAC CGGGGCACCG CGCCGGTGGA GGGCTCACGG GTGACGATCA CCCTGCCGGA CGGCCTGCGT CCCGAGACCG GCACCGGCCA GAACCTCCGC AGGGCGCTCC TGGCCCAGTC CGGCTGGACC TGCCAGGCGA CCGGCCAGCA GCTGGTCTGC GCGAACCCGG CCACCGTCCA ACCGGGCGCC TCGTTCGAGC CGCTGACCGT GGTGACGACC GCGACGGCCC CGGTCCTGCC GCAGGCGACC ACGGTCGCGG TCGCCCTGTA CGACGGCACT CCCGACGACA ACCCGGACAA CGACGGCACG ACCCCGGCCC TCGCCATCCC GAGCACGACC 
ACCCCGACCC CGGGCGTGGT GACCGAGCTG CTCCTCCCGA CCCAGCAGCC CACCACAACC CCCCTGGCCA CCACGGGCCG CCCGGCGCAC GCGCTCCTGC TGACGGCCCT GGTCCTGATG GTCCTGGGCG CGGGCCTGCT GGTCACCACC CGCAAGCCCG CCACCGGAGG CCGCCACCGC GCGGCACGCC GGAGCGACTG A
|
Protein sequence | MPAARGAHRF AVVAAIALLV LGVLWWDVPG VLRAVGTGSI TGQAYVDAND DGVRQGGEAA LPDLPVRLTG TRDSCPVGEQ PACAASATTT TSAAGEFTFT GLEAGLYSVT AARPAGYADG KSTAGAAGGS LSAPNQISGI RVDSGASATG YSFGMRVGAV TGTAWVDNNA NGTIEDDEHE WLEDVTVTLV KSDETPVATT TTSTTGNYSF DAVLPGDYVV RATLPAGYGA ASPTSVPFTL ASGQGRHVDF SMVKGALGNF VWLDADRDGL QGIGEDGVPG IAVELHRTPG GVVDSQLTDA NGEYYFVGLD VGTYFIRVIK PSGTVFTERD KSATVGSHVD ADGYSSPVEI KVENSGITQD MTLDAGFYEL AQGETPPTTT TTTTTTTTTA PTTTTTTTPT TTTTPTTTTT GPTTTTTGST TTTTGPTTTT GPTTTTGATT STTPPTTTAP GTTTTTTTPP PRTDLGVQFA VDNPKPAVGD KVTFTTVVVN RGTAPVEGSR VTITLPDGLR PETGTGQNLR RALLAQSGWT CQATGQQLVC ANPATVQPGA SFEPLTVVTT ATAPVLPQAT TVAVALYDGT PDDNPDNDGT TPALAIPSTT TPTPGVVTEL LLPTQQPTTT PLATTGRPAH ALLLTALVLM VLGAGLLVTT RKPATGGRHR AARRSD
|
| |
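The sequence fields can be cross-checked against each other and against the reported GC content. The Python sketch below strips the 10-base grouping, computes the GC fraction, and translates the DNA using the standard elongation codon assignments, which NCBI translation table 11 shares with table 1 (the two tables differ in which start codons are permitted). The helper names are illustrative. It also assumes the listed gene sequence is already in coding orientation, which is suggested by its leading ATG even though the gene lies on the minus strand.

```python
# Standard codon assignments in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGCCCGCTG CACGGGGCGC GCACCGCTTC")
print(translate(fragment))  # MPAARGAHRF
print(round(gc_content(fragment), 2))
```

The translated fragment matches the first ten residues of the protein sequence in the record. Running the same functions on the full gene sequence would allow a comparison with the reported 73% GC content and the 666-residue protein.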