Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_2810 |
Symbol | |
ID | 8326999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 3233060 |
End bp | 3236017 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 644943346 |
Product | hypothetical protein |
Protein accession | YP_003100587 |
Protein GI | 256376927 |
COG category | [N] Cell motility |
COG ID | [COG5651] PPE-repeat proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.157063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAGC GCGGTGCGAC GGTGAGCGGG GAGCTGGTTG TGGGAGAGGC GGTCTCGGTG CGGGCTTTGG GGTTCTGGCG TGCGGTCGAG CTGTTCGACC AGCCGGAACT GGGCGAGGTC GAGCGGGTCA GGGCCGGTGA GCCGCTGCCG TGGGATGCCG AGCACCCCCT CGCCCGGCGG GAGCTGCCCA GTGGGACCGA GTGGCGGCAC GTCGTGCACG TCGGCATCCA CCCGCGCGGC AAGGCGCTCG CCGTGCTCGC CGGGGCGTTC GGCGCGCCCA CGCCCGACGC GGACGGGGAC GGGGCGCTCG CCCGGTTCGC CGTCGACGAC AGCGGGCGCG CGCTGCTCGG GTCCGCCGCC CTGTCGGCCG CCGCCTGGGC CACCGGCGAG GTCGGGCTGA ACGGCGCGGC CGTGCTCGGG CGGCTCACCG GGTTCGCCCA GGCGCAGGAG CGGTTCGCCG CCGAGTTCCG CGCCACCGCC CTCGGCAGGC TCGGGCACAC CGAGCTGGCC GCCTGCGAGG ACGCCGCCGT GCGCGCCACC GGGTTCTCCC CGGCCGACGC CGAGATCGGC GTCACGAGCA CCGCCGTGCC GCGCGGCAGC GGCACGGCCT GGCTGCCCGA GCCGACCAGC CCGTTCCTCG CCGAGCTGGA CGCCGCCGAG CCCACCGCGC CCGCGCTGCG CGACCACCTG GCTGGCGAGC CGCCCGAGCC AGGTCCGGCC GACGCCGACG CGGTCCACGA GCTGGTGGTC GGCGTGCTCG TGCGCCGGGC CCGCGACCTC GCCGAGCTGG GGCACCCGCG CGAGGCGTTC AGCGGGCTGC GGTTCCGCTG GCTCACCGCC GACCGGGCCC GCGAGGTCAG CGCCTGGCGA CCGGAGCTGG CCGGGGGAGG GGTGCTCGCG GTCAGCGCCG ACGAGACCAC CGCGCGTGCC GCGGGGGCGG CCGTGCGACC CGCGCCCGCC GCTGCGGACG GGGCCACCGG GCACGACTGG CCGCTCGCGC TCACCCTCGT CGACCGCGCC GACCGCGAGC GGTTCGACGA AGAGCTCTGG ACCGGCGGCG CCGGCCTCCT GGACTTGCTG AAGACCTGGG AGGACAATGG TCCTCGACGC CCGTGGTCGG ATGCCGTTGC CGCGTTCCGC GCTGCCCGCC TGCTCGTGGA CGATTTGCAC GCCAGGCGCG CCGCCGCCAG CCGTGCCGTC GACCGCGAAC CCGTGCTGCG CGAGGAGCTG GACCGCGCCC GCGCGGGGCT CACGGCCGCC ATCGCGCGCC TGGCCGAGGT GCGCGCCCAC CGCGACCAGC TCGCCCAGGT CGAGCGGGCC GCCGTCGACG CGCACGCGAA GCTGGAGCAG GAGGCCGCCC AGCAGTTCGC GGCCATCCAG CACCAGGTCT GGGACTGGAA CCAGGAGCTG GAGCGCCGCA AGGGCGAGCA CCGGGAGCAC CGCAAGCTCC GGCCCGCGCT GTGGAAGCGC GTCGCCAAGC AGGACAGCGA CCAGCACCTG TGGTCGTGGC GCGACACGTG GTTGTCCGAC CGGGTGAAGC TGGCCGAGGC CGAGCTGAAG CGCCTGCACG CCACCCCGCC GCCCCAGGCC CCGCCGCCAC CGCCCCGCAC GCCGGGGCTC GACGCGGCGG AGCAGGCGGT GCGCGCGGCC GTGCTCGCCC AGGCCGACCA CGAGTGGGTG ATCAGCGAGC GCACGGCGGA ACTGGCGCGC GTGGAGGAGC TGCTGCCGGT CGCGGCCGAG CTGTTCGCCG GGCGGGAGCG CGCCGAGCCG TGGACGGACC CGGAGTGGAC GGCCGCGCGC GACGCGCTGC TCCTGGCCGC CCTGGAGCTG CACCGCGAGT TCCTCGGCCA TGAGGCGCGG GCGGTGCGGC GCAACCTCCA GGCGGTGGCC GACCTGATCG CGGGCGAGTC GTCCCCGGAG ATGCCGGACG GCGCGGTCGG GGAGGCGTGG CGCACGCTGT TCCTGGTCAC GCCGCTGATC AGCGTCACGG CGGAGTCGGG CGCGCGCCTG CTGGCGGGCG CCGGGCCGGA CTCCCTCGGC TGGCTCCTGC TGGACCGCGC CGTGCCACCC GCGCGGGCGG TCGGCGCGCT GCGCCGGGCG CGCCGCGTCC TGGCGGTGGG CGACGGCGAG CCGGGGACGA CGCCGGTCGA GCGCCTGCTC GGCCCGCGCT GGGCCGCCCT GGAGGCGGCT GCGGCTGCCG CGGCGACGGC TGCGCTGGCG GCTGCCGCGG CCAGCGCCGC GCCCGCCCAC GGGGCGGTGG GCAACCCCGC GCCCGGTCAG GGCTTCGGCG GTCCCGGCGC GGGTCCCGGT GGTGGGCTGC CCGGCGGCGG CCTGCCCAGG TCTGGGCTGC CCGGAAGTGG GGTGGCTGGG CACGGGCTGC CTGGGAACGG GGTGGCTGGA AGCGGGCTGC CTGGAGGCGG GGCGGCCGGA CCTGGGCTGC CTGGGGCCGG GGCTCCCGGT GCTGGGATGC CCGGTGCTGG TTCGGCGGGC AGCGGGTTGT CCGGCACCTG GATGACTGGA GGCGGGGCGC CTGGTGGTGG CGTGCCGAAC AGCGGCACGT TCTCTCCCGG TACGCCGAAT CCGGGGACGC TCAACCCTGG TGCGCGGGTT CCCGGCGTGC CGACCCCCGG AACGCTGAGC CCTGGTGCGC GAGTTCCCGG AGCCTCCTTC CCCGGAACGC TCAACCCCGG CGCGCGGGTC CCCGGCGCGC CCGGTGGGCC GATGCCCACT GGTCAGGCGT TCGGCGGCCC GGCTCCTACC GGGCAGGTCC CCGCTGGTTC GATCCCCGTC ACCCCGGCCC CCGTCAGCTC GTCTCCCTCC GGCCCGAGCG CCTCCGGCCA CACCCCAACC GACCCGGTGT CCAACGCCGA TCCGCAGACC TCCGGCGGGG ACCGTCCCGA CAACCCCGAC CCCGGTTTCT CGGGCCGTGA CCTGTTCGGC GGTGACCCCC GCGCGTGA
|
Protein sequence | MIERGATVSG ELVVGEAVSV RALGFWRAVE LFDQPELGEV ERVRAGEPLP WDAEHPLARR ELPSGTEWRH VVHVGIHPRG KALAVLAGAF GAPTPDADGD GALARFAVDD SGRALLGSAA LSAAAWATGE VGLNGAAVLG RLTGFAQAQE RFAAEFRATA LGRLGHTELA ACEDAAVRAT GFSPADAEIG VTSTAVPRGS GTAWLPEPTS PFLAELDAAE PTAPALRDHL AGEPPEPGPA DADAVHELVV GVLVRRARDL AELGHPREAF SGLRFRWLTA DRAREVSAWR PELAGGGVLA VSADETTARA AGAAVRPAPA AADGATGHDW PLALTLVDRA DRERFDEELW TGGAGLLDLL KTWEDNGPRR PWSDAVAAFR AARLLVDDLH ARRAAASRAV DREPVLREEL DRARAGLTAA IARLAEVRAH RDQLAQVERA AVDAHAKLEQ EAAQQFAAIQ HQVWDWNQEL ERRKGEHREH RKLRPALWKR VAKQDSDQHL WSWRDTWLSD RVKLAEAELK RLHATPPPQA PPPPPRTPGL DAAEQAVRAA VLAQADHEWV ISERTAELAR VEELLPVAAE LFAGRERAEP WTDPEWTAAR DALLLAALEL HREFLGHEAR AVRRNLQAVA DLIAGESSPE MPDGAVGEAW RTLFLVTPLI SVTAESGARL LAGAGPDSLG WLLLDRAVPP ARAVGALRRA RRVLAVGDGE PGTTPVERLL GPRWAALEAA AAAAATAALA AAAASAAPAH GAVGNPAPGQ GFGGPGAGPG GGLPGGGLPR SGLPGSGVAG HGLPGNGVAG SGLPGGGAAG PGLPGAGAPG AGMPGAGSAG SGLSGTWMTG GGAPGGGVPN SGTFSPGTPN PGTLNPGARV PGVPTPGTLS PGARVPGASF PGTLNPGARV PGAPGGPMPT GQAFGGPAPT GQVPAGSIPV TPAPVSSSPS GPSASGHTPT DPVSNADPQT SGGDRPDNPD PGFSGRDLFG GDPRA
|
| |