Gene Information |
Locus tag | Amir_6846 |
Symbol | |
ID | 8331066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 7981923 |
End bp | 7983773 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 644947278 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_003104488 |
Protein GI | 256380828 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
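The length fields in this record are consistent with 1-based, inclusive coordinates: the gene length equals End bp minus Start bp plus one, and the protein length equals the codon count minus one terminal stop codon. The sketch below checks that arithmetic. The function names are hypothetical and are not part of any database API; the inclusive-coordinate convention and the single stop codon are assumptions that happen to match the values shown here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(7981923, 7983773)
print(length_bp)                  # 1851
print(protein_length(length_bp))  # 616
```

Both results agree with the Gene Length (1851 bp) and Protein Length (616 aa) fields.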
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.243842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGGCTCA CCACCGCGCA GGCACTGGTC CGCTGGCTGC TCGCCCAACG CACCGAGCTG CTCGACGGGA CCGAGGCCCC GCTGTTCCCC GGCGTCTTCG CGATCCTCGG CCACGGCAAC GCGCTCGGCC TGGGCACCGC GCTGGAGGAG GTCCGGGACC GCCTGCCGGT GTGGCGCGGG CACACCGAGC AGGGCATGGC GCTGGCCGCC ACCGCGATCG CCAGGGCGAC CCACCGCCGC CAGGTCGGCG TGGTCACCAC CTCCCTCGGC CCCGGCGCGC TCAACGCCGT CACGGCGGCG GGCGTGGCCC ACGCCAACCG CCTCCCGCTG CTGCTGCTCC CCGGCGACAC CTTCACCGGC CGCGCCCCCG ACCCGGTGCT CCAGCAGGTC GAGCACTTCC ACGACCCGAC CACCACGGCG AACGACGCCT TCCGCCCGGT CGCCCGCTAC TTCGACCGGA TCACCCGCCC CGAGCAGCTG ATCGCCACCC TGCCGCAGCT CGCCCGCGCG CTGACCGACC CGGCCGACTG CGGCCCCGCC GTCCTGGCGC TGCCGCAGGA CGCGCAGGTC GAGGAGCACG ACTTCCCGGA CCCGCTGTTC GAGCCGGTCC TGCACCGCCC GCTCCGCCCG CGCCCCGACC GCGTCGAGCT GGCCTCGGCG GCCGGGGCGC TGTCCCGGTC CACCCGCCCG CTGCTGGTGC TGGGCGGGGG CGTCCGCTAC TCCGGCGCGG CGGCCGAGGC GCTCGCGCTC GCGGAGCGCC ACGGCGTCCC GGTCACCGAG ACCACGGCGG GCCGCACCCT CGTCCCGCAC GACCACCCGC TGCACGCCGG TCCGCTGGGC ATCACCGGTT CGGCGTCCGC GAACCGGCTG GCCGCCGACG CCGACGCCGT CCTCGCGGTC GGCACCCGCC TGCAGGACTT CACCACCGCC TCCTGGACGG CCTTCGCCCT CACGGCCGCG CTGGTCAACG TCAACACCGC CCGCTTCGAC GCGGTCAAGC ACGGCGCGAC CGCCGTCACC GGGGACGCCC TGGAGGCCCT GCGCGAGCTG GGCGGGGCGC TGGCGGGCTG GCGCGCGGAC CCCGCGTGGA CCACGAGGGC GCGCGCCGAG CGCGAGGTGT GGGACGCCCG CGTCACCGCC CTGCGCGCCC CGAGCCCCGG CCTGCCCACG TACGCGCAGG TCGTGGGCGT GGTCAACGAC CTGTCCGAGG ACGGCGACTA CGCGCTGACC GCGTCGGGCG GCCTGCCGGG CGAGCTGGTC GGCGGCTGGC GGGCGCGCGG CGGCGAGCCG GAGCTGGACG TCGAGTACGG CTTCTCCTGC ATGGGCTACG AGCTGGCGGG CGCGTGGGGC GCGGCGATCG CGCGGGAGCG GGGCGTGGTG ACCACCCTGC TGGGCGACGG CTCGTACCTG ATGCTGAACT CCGACCTGTT CTCCGCCGCG TTCGCCGGGC ACGGGTTCGT GGCGGTGGTG TGCGACAACG GCGGCTACGC GGTGATCGAC CGCCTCCAGC GCGACCAAGG CGCGCGGCCG TTCCACAACC TGTACGCGGA CGTCCGCACC GGTCACGCCG ACCCGCCGTC GGTCGACTTC GCGGCCCACG CCGCCGCGCT GGGCTGCTCG GTCCACCCCG CGCCGGACCT GGTCGCGCTG CCGAACGCCT ACCGCGAGGC GAGGAGGGCG GCGGTGGCCG GGAAGCGGCC CGCCGTGGTG GTGATCCGCA CCCACCCGTC GTCGTGGACG CAGTCGGGCG CGTGGTGGGA GGTCGGCGTG CCCGAGTCCC TGGCGGGACG CCGGGACCAC GTCGAGGGCA AGCGCCGCCA GGTCAGGCAC CTGGGGCGGG GGCACCGGTA G
Protein sequence | MRLTTAQALV RWLLAQRTEL LDGTEAPLFP GVFAILGHGN ALGLGTALEE VRDRLPVWRG HTEQGMALAA TAIARATHRR QVGVVTTSLG PGALNAVTAA GVAHANRLPL LLLPGDTFTG RAPDPVLQQV EHFHDPTTTA NDAFRPVARY FDRITRPEQL IATLPQLARA LTDPADCGPA VLALPQDAQV EEHDFPDPLF EPVLHRPLRP RPDRVELASA AGALSRSTRP LLVLGGGVRY SGAAAEALAL AERHGVPVTE TTAGRTLVPH DHPLHAGPLG ITGSASANRL AADADAVLAV GTRLQDFTTA SWTAFALTAA LVNVNTARFD AVKHGATAVT GDALEALREL GGALAGWRAD PAWTTRARAE REVWDARVTA LRAPSPGLPT YAQVVGVVND LSEDGDYALT ASGGLPGELV GGWRARGGEP ELDVEYGFSC MGYELAGAWG AAIARERGVV TTLLGDGSYL MLNSDLFSAA FAGHGFVAVV CDNGGYAVID RLQRDQGARP FHNLYADVRT GHADPPSVDF AAHAAALGCS VHPAPDLVAL PNAYREARRA AVAGKRPAVV VIRTHPSSWT QSGAWWEVGV PESLAGRRDH VEGKRRQVRH LGRGHR
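Both sequences are displayed in blocks of ten characters separated by spaces, so the spacing has to be removed before any length, GC content, or reading-frame check. The gene sequence as displayed begins with ATG and ends with TAG, so it already reads in the coding direction even though the gene lies on the minus strand. The following is a minimal sketch of that clean-up. All function names are hypothetical, and the start and stop codon sets are an assumption covering only the common codons of translation table 11, not the full table.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing (blocks of ten) and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumed: common bacterial start codons only
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """Rough check: whole codons, a recognised start codon, and a terminal stop."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First two display blocks of the gene sequence above.
fragment = clean_sequence("ATGAGGCTCA CCACCGCGCA")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.65
```

Applied to the full cleaned gene sequence, `len` should match the Gene Length field and `gc_fraction` can be compared with the GC content field, which is reported as a rounded percentage.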