Gene Information |
Locus tag | Tpau_0215 |
Symbol | |
ID | 9154349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 228531 |
End bp | 229688 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function DUF1006 |
Protein accession | YP_003645208 |
Protein GI | 296137965 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATCA GCCAATCGCA GGCGCGGCGG ATCGCCCTCG CCGCGCAGGG ATTCGAGCCC GGCTTCGCCG GTGCCTCGAC GCCCACGATG CGGCACGTCC AGAAGACCAT CGACCGCCTG AAACTGATCC AGATCGACAG CGTCAACGTG GTGGCCCGCA GTCAGTACCT GCCGGTTTTC GCACGGCTCG GCGACTACGA CACGACACTG CTCGACCGCG CCCGTGACAC GGCGCCGCGG CGGCTCGTCG AGGCCTGGGC GCACGAGGCC TCGCTAGTCC CGCCGCAGAC CTGGCCGCTG CTGCGGCACC GTCGGACCGC GGACCGGGTG GCGCAGCGGT TCGCGAGTTA CGACGCCCGG CACCCCGGCC AGTTGGACCG GCTGCGCGGC GCGCTCGCCG AGTTGCCGCC GCTCACCGCC CGCGCGCTCG AGGCGCACCT CGAGCACGAG CAGGTGGTGG AGAAGACGCA CTGGGGCTGG AACTGGTCGT CGGTGAAGGA GGGACTCGAG GTGCTCTTCC ACGCCGGCGA GGTCACGAGT GCCGGCCGCA CCAGCCAGTT CGAGCGACTC TACGCGCCCA CATCCACGGT GCTCGGTGAG CTCGCCGAGC ACGAGGTCTC CGACGAGGAC GCCTACGTCG AGCTGATCCG GCAATCGGCG AAGGCGCACG GCATCGGGAC GCTGCGGTGC CTGCGCGACT ACTTCCGGCT GACCACCGCG CAGGCCGCAC CGGCAGTGGA GAAACTCGTG GCCTCAGGCG AACTCGTGCC GGTGCAGGCC GAGTGGTGGC CCGGCACCGT GTACCTGCAC GCGGAGGCCA AGCGGCCCCG GAGCATCGCG GCACGAGCGC TGCTCTCGCC CTTCGACCCC GTGGTGTGGC AGCGCGAGCG GGCCGAGGCG CTGTTCGACT TCTTCTACCG GATCGAGATC TACACGCCCA AGGAGAAGCG GGTGCACGGC TATTACGTGT TGCCCTTCGT GTTCGGCGAC CGGATCGTGG CGCGCTGCGA TGTGAAGGCG GACCGCAAGG CCTCCGAGCT GCTGGTGCAC ACCACCACGT GGGAGCCGGG CGGGCGCGAC GCCGCATCCG AGACTGCGCT GGAGGAAACG GTGTCCGAGA TGGCCCACTG GCTGGGATTG GAAAATTACC GGTTCTAG
|
Protein sequence | MRISQSQARR IALAAQGFEP GFAGASTPTM RHVQKTIDRL KLIQIDSVNV VARSQYLPVF ARLGDYDTTL LDRARDTAPR RLVEAWAHEA SLVPPQTWPL LRHRRTADRV AQRFASYDAR HPGQLDRLRG ALAELPPLTA RALEAHLEHE QVVEKTHWGW NWSSVKEGLE VLFHAGEVTS AGRTSQFERL YAPTSTVLGE LAEHEVSDED AYVELIRQSA KAHGIGTLRC LRDYFRLTTA QAAPAVEKLV ASGELVPVQA EWWPGTVYLH AEAKRPRSIA ARALLSPFDP VVWQRERAEA LFDFFYRIEI YTPKEKRVHG YYVLPFVFGD RIVARCDVKA DRKASELLVH TTTWEPGGRD AASETALEET VSEMAHWLGL ENYRF
|
| |
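The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, which agrees with the values listed here, and that the gene length includes the stop codon. The function names (gene_length, protein_length, translate) are hypothetical and chosen only for illustration. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used for the body of the sequence.

```python
# Codon table built in the conventional T, C, A, G base order.
# Table 11 (bacterial) uses the same codon-to-amino-acid assignments
# as the standard code; only the set of start codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, ignoring whitespace between blocks.

    Translation stops at the first stop codon; a trailing partial codon
    is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)
```

With the values from this record, gene_length(228531, 229688) gives 1158 and protein_length(1158) gives 385, matching the Gene Length and Protein Length fields. Passing the first three 10-base blocks of the gene sequence to translate yields MRISQSQARR, the first block of the listed protein sequence.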