Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1043 |
Symbol | |
ID | 9155183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1069362 |
End bp | 1071011 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | carboxyl transferase |
Protein accession | YP_003646015 |
Protein GI | 296138772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.282671 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGTG CCTCCACCAC CGACGGCGCG GCTCCCGGCG CTGACGGCGC CGAGCCCAGC ATCCACACCA CCGCTGGCAA GCTGGCCGAC TTCCGCAAGC GTGACGCGCA GGCCCTGGCC CCCATGGGGC AGGCCGCGAT CGACCGCGTG CACGAGAAGG GCCGCCTCAC CGCCCGCGAG CGCGTCCTGG CGCTGCTCGA CGAGGGTTCC TTCGTCGAGC TCGACAAGCT GGCGCAGCAC CGCTCCACCA ACTTCGGCAT GGGCGCCAAG CGCCCCGTGG GCGACGGCGT GGTGGCCGGC TACGGCACCA TCGACGGCCG CGAGGTGTGC GTCTTCAGCC AGGACTCCAC CGTCTTCGGC GGCTCGCTCG GCGAGGTCTA CGGCGACAAG ATCGTCAAGG TCATGGACCT GGCCACTAAG ACCGGCCGCC CGCTGGTGGG CATCATCGAC GGTGCCGGCG CGCGCATCCA GGAGGGCGTG GTCTCGCTCG CCCTGTACTC GAAGATCTTC TTCCGCAACA CCCAGGCGTC GGGCGTCATC CCGCAGATCT CGGTGATCAT GGGCGCGGCC GCCGGTGGCC ACGTGTACTC CCCCGCCCTC ACCGACTTCG TGGTCATGGT CGATCACGAG AGCCAGATGT TCATCACCGG CCCGGACGTG ATCAAGTCCG TGACCGGCGA GGAGGTCACC AAGGAGGAGC TGGGCGGCGC CCAGACCCAC ATGAGCAAGT CGGGCACCGC GCACTACGTG GCCTCCGGCG AGCAGGACGC ACTCGACTAC GTGCGCGAGC TGATGACCTA CCTCCCCTCG AACAACCGCG CCGAGGCGCC GCGCTTCGTG CCGACCGACC CGCTCACCGG GTCGATCGAG GAGTCGGTCA ACGACGAGGA CCGCGAGCTC GACACGCTGA TCCCGGATTC GCCGAACCAG CCGTACGACA TGCACGAGGT GATCCGCCGC CTGCTCGACG ACGACGAGTT CCTCGAGATC CAGCCGCAGC GCGCGATGAA CATCATCGTG GGCTTCGGCC GGATCGACGG CCGCAGCGTG GGCCTGGTGG CGAACCAGCC CACCCAGCTG GCCGGTTGCC TCGACATCGA CGCCTCGGAG AAGGCCGCGC GCTTCGTGCG GTTCTGCGAT GCCTTCAACA TCCCGATCAT CACCCTGGTC GACGTGCCCG GCTTCCTGCC GGGTACGGGC CAGGAGTACG ACGGCATCAT CCGCCGCGGT GCGAAGCTGC TCTACGCCTA CGGCGAGGCC ACCGTCCCGA AGATCACCGT GGTCACCCGC AAGGCCTACG GCGGCGCGTA CTGCGTGATG GGCAGCAAGG ACATGGGCGC CGATATCAAC CTGGCGTGGC CCACCGCCCA GTTCGCGGTG ATGGGCGCCT CGGGCGCCGT CGGCTTCGTG TACCGCAAGG AGCTGGCCGA GGCCGCCGAG AAGGGCGAGG ACGTCGACGC CCTGCGCCTG AAGCTCCAGG AGGAGTACGA GGACACCCTG GTCAACCCGT ACGTGGCGGC CGAGCGCGGC TACGTCGACG CGGTGATCCC GCCCTCGCAC ACCCGCGGCC AGATCGTGAC GGCGCTCAAC ATGCTCGAGC GCAAGGTGGC GATCACGCTG CCGAAGAAGC ACGGGAACAT CCCGCTGTGA
|
Protein sequence | MTSASTTDGA APGADGAEPS IHTTAGKLAD FRKRDAQALA PMGQAAIDRV HEKGRLTARE RVLALLDEGS FVELDKLAQH RSTNFGMGAK RPVGDGVVAG YGTIDGREVC VFSQDSTVFG GSLGEVYGDK IVKVMDLATK TGRPLVGIID GAGARIQEGV VSLALYSKIF FRNTQASGVI PQISVIMGAA AGGHVYSPAL TDFVVMVDHE SQMFITGPDV IKSVTGEEVT KEELGGAQTH MSKSGTAHYV ASGEQDALDY VRELMTYLPS NNRAEAPRFV PTDPLTGSIE ESVNDEDREL DTLIPDSPNQ PYDMHEVIRR LLDDDEFLEI QPQRAMNIIV GFGRIDGRSV GLVANQPTQL AGCLDIDASE KAARFVRFCD AFNIPIITLV DVPGFLPGTG QEYDGIIRRG AKLLYAYGEA TVPKITVVTR KAYGGAYCVM GSKDMGADIN LAWPTAQFAV MGASGAVGFV YRKELAEAAE KGEDVDALRL KLQEEYEDTL VNPYVAAERG YVDAVIPPSH TRGQIVTALN MLERKVAITL PKKHGNIPL
|
| |