Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4269 |
Symbol | |
ID | 9158456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014159 |
Strand | + |
Start bp | 19904 |
End bp | 21115 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | transposase IS111A/IS1328/IS1533 |
Protein accession | YP_003649174 |
Protein GI | 296141932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGTC CGGATCGATG GTGGGTGGGA GTTGACGTCG GCAAGGAGTT CCACTGGGTC GCGGTCTGTG ATGACGCCGG CAAGGTGGTC TCCTCCCGCA AGGTGGTCAA CGACGAAGCT GCGATCGCCA CGGTGATCGC CGAGGTTGAT GGCCGCGGCG GAACGGTGTC GTGGACTGTG GACTTGATCA GCCCCTATGC GACGTTGTTG TTGACGATGC TCGCCGCCGC AGGGCACTCG GTGCGGTACC TGACCGGCCG CGCGGTGTGG CAGGCATCGG TGGTTTACCG CGGCGGCGAA GCGAAGAGCG ATGCCAAAGA CGCCCGCGTC ATCGCTGATC AGGGACGGAT GCGTGGTGAT GATCTGCCAC TGCTGACTCC GGCTGATGGC CTGGTCACGG AGCTGGCGAT GCTCACCGCC CACCGGTCGG ATTTGGTCGC GGACCGCACC CGCACGATCA ACCGGCTGCG TCAGCAGCTG GTCTCGGTAA GTCCCGCGCT CGAGCGCGCC GCGGAACCGA CTGCCGACCG TGGCTGGGTG AGGCTGCTGG CCCGCTACCA GCGCCCGAAA GTGCTGCGCC GAACTGGTGT CGTAAGGCTG ACCCGCATGC TCTCCGACGC TGGTGTGCGT AACGCCGGCA AGATCGCCGA AGCTGCAGTC GAGGCGGCGA AAGCGCAAAC CGTGGCACTG CCCGGGGAGG ATGTCGCCGC CGCGCTGGTT GCTGAGCTCG CGCAGGGGGT GATCGACCTC GATACGCGGA TTAAGAACGT CGATGCCGCC ATCGAGGAGC GATTTCGCCG ACACCCTCTG GCCGAAGCGA TCGTGAGCCT GCCCGGCATG GGCTTTCGTC TCGGTGCTGA ATTCCTCGCT GCCGTCGGAG ATCCCAGCCG AATCGTCTCC GCTGATCACC TCGCGGCGTG GGCCGGCCTG GCCCCGGTTT CCAAGGACTC CGGCAAGCGC ACCGGCAGGC TGTGCACCCC CAAGCGCTAT CACCGCGGAC TGCGACGAGT GATGTACATG TCCGCGGTCA CCGCCACCCG CTGCGACCCC GAATCCCGGG CCTACTACCA ACGCAAACGG TCCCAGGGAA AGAAACACAT TCCCGCGACG ATCTGCCTGG CCCGAAAGCG CGTCAACGTT CTCTACGCCC TCATCCGCGA CAACCGCACC TGGCAGCCCA CCGCACCGCA AATCACGGCC TCGGCTGCTT GA
|
Protein sequence | MAGPDRWWVG VDVGKEFHWV AVCDDAGKVV SSRKVVNDEA AIATVIAEVD GRGGTVSWTV DLISPYATLL LTMLAAAGHS VRYLTGRAVW QASVVYRGGE AKSDAKDARV IADQGRMRGD DLPLLTPADG LVTELAMLTA HRSDLVADRT RTINRLRQQL VSVSPALERA AEPTADRGWV RLLARYQRPK VLRRTGVVRL TRMLSDAGVR NAGKIAEAAV EAAKAQTVAL PGEDVAAALV AELAQGVIDL DTRIKNVDAA IEERFRRHPL AEAIVSLPGM GFRLGAEFLA AVGDPSRIVS ADHLAAWAGL APVSKDSGKR TGRLCTPKRY HRGLRRVMYM SAVTATRCDP ESRAYYQRKR SQGKKHIPAT ICLARKRVNV LYALIRDNRT WQPTAPQITA SAA
|
| |