Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3142 |
Symbol | |
ID | 9157313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3250775 |
End bp | 3252745 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF255 |
Protein accession | YP_003648068 |
Protein GI | 296140825 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTAATC GACTCGGCGC GTCCACCAGC CCGTATCTTC GCCAGCACGC CGACAACCCG GTGCACTGGC AGGAGTGGTC GGATGCCGCG CTGGCCGAAG CGGCGGAGCG CGACGTGCCG ATCCTGCTCT CGATCGGCTA TGCGGCGTGC CACTGGTGCC ATGTGATGGC GCACGAGTCG TTCGAGAACG AGGCCATCGC GGCGCAGATG AACGAGGGCT TCGTGTGCAT CAAGGTCGAC CGCGAGGAGC GTCCCGACCT CGATTCGATC TACATGAACG CCACCGTCGC GATGACCGGA CAGGGCGGCT GGCCCATGAC CTGTTTCCTC ACCCCCGGCG GTGCCCCGTT CTACTGCGGC ACCTACTACC CGCCCGAACC CCGCAACGGG CAGGCGAGTT TCCCGCAGTT GCTCGACGCG ATCACCGATA CCTGGCGCGA GCGCCGCGGC GACGTCGACC GCGTGTCCGA CCAGGTGGCC GGGCATCTGC GCCAGGCCTC GTCGGGCCTG CCCGACGGTG CGCCGCCCAG CCCCGGCGAC CTCGCCACCG CGGTCGCGAC TCTGGTCGCC GACGAGGACC CGTCGGGCGG CTTCGGTCGG GCACCGAAGT TCCCTCCGTC AGCCACCGTC GAAGCCCTGC TGCGCCACCA CGAGCGCACC GGCGACGCCG CGGCCTATGC CGTGGCCGTG CGGTGCGCCG AGGCGATGGC CCGCGGCGGC ATCTACGACC AGCTGGGTGG CGGTTTCGCC CGCTACGCCG TGGACGCCGA CTGGGTGGTG CCGCATTTCG AGAAGATGCT CTACGACAAC GCCTTGCTGC TGCGGTTCTA CGCGCACCTC GGCCGCCGCA CCGGCTCGCC GCTCGCGTTG CGGGTGGCCG ATGAGACCGC CGACTTCCTC GTTCGAGACC TGGGCGTCCA CGGTGCGTTC GCGTCGTCGC TGGATGCCGA TACCGAGATC GACGGCCATG GGGTGGAGGG CGCGACGTAC GTGTGGACGC CGTCTCAGCT GGCCGAGGTG CTCGGCGACG ACGACGGAGC CTGGGCCGCT GAGGTTTTCG CGGTCACCGA CGGCGGCACT TTTGAGCACG GCACCTCCAC CCTGCAACTG CGCGGCGATC CGGACCCGGC TCGTCTCGCC GACGTGCGCC GAAAGCTGTT CCACGCCCGT CAGTCCCGCC CGCAGCCCGC ACGAGACGAC AAGGTGGTCA CGGTGTGGAA CGGGCTCGCG ATCACCGCAC TCGCCGAAGC GGGCCGCGTT CCCGACGCCG CCGCGTGCGC TCGCTACCTA CTGGAGAAGC ACTGGAACGG TGCGACTCTG CGACGCAGCT CGCTCGACGG TGTTGCGGGC GAGGCGCAGG GCATGTTGGA GGACTACGCC GCCCTCGTCA CCGGGCTGCT CGCGTTGCGT CAGCGCACCG GTGACACAGC GTGGTCCGAC GATGCCGCAG TGATCCTCGA TGCGGCGATC GACAAGTTCG CCGACCCGGA CGCGTCCGGC GGCTGGTACG ACGCACCGTC GGACGGTGAG GCCCTGCTCA CCCGCCCGCG GGACCCGGCC GACGGCGCCA CTCCGTCGGG CGCGAGCCTG ATCGCCGAGG CGCTCACCTA CGCCGAGGTG TTGCTGGGGG AACGGTACGC GGGCCTCGCC GCCGCCACCC GCGCCCGGTC GGGGGAACTG CTCCGCCGCG TGCCCCGCGG GGCAGGACAT CATCTCGCCG TGGCCGAGCA GGCGACGGCC GGCCTGCAGA TCGCGGTGGC AGACGGTGCC GGCGCCGCAG CGCTCACCGC CGAGGCTCGG CGGCTCGCGC CCGGTGGCGC CATAGTTGTT GCTGGAGCTA AAGACTCGCA GGTGCTACTC GAGGGGCGCG GCCCCGTGGA CGGTCGTGCC GCGGCCTATG TGTGCCGCGG CACCGTCTGC TCCCTGCCGG TGACCGACGG TGAGACGCTC GCCATAGAGC TCGCCCCTTA G
|
Protein sequence | MSNRLGASTS PYLRQHADNP VHWQEWSDAA LAEAAERDVP ILLSIGYAAC HWCHVMAHES FENEAIAAQM NEGFVCIKVD REERPDLDSI YMNATVAMTG QGGWPMTCFL TPGGAPFYCG TYYPPEPRNG QASFPQLLDA ITDTWRERRG DVDRVSDQVA GHLRQASSGL PDGAPPSPGD LATAVATLVA DEDPSGGFGR APKFPPSATV EALLRHHERT GDAAAYAVAV RCAEAMARGG IYDQLGGGFA RYAVDADWVV PHFEKMLYDN ALLLRFYAHL GRRTGSPLAL RVADETADFL VRDLGVHGAF ASSLDADTEI DGHGVEGATY VWTPSQLAEV LGDDDGAWAA EVFAVTDGGT FEHGTSTLQL RGDPDPARLA DVRRKLFHAR QSRPQPARDD KVVTVWNGLA ITALAEAGRV PDAAACARYL LEKHWNGATL RRSSLDGVAG EAQGMLEDYA ALVTGLLALR QRTGDTAWSD DAAVILDAAI DKFADPDASG GWYDAPSDGE ALLTRPRDPA DGATPSGASL IAEALTYAEV LLGERYAGLA AATRARSGEL LRRVPRGAGH HLAVAEQATA GLQIAVADGA GAAALTAEAR RLAPGGAIVV AGAKDSQVLL EGRGPVDGRA AAYVCRGTVC SLPVTDGETL AIELAP
|
| |