Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4191 |
Symbol | |
ID | 9158379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4316259 |
End bp | 4317917 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003649099 |
Protein GI | 296141856 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGGCGG GTTCCGCACC CGAGGAGGAT GACCGCCTTC ATGTGAGCCC GGGTCGATTC GACCCGGCCA CGGTGGCGGA GCCCGATCGC GACATTCCGA GCCACACCGA TCCGGTCGCG CGGGACTTCG CGGCGGCGCT CGGCGGCCCG GTCGGCGCAC ACGCGCTGAT CGGGTTCCAG CGGTTCTTCA CGCCGCTGCG GCTGATCCTG CTGGTCGCGG TGCTCTTTCT CGCGCTGGGC TGGACCACCA AGGCGGGCTG TCTGCAACAG AAGAACGACG GCGGCTCCCT GGTGCTGGAC TGGTCGGCGA ACCGCCCGTA CACGGCGATG TGCTACTCGG ACACCGTGCC GCTGTACTCG GCCGAGCGTC TCGACGAGGG CCTCATGCCC TACAAGACTC AGTTCTTCGA CACCGATCCC CTGGGCCAGC CGCAGGAGCG GTATATGGAG TACCCGGTGC TCACCGGGAT GTACCAGTAC GTGTCGATGC GGATCGCGAA GTTGTGGACG TACCTGCACG AGGAGTGGGG TGTTCCGGCT GCGATCGAGG TGGTGCTGTT CTTCACCGTT GCGGCCGTGG GTCTCGCACT GTTCTGGCTG ATCGCGGTGT GGGCCACCAC GCTGTTATCC GGGGCCGGTC GCCGGCCCTG GGACGCCATG CTGGTGGCCG CCTCGCCACT GGTGATCGTG CATGCCTTCA CCAACTTCGA CGCGATCGCC GTCGCCGCCA CTGCGGTCGC CCTGCTTCTC TGGGCGCGTA ACCGGCCGGC CTGGGCGGGC GTGGTCATCG GTCTGGGCGC CGCAGCCAAG TTGTATCCGG CGTTCCTGCT GGTGGTGCTG CTGTTGCTGT GCCTGCGCGC AGGCCTGCTT CGCTCCTGGG CGACCGCCGC AGCCTCGGCC GCGGCCGCCT GGCTCGCGGT GAATCTTCCG GTGCTCGCGT TGTGGCCCCA GGGCTGGTGG GAGTTCTTCC ACCGCAACTC GATCCGCGCG GTCGATATGG ATTCGATCTA CGCCGTGATC AGTTCCTTCA CCGGTGGCTG GGTATTCGGC GGTCAGGGCC CCCGAGGCGG TGCCTCGACG CTGGCCAACA TGATCACGCT GGGGTTGTTC GTGCTGGTGA TCGCGGGCGT GGCCTACTTG GCGCTGAAGG CTCCGCGTCG GCCCCGCGTG GCCCAGCTTG CGTTTCTCCT GGTGGCCGGA TTCCTTCTGG TGAACAAGGT GTGGAGTCCG CAGTACTCGC TGTGGCTGGT GCCATTGGCG GTGCTGGCGC TGCCGCACAC CCGGATCCTG TTGGCGTGGA TGACGATCGA TGCGCTGGTG TGGGTGCCGA GGATGATGTA CTTCCTCGGT GTCTCGAACA AGGGTCTGCC CGAGCAGGCC TTCACCTTCA CCGTGCTGCT GCGCGATATC GCCGTGATCG GTCTGTGTGC GTTGGTGATT CGGCAGATCT ACAAACCCGA CGAAGATCTG GTGCGGTCCA CGTTCCCCGG CCCGTTCTCA CCGCCGCTCG ACGATCCCGC AGGCGGACCG CTCGACGGTG CGCCCGATGT TCCGCTGTCG GAATCGCTGC GCAGTCGCAG CGGACGGCCG GGCAGGTACC AGAGGAAGAA CAGCAGCATC GCCGCGCGGC CCCAGACGCC GATCCAGACG CCCCACTGA
|
Protein sequence | MTAGSAPEED DRLHVSPGRF DPATVAEPDR DIPSHTDPVA RDFAAALGGP VGAHALIGFQ RFFTPLRLIL LVAVLFLALG WTTKAGCLQQ KNDGGSLVLD WSANRPYTAM CYSDTVPLYS AERLDEGLMP YKTQFFDTDP LGQPQERYME YPVLTGMYQY VSMRIAKLWT YLHEEWGVPA AIEVVLFFTV AAVGLALFWL IAVWATTLLS GAGRRPWDAM LVAASPLVIV HAFTNFDAIA VAATAVALLL WARNRPAWAG VVIGLGAAAK LYPAFLLVVL LLLCLRAGLL RSWATAAASA AAAWLAVNLP VLALWPQGWW EFFHRNSIRA VDMDSIYAVI SSFTGGWVFG GQGPRGGAST LANMITLGLF VLVIAGVAYL ALKAPRRPRV AQLAFLLVAG FLLVNKVWSP QYSLWLVPLA VLALPHTRIL LAWMTIDALV WVPRMMYFLG VSNKGLPEQA FTFTVLLRDI AVIGLCALVI RQIYKPDEDL VRSTFPGPFS PPLDDPAGGP LDGAPDVPLS ESLRSRSGRP GRYQRKNSSI AARPQTPIQT PH
|
| |