Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3582 |
Symbol | |
ID | 9157761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3690821 |
End bp | 3692167 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | domain of unknown function DUF1731 |
Protein accession | YP_003648499 |
Protein GI | 296141256 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.126499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATAC GGCGATCAGC AGTCATCGAG GCACCCGTCG ACGAGGTCTT CGCGTGGTTC TCCCGGCCCG GGGCGTTCGC CCGACTCCAG GCACCATGGT TGCCGATGCG GCTGCGGTCC CAGGCATCCT CACTCGCCGA TGGTGTCACC GTGATCGATC TTCCCGGAGG CCTGCAATGG TTCGCGCGGC ACGACCCCGC GGCCTTCGTT CCCGGGCGCC GGTTCGTCGA TGAGGTCGCG GCCGATGGCC TGCGCTCGCT GCCCGTGGGC GTGTTCCCGT GGCGCCACGT CCACGATTTC GAACCCGTCG ACGCCGCGCG CACCCGGGTG ATCGACGAGG TGCGCACCCC GGTCCCCGAG CAGTTGCTGT CCGAGGCGAT CGAGTTCCGG TACCGACGGC TCGCAGGAGA TCTGGCAGCG CACCGCCGAG CCGCGCAGGC GGGTCTGCCG CCCTCGACGG TGGCCGTGAC CGGTGCGTCG GGACTCGTCG GATCAGCGCT CACGGCGTTC CTCAGCACCG GGGGCCACCG GGTGCTCTCC TTGGTGCGGG GGGAGCCGCG CTCGGAGTCC GAGCGCGCGT GGGATCCGGC GGCGCCCGAC CCGGCGGCGC TGCGCGGCGT GGATGCCGTG ATCCACCTCG CCGGCGCAAG CATCGCCGGC CGATTCACCG ACGCCCACAA GACGCTGATC GCGGACAGCC GTATCGAGCC CACCCGGCTG CTGGCCCGCG CAGCGGCACA AGCCGGTGTG CCGGTGTTCG TGAGCGCCTC CGCGGTCGGC TGGTACGGCC GGGACCGGGG CGATGAGCTG CTCACCGAGG ACGCGGCCCC GCCCGAGGCT TCCGACTTCC TCTCCGACCT GGTGCGCCGA TGGGAGGGCG CCGCCCATGA CGGCGCCGGC GCCGACACTC GCGTGGTCAC CGTGCGCACG GGCATCGTGC AATCCCCGCA GGGCGGCACC CTGAAACTCC TCTTGCCGCT CTTCCGGGCC GGCCTCGGCG GACGGCTCGG CAGCGGGGAA CAGTGGCAGC CCTGGATCGG CATCGATGAC CTCGTCGACA TCTACCACCG GAGCCTCTGG GACGGTGGGC TTTCCGGACC GGTCAACGCC GTCGCGCCCG GCGTGGTGCG CAATAGTGCG TACACCCGCG AACTGGGCCG TGCGGTGCAC CGCCCCACCC TGCTGCCCGT GCCCTCGTTC GGTCCGGCCT TGCTCCTCGG CCGCGAAGGG GCCGAGGATC TCGCGCTCGC CTCGCAACGG GTGCGGCCGA CCGTCCTGGA GGACCGCGGA CACCCCTTCC GGCACCCGGA TCTCGGTGGC GCCCTGCGGC ATCTGCTGGG AGGCTAG
|
Protein sequence | MGIRRSAVIE APVDEVFAWF SRPGAFARLQ APWLPMRLRS QASSLADGVT VIDLPGGLQW FARHDPAAFV PGRRFVDEVA ADGLRSLPVG VFPWRHVHDF EPVDAARTRV IDEVRTPVPE QLLSEAIEFR YRRLAGDLAA HRRAAQAGLP PSTVAVTGAS GLVGSALTAF LSTGGHRVLS LVRGEPRSES ERAWDPAAPD PAALRGVDAV IHLAGASIAG RFTDAHKTLI ADSRIEPTRL LARAAAQAGV PVFVSASAVG WYGRDRGDEL LTEDAAPPEA SDFLSDLVRR WEGAAHDGAG ADTRVVTVRT GIVQSPQGGT LKLLLPLFRA GLGGRLGSGE QWQPWIGIDD LVDIYHRSLW DGGLSGPVNA VAPGVVRNSA YTRELGRAVH RPTLLPVPSF GPALLLGREG AEDLALASQR VRPTVLEDRG HPFRHPDLGG ALRHLLGG
|
| |