Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1708 |
Symbol | |
ID | 9155858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1782713 |
End bp | 1784272 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003646665 |
Protein GI | 296139422 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0664589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGGTT CGAATATCGC CGAGGCGGAA CAGGCCTCCG ACTATGAGCC GTTCTGGCGC GATTTCTCCG CCTTCGATCG CACCTCCGTC GTCGAGCTGA TCGACGAGTC AACGCGGATC ATCAACCACG AGGTAGCGCG GCGGTGCGCC TGGATCGCGA CGCGACACGA CCAGATTTAC GCCGAATGGG CTGATTCATC GGCTGTTCTG GGGCCGGAAG GTGACTGTCG GGCCGGGTTC ACCGATGAGT TCGATCAGCT GGTCGGAGAG GTCGGCGCGG TGATCGGTCG CGGCGCCGGC GCAGCCCGTG CCTTGATCGA GCTGAGTCTC GCGCTGCGCG ACCGCACCCC ACTGGTGTTC GACATGCTCT ACGAGGGCAG GATCGGCGAG ACCATCGCCC GCGCCGTCGT CACCCGCGCC GCGGCGATCA CGGATCCGGC CGTGATGCAT AGCTACGACC AGGCCGTCTC CGGGCTCCTC GGCTGCAAGC TGGCCCGCGG GCGGGCCGTG GTCTCGGAGA GCGCCGCTCG CCGACTTGCC GACGGCGTGC TCGCCACGAT CGATCCCGAT GCTGTGCCGG TTCCGGCGAG TGCTCGGCGG CGCGGGGTGT GGTTCGACGT TCGCCGTGAC GGGCTGACCG AGATGGCCGC AGTCTTGTTC TGTGAGGACG GGGCCTACCT GGATCGCGAG GTCGAGCGCA TCGCCACCAC GGTGTGCGGC AAGGATCCGC GGAGTCTGGG CGAACGGCGG GCCGATGGTC TGGTTGCCTT GGTGCAGGGC TACGAGACAC TGGGGTGCCG CTGCGAATCG GACGGTTGCG AGGTGCAGGC GCAGGTGCTG AAACATGCTG CTGTCGAGGC GAAGACGGTG GCCGATGTGA CTGTGGTGCT CAACGAATCC ACCGTGGCCG GCGCAGATGA TGCGCCCGCG CTGATCGGTG GGCAGCCGGT TCCCGCCGCC CTGGCGCGCG AGGTGATCGC TCGCGTCGGC GAGGCCCGAT TCCGCTCGCT CGGGAAGCCT ACGGGTGCGC GGGTGTGCGC CCACGACGTT GCAGGCTACC GCCCCTCGGC ATTGCAGGCT GACTTCCTCA AGATCCGCTA CCCGGAGTGC GTGTTTCCCG GCTGCGCCGT GCGATTCGAC GCGTGCCAGC TCGATCACGT CACCGAATGG AACCATCGCG ACCACGCCGT GGGCGGGAAG ACGCGGATCG GGAACCTCGT TCCGCTGTGT CCGCGACATC ATCGGCTCAA GACCGAGCAG AACTGGCTTT CCGATGTGCT CCCCGGTGGA GAAGTGGAGT GGCACACGCC CACCGGACAC GTCTACCGCA CGGCTGTGGT CACCGGCGAC GATCTGTTCC CCGACCTGGA CCTGATCGAA TGGCTCGCAC CAGTACGACG CCTGAAACCC GCGGTGCCCC GCTCGAATGG TCCCACTCGG GTGGAGCTGC GTAACGCTGG GCGTGAGGCT CGGCGCGAGA AGTACCGCCG TGCACGGGAG ACGCTGCGCG ACGAACATCC CGGCGATCCG CGCTGGAACG ATGAGCCGCC GCCCTACTGA
|
Protein sequence | MHGSNIAEAE QASDYEPFWR DFSAFDRTSV VELIDESTRI INHEVARRCA WIATRHDQIY AEWADSSAVL GPEGDCRAGF TDEFDQLVGE VGAVIGRGAG AARALIELSL ALRDRTPLVF DMLYEGRIGE TIARAVVTRA AAITDPAVMH SYDQAVSGLL GCKLARGRAV VSESAARRLA DGVLATIDPD AVPVPASARR RGVWFDVRRD GLTEMAAVLF CEDGAYLDRE VERIATTVCG KDPRSLGERR ADGLVALVQG YETLGCRCES DGCEVQAQVL KHAAVEAKTV ADVTVVLNES TVAGADDAPA LIGGQPVPAA LAREVIARVG EARFRSLGKP TGARVCAHDV AGYRPSALQA DFLKIRYPEC VFPGCAVRFD ACQLDHVTEW NHRDHAVGGK TRIGNLVPLC PRHHRLKTEQ NWLSDVLPGG EVEWHTPTGH VYRTAVVTGD DLFPDLDLIE WLAPVRRLKP AVPRSNGPTR VELRNAGREA RREKYRRARE TLRDEHPGDP RWNDEPPPY
|
| |