Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3362 |
Symbol | |
ID | 9157537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3461431 |
End bp | 3462621 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003648285 |
Protein GI | 296141042 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGCC GGCGGTCCGC CGCCGCGCTC GCCGCAGGCA CGGTGGGCGG AGGCCTGATC GGTGCCACCG GCGTGTACCT GCTGCGCGCC GGACGGGGCC TGAGGCGGTC CATGGGGGCG CAGGCCGACA CCGTGTACAC CAATTCCGAT CCCGACAGTC CGCACGAGCG GTTCGAGCCG GGGAAGATCC TGCGCGGCGT TCTCGGCCGC GCCCGGCACG GCCAGGGGCG CCCGTCGCGG CCCATTCCCC TGGTCGCGCC GCTGAGCCCC GTGGACGCCG CGGATCTGGC CGTCACCTGG TACGGGCACT CCTCGGTGCT CATCGAGATC GACGGCTTCC GGGTTCTGGC CGATCCGGTC TGGGGTGAAC GGGTGTCACC GTCGCCCACC ATCGGCCCGT CCCGGCTGCA TCCCGTTCCC GTGGCGCTGA GCGCGTTGCC GCCGATCGAT GCGGTGATCA TCAGCCACGA CCACTACGAC CACCTCGATC TGCCGACCAT CGACGAGCTC ACCCGCGACC GGGACGTGCC CTTCTGCGTG CCCATCGGCG TCGGCGGGCA CCTCCGGGCG TGGGGTGTGC CCGAGGACCG GATCATCGAA CTCGACTGGG ACCAGAGCCA CACCCTCACG CGCGACGACG GGCACGACGG CACCGACGAG CTCAAGGTGG TGTGCACCGA AGCACGGCAC TTCTCCGGCC GCGGGCTGAC CCGGAACTCC ACCCAGTGGG CGTCGTGGTC GCTGGTGGGG CGGAAGGCGT CGGGCCAGGG GCACTCCGTG TTCTTCGGGG GCGACACCGG GTACACGGAG CGATTCAAGC TGATCGGCGA CCATTTCGGC CCGTTCGACC TCACGCTGTT GCCGATCGGT GCCTACGATC CGCTGTGGCC CGATGTACAT ACCAATCCGG AGGAGGCCGT CGCGATCCAC CAGATGATCG CGGGGCCCAA GGCGCCGCTG GTACCGGTGC ACTGGGCCAC CTTCAATCTC GCCTTCCACG ACTGGTCGGA GCCGGTCGAG CGACTATTGG TGGCCGCGAA GGACGCCGGT ATCACCACGG TGGTTCCCAA GCCCGGTGGC AGGGTCGACG GCATCGCGGC GGCGGCGGGC CGGATTCCGC AGGTAGATCA CCACAGTGGA GATTCTTCGG GAACGCGAAG CAATGGAGAC TGGTGGACCG AGGTCGGCTA G
|
Protein sequence | MPRRRSAAAL AAGTVGGGLI GATGVYLLRA GRGLRRSMGA QADTVYTNSD PDSPHERFEP GKILRGVLGR ARHGQGRPSR PIPLVAPLSP VDAADLAVTW YGHSSVLIEI DGFRVLADPV WGERVSPSPT IGPSRLHPVP VALSALPPID AVIISHDHYD HLDLPTIDEL TRDRDVPFCV PIGVGGHLRA WGVPEDRIIE LDWDQSHTLT RDDGHDGTDE LKVVCTEARH FSGRGLTRNS TQWASWSLVG RKASGQGHSV FFGGDTGYTE RFKLIGDHFG PFDLTLLPIG AYDPLWPDVH TNPEEAVAIH QMIAGPKAPL VPVHWATFNL AFHDWSEPVE RLLVAAKDAG ITTVVPKPGG RVDGIAAAAG RIPQVDHHSG DSSGTRSNGD WWTEVG
|
| |