Gene Information |
Locus tag | Tpau_3858 |
Symbol | |
ID | 9158038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3982275 |
End bp | 3983447 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003648772 |
Protein GI | 296141529 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
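The length fields above agree with the coordinates: with inclusive positions, 3983447 - 3982275 + 1 = 1173 bp, and 1173 / 3 = 391 codons, the last of which is the stop codon, leaving 390 aa. A minimal sketch of that consistency check follows. It assumes inclusive start/end coordinates and a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3982275, 3983447)
    print(length)                           # 1173
    print(expected_protein_length(length))  # 390
```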
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.198117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGC TGAGCGGTAA GGCCGCGATC GCCGGTATCG GCGCCACCGA GTTCTCCAAG GACTCCGGCC GCAGCGAACT GCGCCTCGCC GTCGAGTGCG TCCGTGCGGC GCTGGACGAC GCCGGGCTCA CGCCACAGGA CGTCGACGGC CTGGTGACCT TCACCATGGA CACCAACTCG GAGATCGCCG TGGCCCGCGA GCTGCGGATT CCCGAGCTGA AGTTCTTCAG CCGCATCAAC TTCGGCGGCG GTGCCGCGGC GGCCACGGTG CAGCAGGCGG CCATGGCCGT GGCCACCGGC GTGGCCGATG TGGTGGTGGC CTACCGCGCC TTCAACGAGC GGTCCGGGCA CCGGTTCGGG CAGGTCTCGA ATGCCGCGGC GCAGCAGGTG AACACCAATG GCATCGACAA CGCCTTCCAC TACCCCGCCG GGATCGCGAC GCCCGCAGCC ACCGTGGCCA TGGCAGCCCG GCGATACATG CATGACTACG GTGCCACCAG TGAGGATTTC GGCCGCGTCG CCGTGCTCGA CCGCGCGCGG GCGGCCACCA ACCCGAAGGC GTGGTTCCAC GGCAAACCGA TCACCTTGGA CGATCATCAG GCGTCCAAGT ACGTCGCCGA GCCCCTGCAC CTGCTGGACT GCTGCCAGGA GTCGGACGGC GGTGTGGCGC TGGTGATCAC CAGCGCCGAA CGCGCCCGTG ACCTGAAGCA GACGCCCGCT GTGATCGCTG CGGCGGCGCA GGGCAGCGGC CCCGATCAGT ACGTGATGAC CAGCTACTAC CGCGACGCCC TCACCGGGCT GCCGGAGATG GGGGTCGTCG GTGGACAGCT CTGGTCGCAG TCGGGAATGA CACCGACGGA CATGGACCTC GCGGTGCTCT ACGACCACTT CACTCCGTAC GTGTTGATGC AGCTCGAAGA GCTCGGCTTC TGCGGGCGTG GCGAGGCAGC GTCGTTCCTC GCCTCCGGTG CTTGCGAGCT GGAAGGGCGA CTACCGCTCA ACCCGCACGG CGGGCAGCTG GGCGAGGCCT ACATCCACGG CATGAACGGC ATCGCCGAAG CGGTGCGCCA ATTGCGCGGC ACCGCGGCGA ACCAGGTGCC GGGCGCGTCG CGAGCCGTGG TGACCGCGGG CACCGGAGTT CCGACCAGCG GATTGGTGCT GAGTAAGGTC TGA
|
Protein sequence | MSALSGKAAI AGIGATEFSK DSGRSELRLA VECVRAALDD AGLTPQDVDG LVTFTMDTNS EIAVARELRI PELKFFSRIN FGGGAAAATV QQAAMAVATG VADVVVAYRA FNERSGHRFG QVSNAAAQQV NTNGIDNAFH YPAGIATPAA TVAMAARRYM HDYGATSEDF GRVAVLDRAR AATNPKAWFH GKPITLDDHQ ASKYVAEPLH LLDCCQESDG GVALVITSAE RARDLKQTPA VIAAAAQGSG PDQYVMTSYY RDALTGLPEM GVVGGQLWSQ SGMTPTDMDL AVLYDHFTPY VLMQLEELGF CGRGEAASFL ASGACELEGR LPLNPHGGQL GEAYIHGMNG IAEAVRQLRG TAANQVPGAS RAVVTAGTGV PTSGLVLSKV
|
| |
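The GC content field can in principle be recomputed from the gene sequence as the share of G and C bases. The sketch below shows one way to do this on a sequence written in space-separated 10-base blocks, as in this record. The helper name `gc_percent` is hypothetical, and the example input is only the first two blocks of the gene sequence, not the full 1173 bp.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence in this record.
    print(gc_percent("ATGAGCGCGC TGAGCGGTAA"))  # 60.0
```

Applying the same function to the full gene sequence string gives a value that can be compared with the reported GC content.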