Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4177 |
Symbol | |
ID | 9158365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4300083 |
End bp | 4301153 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003649085 |
Protein GI | 296141842 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCG AGCTGATCGC CAACATCGTC ACCTTCGTGG TGGTCCTCGC GGTGGGGGTG AGCGCGCTGG TCTTCGGCTA CATGGGGGTG CGGCCCGGCA CGCAGTACAC CACGATCACC CTGCAACTGC CCCGATCCGC GCAGCTGGTG ACGGGTTCCT CGGTACTGTT GCGCGGCACC CGGATCGGCG AAGTGGAGCG CGTCGACGGC AGCACGCGTG GAGTATCGGT GCGGCTGAAG TATCCGGATA CGAACCGGGT CCCCGTGGAC TCGGAGGTCT CGATCGAGCA GCTGACCGCA CTGGGCGAGC CCTACGTGGA GTTCGCCCCC AGCGGCACGG ACGGACCGTA CTTCACCGAC GGGGCGATGA TCGACCCGAA GCGGGTCTCG ATCCCGACCT CGATTCCCGA TATCTTCCGT TCGCTGTCCG AGGTGAGCAA GATCGCCGAT GTCGGCCCGC TCGCCGACAT CGTGAAGACG ATGTGGCAGG CGACCACGGG AACCGATGCG GCGATGCCCC GGCTGACGCA GGCCGGCGAG TTGCTGACCT CGACGCTCGT GTCCCGGATG CCTCAGATCC GCAGCATGTT CGAGCAGACC CAGGTGTACT CGGCCGATCT CGAGTGGATG GGACCGGCGA TCACCGACTT CGGCCCGGCC TTCCGCAGCA CGATCGGAGT GATCCGGCCC GCGGTCGACA AGGTGCTGAC ACTGGTCACC GAGCTCGGCC TGCCGGGGCC CTTCAACGAT GTCCTGCACC CGTTCGCGAC CCGTCTGCAG CCCTACCTGA CCGAATTGAT TCCGAAGGTC GCCGAGATCC TGGGGCCGAG CCTGCCTATT CTCAAGGCCG TGAACGGAAC CGTGCCTCCG ATCGATCTGA GCGCCTTCCT CACATCGGCC CTGGCCATGG TCGGCGACGA CGGAACGCCG CGGCTCTCGA TCACCGTCCC GATCCCGGGT GGTCCGGACG CCGCGGTACC GTCGGCCCGG CCCGGCCCGG CGGCACCGTC CGCGACCACG CCGCCGCGCA GTGCGGCGCC CGCACCGAGC ACGTCCCCGA CCCCGAGGTG A
|
Protein sequence | MKRELIANIV TFVVVLAVGV SALVFGYMGV RPGTQYTTIT LQLPRSAQLV TGSSVLLRGT RIGEVERVDG STRGVSVRLK YPDTNRVPVD SEVSIEQLTA LGEPYVEFAP SGTDGPYFTD GAMIDPKRVS IPTSIPDIFR SLSEVSKIAD VGPLADIVKT MWQATTGTDA AMPRLTQAGE LLTSTLVSRM PQIRSMFEQT QVYSADLEWM GPAITDFGPA FRSTIGVIRP AVDKVLTLVT ELGLPGPFND VLHPFATRLQ PYLTELIPKV AEILGPSLPI LKAVNGTVPP IDLSAFLTSA LAMVGDDGTP RLSITVPIPG GPDAAVPSAR PGPAAPSATT PPRSAAPAPS TSPTPR
|
| |