Gene Information |
Locus tag | Tpau_4175 |
Symbol | |
ID | 9158363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4297940 |
End bp | 4299052 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003649083 |
Protein GI | 296141840 |
COG category | |
COG ID | |
TIGRFAM ID | |
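The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start/End values are 1-based inclusive coordinates and that the CDS ends in a single stop codon (neither is stated in the record); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(4297940, 4299052)
print(length, protein_length(length))
```

Under those assumptions the computed values agree with the Gene Length (1113 bp) and Protein Length (370 aa) rows.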
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTCCGTT CGGCTCTGGG ATCGAAGTGG TTGATCAGCG GCATCGCGGT CGCGATCGTG ATCGTCCTCG GACTGGGCGG CGCCTGGTAC CTGCGCGGCG GCACCGAACG CAGCGTGACC TACTGCGCGC AGATGCCCGA TGCGATCGGT CTCTACGAGG GCAATCCCGT GACGCGACGC GGAGTCACCG TGGGCAAGGT GACCGCGGTG AGCGCGGAGG GCGCGAAGGC TCGGGTCGAG ATGTCGGTCT CCGCGAACGA GCGGATCCCC GCCTCGGCGA GCGCCGTCAC CGTGGCCAAA TCCCTGATCG CATCCCGCCA ACTCGCCCTC ATCGGCGACG ACAACGGCGG CCCCACTCTG CAACCCGGCA AGTGCATCGT GGATACCAAG ACTCCCCTCT CCCTGACCAA GTCGATGGAG GGCGTCTACC GCCTCACCTC GCAGGTCACC ACCGGGGGCG GGCCGGAACA GACCAAGCAG GCACAGCAGG CGCTCGGTGC GCTGGCGCGC GAGACCAACG GCACCGGACC GCAGATCAAC GGCATCCTCA ACCAACTGTC CTCGGTACTC GACAATCCCG GCCCCGGTAT GGACGATCTG GCGCGCGCCT TCGATGCCCT GGCTCCGCTG ACCTACGGCA TGACCTCGAA CTGGGGCGAT ATCAAGTCGC TGTTCAGCAA TCTGCCCGCG TACCTGGTCA ACGTGATGCT GCCGCTCGGA TCGACGGTCG ATGCACTCGC CACCACCCTG CTGCCGCTGG GCAAGATGCT GTTCAACCTC GTCGGACAGT ACGGCCACAT GATCTTCCCG GTACTCGATC TCGCGGTGCC GGTCACCCGC CTGGCCGCCG CGGGCATCCG CAACTACGGC GACCTGATGG GGATCCTCCC ACCGCTGATC TCCGCCTTCA ACGTCAACTA CGACCAGGCC AACGCCCGGA TCAAGATCAA CTACACACCG CTACCCGATA CCGCTTTCGC GGCGGCGAAC CCCGAACTGA CGTGCACCAA CGTCAATCGG ATCGCACCGG GACAGTGCCA AGTCGTCGGT GACGGCAAGA TCCGGGTCGA CGTGATGTCG GTCGTCCTGC GGGCGACGGG GGCGGCGCGA TGA
Protein sequence | MVRSALGSKW LISGIAVAIV IVLGLGGAWY LRGGTERSVT YCAQMPDAIG LYEGNPVTRR GVTVGKVTAV SAEGAKARVE MSVSANERIP ASASAVTVAK SLIASRQLAL IGDDNGGPTL QPGKCIVDTK TPLSLTKSME GVYRLTSQVT TGGGPEQTKQ AQQALGALAR ETNGTGPQIN GILNQLSSVL DNPGPGMDDL ARAFDALAPL TYGMTSNWGD IKSLFSNLPA YLVNVMLPLG STVDALATTL LPLGKMLFNL VGQYGHMIFP VLDLAVPVTR LAAAGIRNYG DLMGILPPLI SAFNVNYDQA NARIKINYTP LPDTAFAAAN PELTCTNVNR IAPGQCQVVG DGKIRVDVMS VVLRATGAAR
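The sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. A minimal sketch of that clean-up plus a GC-fraction calculation follows; the helper names are hypothetical, and the example uses only the first three blocks of the gene sequence for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record, for illustration only.
prefix = clean_sequence("ATGGTCCGTT CGGCTCTGGG ATCGAAGTGG")
print(len(prefix), round(gc_fraction(prefix), 2))
```

Applied to the full gene sequence, the same two helpers should give values comparable to the Gene Length and GC content rows, provided the record is internally consistent.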