Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4172 |
Symbol | |
ID | 9158360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4294814 |
End bp | 4295830 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003649080 |
Protein GI | 296141837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.795057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCGA TCTCCTCCGT ACCCGGCATG ACCGTCGAAC CCAAGCAACT GGTTCGACGG GGAGCCCTCA CCGTCGCGAT CGTCGTCGTG GTCGCACTAC TCGCGTGGGG CATCACCGCC CTCGCGCGAC CCGCCCCGAT GCGGATCACC CTGCTCACCT CGGAGGTGGC CCCGGGAGTG AACAACGGCA CGGTGATCGA GATGAACGGG ATCCGGATCG GGACCGTTTC GGGCATCACG CAGGAGGGAT CGGGACGCCT CGGCATCGAC ATGGATCTCG ACAAGGACAA GGCGGCGGGA CTCACCGACG CGCTGCTCGC GGACTTCGCC CCCAACAACC TGTTCGGCAT CACCTCCGTG CAGATCATTC CGTCGCCCGC GGGCTCGCCG CTGCAGGACG GCGCGCGGGT GCGCCCGGCC ACCGCACCCA GTGATTCGAC GATGTCCGGC CTACTGCGCA CCCTCGCCGA TGTGGAATCG ACGGCCCTGC GGCCCTACAT GTCCGATCTG CTGCGTCAGT CCGACACCGC GACGAAGGGC TTCCTGCCCA TGGTGCGCGC GCTCGGTGCG GTCGCCCAGG CCAATGCCGA GACGCAGACC GTGCCCACCT CGTACACGCT CCCGATCCTG TCCAGCGCCC TGCAGGGCAT CGGCGCCTCG ACCGACGACC TCCTCGCGGC GATCAAACTC CTCTGGGACT GGCCCGGACC CGATACACCC GGCTACCCGA AGGCCCAGAC CGCGACGGTC GACACGCTGG TCAACGTGAC ATCACCCGAT ATCGCCAATC TCGTCAAATC CCTGCAGCCG CTGGCTCCGA CGCTCTCGGT GCTGGCCGAT GCGGAGCGGC GGGGCGTCGC CTCGATGCCC GGTGCCGCCC GGAACGGACA GCAGATCTCC GATCTCATCG AGGCGATCCG CCGCGCCATC AAGGACACCC CGAACGGCAA GGTTCTCAAT CTCGATGTGC AGGCGCGGCC CGTCCCGATC CCGCCGCAGG GCGGTGGTGG ACGATGA
|
Protein sequence | MASISSVPGM TVEPKQLVRR GALTVAIVVV VALLAWGITA LARPAPMRIT LLTSEVAPGV NNGTVIEMNG IRIGTVSGIT QEGSGRLGID MDLDKDKAAG LTDALLADFA PNNLFGITSV QIIPSPAGSP LQDGARVRPA TAPSDSTMSG LLRTLADVES TALRPYMSDL LRQSDTATKG FLPMVRALGA VAQANAETQT VPTSYTLPIL SSALQGIGAS TDDLLAAIKL LWDWPGPDTP GYPKAQTATV DTLVNVTSPD IANLVKSLQP LAPTLSVLAD AERRGVASMP GAARNGQQIS DLIEAIRRAI KDTPNGKVLN LDVQARPVPI PPQGGGGR
|
| |