Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4176 |
Symbol | |
ID | 9158364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4299049 |
End bp | 4300086 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003649084 |
Protein GI | 296141841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGC GGATACGCAC GATCTGGCCC GCACTACTCG CTGCGGTCCT CGTGATCACC GGGCTGTGGA GCCTCCGGGA CATCGGACCC GCCACCCTGC CGACGCCGGG CTCCGCAGCC TCCGGCTGGC GCCTGAAAAT CCAGTTCCCC GACGTCCTGA ATCTTCCCGA TCAGGCGAAG GTTCGTCTTC TCGGCCGCGA TGTCGGTTCG CTGGAAGGGG TTCGACTGAC GGACAAGAAC GCGGAGGCGA CCTTGCGGAT CCAGTCGGAT ACCCGGGTGC CGCGCGCCGC CGTCGCGGAG CTGCGCCAGG ACACGCCGCT CGGAGACATC TACGTCGCGC TCACTCCGCC GAACACCACC CAGAGCACCG CCTTCTTCGC CGACGGAGAT CTGGTGCCGC AGTCACAGAC CCGGGCACCG GTCCAGATGG AGCAGATGCT CACCAGCCTC GCCGATTTCC TCGGCAGCGG CAGCCTGAGC CAGTTGGGCA ACACCTTCGA CCGGCTCAAC AGCAGCTTCC CCGAAGACCC CGCGCTGACC AAGAAGATCA TCGGCAACCT GACGACGACG GTTGACGCCT GGTCCAACGA CACCAAGAGC CTCGACACCG CGGTGGTCTC CCTGGTCGAG CTCACGACCA AGCTCGCAGC GCAGCGTGAT CTCGTGGGCG CCTATCTCGC GCCGGAGGCG GCGCCCCGAT GGAAGACGGT GATCGAGACC GGCGATATCG TCGCCGTCTT CGCCGCCCTA GCCCCGACGT TCAACAACGC CGCCTTCCTG GTCCCGACGC TGCGGGAGAC GAACACCCTG GTTCGGGCCG TTCTGCGACC CCTGCTGTAC TTCGACCGCC CGGCGGGATC GACCCGGCCG GACAACATCG TCAACCTGCG GAATCTGCTA CGCGACACCG TCATCCCGTA CCTCAGTGAG GGCGCGAAGG TGAACGTGGT CCGGCTCGGT CTCGGCGATC AGATGCCCAG TGGCGAGCAG GCCGATCAAG CGATCAAGAC ACTGCGCATG CTGGGGGTGG TCCGATGA
|
Protein sequence | MTTRIRTIWP ALLAAVLVIT GLWSLRDIGP ATLPTPGSAA SGWRLKIQFP DVLNLPDQAK VRLLGRDVGS LEGVRLTDKN AEATLRIQSD TRVPRAAVAE LRQDTPLGDI YVALTPPNTT QSTAFFADGD LVPQSQTRAP VQMEQMLTSL ADFLGSGSLS QLGNTFDRLN SSFPEDPALT KKIIGNLTTT VDAWSNDTKS LDTAVVSLVE LTTKLAAQRD LVGAYLAPEA APRWKTVIET GDIVAVFAAL APTFNNAAFL VPTLRETNTL VRAVLRPLLY FDRPAGSTRP DNIVNLRNLL RDTVIPYLSE GAKVNVVRLG LGDQMPSGEQ ADQAIKTLRM LGVVR
|
| |