Gene Information |
Locus tag | Tpau_1244 |
Symbol | |
ID | 9155385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1278514 |
End bp | 1279668 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003646214 |
Protein GI | 296138971 |
COG category | |
COG ID | |
TIGRFAM ID | |
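
The length fields above are consistent with the listed coordinates. As a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon (the record does not state either convention; the function names are illustrative):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1278514, 1279668)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

The strand does not affect either value; a gene on the minus strand spans the same interval of the replicon.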
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.994908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGTCG TCTCCGTTCC CGGAATGTCG ATCGACCGGC CCGTGCTGCG GCGCCGGGGC CTGATCGCGG CCGCAGTCAT CCTGGTGATC GCACTCATCG TCTGGATCGC GCAGGCGGTA TGGCCGAAGG ATGAATTCTC CTTCACCCTC CGCACGCCCA CCGTCGCGGC CGGCATCGTC AACGGTGCGC CGGTGCGCAT CCAGGGTGTT CAGGTGGGCG AGGTCACGGG CGTCAAGGCC GTCTCGAGCG GCCAGCAGGG CGTGACAGTC ACCATGAAGT CGGCCGATGG GAAGTCGCTG ACCAACAATG TCGAGGCGGC GTTCTCCGCG GGTAACCTGT TCGGCGTCTC GGAGGTCATC CTGACCCCGC GTGACGGCGG CGGCGAGCTC AAAGACGGCG CCACCATCTC CCCCACCAAG CAGATCACCG ATAACACGGT CTCGAACATG ATCGTCACGA TCGGCGACGT CAACAACGAC GCCCTGCGAC CCAATATGAG CACGATCCTG CTCAACTTCG ACGCCTCGTC GAAGGCGATG CTCCCGCTGT TCACCGCGCT CGGCAACGTC GCGCAGGCGG TGCAGGACAC GCAGCGGCTG ACCACCGCGC AGACCTTCCC CGTGATCACC GACACCCTGA TGGGCGCCGA TTCGGCGATC GCCCAGATCA TTCCCTCGGT GCGCACCCTG TTCAATTACG CGCCCGTGCA CGACAAGGGC TGGGTGAACC GTGGCGCAGC CACGCTGGAC GCGATCACCA ACCAGAAGGA CAGCCTCAGC GCGGCACTGC AGAAGCTCCT CGACGCCAAG GCCCTCAAAG GCCTGGAGAC CGCGACGCCT ATGCTGGTGA ACCTGATGCA ACCGCTGCTC AACGCCTTCC CGAACGGAAG CGCCACCGGT GTGGGCATCC AGATCGGGCA GCTGCTCGAC AACGTGCGCA AGGCCATGCC GAACACCCCG AACGGCCCGG TCCTCAACGT GCGGCTGTCG GTCGACTTCC CCGCGATCGC GGCGGGACTA CCGCCCGCGC CCACGTTCAC CTACGTGCCG CCGAACCCGG GAGCCAAGCC CGGCGACGCC AAGCCGAGCG CCAAGCCCGC TGCGGGCAGC AGCACCGCGA CCCCGAGCAC CCCGACCCCC AAGCCAGGGA GCTGA
|
Protein sequence | MGVVSVPGMS IDRPVLRRRG LIAAAVILVI ALIVWIAQAV WPKDEFSFTL RTPTVAAGIV NGAPVRIQGV QVGEVTGVKA VSSGQQGVTV TMKSADGKSL TNNVEAAFSA GNLFGVSEVI LTPRDGGGEL KDGATISPTK QITDNTVSNM IVTIGDVNND ALRPNMSTIL LNFDASSKAM LPLFTALGNV AQAVQDTQRL TTAQTFPVIT DTLMGADSAI AQIIPSVRTL FNYAPVHDKG WVNRGAATLD AITNQKDSLS AALQKLLDAK ALKGLETATP MLVNLMQPLL NAFPNGSATG VGIQIGQLLD NVRKAMPNTP NGPVLNVRLS VDFPAIAAGL PPAPTFTYVP PNPGAKPGDA KPSAKPAAGS STATPSTPTP KPGS
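
The sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction. It assumes GC content means the share of G and C bases in the sequence; the record does not say how its 67% figure was derived, and the helper names are illustrative.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example = clean_sequence("ATGGGAGTCG TCTCCGTTCC")
print(len(example))          # 20
print(gc_fraction(example))  # 0.6
```

Passing the full gene sequence string to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field.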