Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2247 |
Symbol | |
ID | 9156403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2336317 |
End bp | 2337408 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | virulence factor Mce family protein |
Protein accession | YP_003647195 |
Protein GI | 296139952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.864975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGC TCACCGGGCC GCTGCTCAAG TTGATCGTCT TCGCAGTCGT CACCGCGACC GCGACAGGCA TTCTCGGCGC GTCGATCGCG CAGCTCAACT TCAGCTTCGG CCGTGACTAC CATGCGATCT TCTCCGACGC GACGATGGTG CAGAAGGGCG ATGAGGTCCG GATCGCCGGG GTGCGTGTCG GCCAGATCAC CGGCGTGGCG ATTCACGACC GCGGCCAGGC CGATGTCGCA TTCAATCTCA GCGATCGCGA CTCCATCCCG GGGTCGGCCA CCGCGGCGTT GCGGCTGCGG AACCTCGTGG GACAGCGCTA CCTCGAACTC ATGCAGGCCC CAACGGACAA CGATCCGGTG CGTGGCGAAT CCCGGGCCAC CCCGTACCGA CCGGGCGATA CCATCCCGAT CGAGCGCACC CGGCCCGCGG TGAATCTGAC CGACCTGTTC AACGGGTTCA AGCCACTGTT CAAGCAGCTC ACTGCCGACG ATGTGAACAA GCTGGCGAGC CAGATCATCA CGGTCTTCCA GGGGCAGGGC GGCACTGTGA ACGACCTCGT GGTCAATACC GCGTCGCTGA CCAACACCAT CGCCGACAAG GACGCTGTGA TCGGCGAGCT CATCACCAAC CTCACGAAGG TGCTGGGCAC GATCAACGAG CGCGACAAGC AGTTCACCGA CCTGCTCGAC ACCACCCAGC AATTGGTGAC CGGGTTGTCT GAAGATCGTG GCGCGATCGG ATCCGCCCTC GGCTCGGTAT CGCAACTGAC TGCGGTCACG GATTCGATCC TCACGCCCAC GCGTGGCGCG ATTAAGGACG ATGTGGCTGC CCTGCGTGCT CTCACCGACA AGCTCAACGG CCGCAGCACC GACGTGGCGC ACACCCTGAA CTTCCTTCCG GAGAAGATCC AGGCCGTGGG CCGACTCGCG TCCTTCGGCG GGTGGTTCCA GTTCTACCTC TGCGGCGCCG ACATCGTCGC CGGCACAGGC AAGGGCGACA ACATCGCGCT CGCGATCGAC GTGCCGTCGG TGAACCAGCC CGTGTACACC AACACTGCCA CTCGCTGCTA CCGGGACGGG AACCCGCGAT GA
|
Protein sequence | MKPLTGPLLK LIVFAVVTAT ATGILGASIA QLNFSFGRDY HAIFSDATMV QKGDEVRIAG VRVGQITGVA IHDRGQADVA FNLSDRDSIP GSATAALRLR NLVGQRYLEL MQAPTDNDPV RGESRATPYR PGDTIPIERT RPAVNLTDLF NGFKPLFKQL TADDVNKLAS QIITVFQGQG GTVNDLVVNT ASLTNTIADK DAVIGELITN LTKVLGTINE RDKQFTDLLD TTQQLVTGLS EDRGAIGSAL GSVSQLTAVT DSILTPTRGA IKDDVAALRA LTDKLNGRST DVAHTLNFLP EKIQAVGRLA SFGGWFQFYL CGADIVAGTG KGDNIALAID VPSVNQPVYT NTATRCYRDG NPR
|
| |