Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2244 |
Symbol | |
ID | 9156400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2332819 |
End bp | 2333889 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | virulence factor Mce family protein |
Protein accession | YP_003647192 |
Protein GI | 296139949 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGC GACGCCGGAT CTTCGCGGGC GGAGCGTTGG CGACCGCCGC CGTGGTCCTC ACCGCGTGCG GCAGCCAGGG CATCTACTCG CTGCCGCTCC CCGGCGGTCC CGCGCTCGGT GCCCGCCCCA CGACCCTGCA CCTTCAGTTC TCGAACGTGC TGGACCTGGT GCCCGAATCG TCGGTCCGGA TCGGCGGTGC CACCGTCGGC AAGGTCGAGT CCATCGGGAT CGCGGCGGAT GGCTGGACCG CCGAGGTCAC CGTCAAGGTT CGCGACGGGG TCACGGTGCC CGCGGATGCC CGGGCCCAGA TCGAACAGTC GAATCTCCTG GGCGAGAAGT ACATCACCCT CGTCTCGGCC GGCGGATCGG ACGGCCGGTC CCTGCCCAGC GGCGCCACCA TCCCCGCCAC CCAGAACAAG ACCACCGCCG GTATCGAGGA GGTGCTCGGG GTGCTGTCCC TCGTCCTCAA CGACGGGGGA GTCGGTCAGC TGAAACCGAT CGTGGATGAA CTCAACACCG CCATGAGCAA TCCGCGGCAG GTGCGCTCGT TGATCACCCA GACCGACGAA CTGATCAAGG GGCTCAACGA GCAGCGGGGC GATATCACTC GCGCCATCGA CGGGTTGGCC GCGTTGTCGC GACGAGCACA ATCGCAGACC ACGCAGATCA GCCGGATACT CGCCGAACTG CCCCGCGGTA CAGCGGTACT CAATCAGCAG CGTCCCGAGA TCATCGAGAT GATCAAGCAA CTCGATCGGC TCGGCACGGT GGGCACGGAC GTGCTCAGCC GGTCGCAGCA GGAGACGATC GAGAACCTGC TCGCGCTTCG TCCCACGCTG CGCGCCCTCG CCGGTGCTGC TGATGACATC GTCACCGCGC TGCCCTTCGT CGCCACGTTC CCGTTCCCGG ACGCGGGCGT CGACGCGATC AAGGGCGACT CGATGAACCT GTTCATCTCG CTCGACACCC GTCTGATCAA CCAGCTCGAG GCACTCGGTG CGGGCAAACC CGCACCGACC TACGTCCCGC CGAAGTACGG TCCGGGTTCG GGACGGGGAG GCAACCGATG A
|
Protein sequence | MSLRRRIFAG GALATAAVVL TACGSQGIYS LPLPGGPALG ARPTTLHLQF SNVLDLVPES SVRIGGATVG KVESIGIAAD GWTAEVTVKV RDGVTVPADA RAQIEQSNLL GEKYITLVSA GGSDGRSLPS GATIPATQNK TTAGIEEVLG VLSLVLNDGG VGQLKPIVDE LNTAMSNPRQ VRSLITQTDE LIKGLNEQRG DITRAIDGLA ALSRRAQSQT TQISRILAEL PRGTAVLNQQ RPEIIEMIKQ LDRLGTVGTD VLSRSQQETI ENLLALRPTL RALAGAADDI VTALPFVATF PFPDAGVDAI KGDSMNLFIS LDTRLINQLE ALGAGKPAPT YVPPKYGPGS GRGGNR
|
| |