Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0809 |
Symbol | |
ID | 9154949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 822929 |
End bp | 824599 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003645784 |
Protein GI | 296138541 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACG GACCCCTCAT CGTCCAGTCC GATAAGACCC TCCTGCTGGA GGTGGACCAC GACCAGGCCG ATGCGGCCCG TGCCGCGATC GCGCCTTTCG CCGAACTGGA ACGTGCACCG GAGCACGTGC ACACGTACCG GGTGACGCCG CTGGCGCTGT GGAACGCGCG GGCCGCAGGG CACGACGCGG AGCAGGTGGT GGACGCTCTG GTCACCTTCT CCCGCTACCC GGTGCCGCAA CCGCTGCTGG TAGACGTGGT CGACACCATG AGCCGGTACG GCCGGCTGCA ACTGGTGAAG AGCCCGGTGC ACGGGCTCAC CCTGGTCTCG CTGGATCGCG CGGTGCTGGA GGAGGTGCTG CGGCACAAGA AGATCGCGCC GATGGTGGGC GCCCGCATCG ACGACGACAC CGTGGTGGTG CACCCGTCCG AGCGCGGCCA CCTCAAGCAA CTGCTGCTCA AGGTGGGCTG GCCCGCCGAG GATCTCGCCG GTTACGTGGA CGGCGAATCG CACCCCATCG CGTTGGACAC GGAAACCGAT CCGTGGGAGC TGCGCGACTA CCAGAAGACG GCGGCGGACT CGTTCTGGCT GGGCGGTTCC GGTGTCGTGG TCCTGCCCTG TGGCGCGGGT AAGACGATGG TGGGCGCGGC TGCCATGGCG CGAGCGCAGG CCACCACGCT GATCTTGGTG ACGAACACGG TCGCGGGACG GCAGTGGAAG CGGGAGCTGC TGGCGCGCAC ATCGCTCACC GAGGAGGAGA TCGGCGAGTA CTCGGGCGAG AAGAAGGAGA TCCGCCCGGT CACCATCGCC ACGTATCAGG TGCTCACGCG GAAGTCGAAG GGCGAGTACA AGAACCTCGA CCTGTTCGAT TCGCGGGACT GGGGCCTGAT GATCTACGAC GAGGTGCATC TACTGCCCGC GCCGGTGTTC CGGATGACCG CCGACCTGCA GTCCCGCCGG CGCCTGGGCC TCACGGCGAC GTTGGTACGC GAGGACGGCC GCGAGGGCGA TGTCTTCTCG CTGATCGGAC CCAAGCGCTA CGACGCACCG TGGAAGGACA TCGAGGCGCA GGGCTGGATC GCGCCCGCGG ACTGCGTCGA GGTGCGGGTG ACGCTCACCG AGAACCAGCG GATGCAGTAC GCCACCGCCG AGCCCGACGA GCGGTACAAA CTGGCCTCGA CCGCACCCGC GAAATCCGCT GTGGTGAAGG CGATCCTGGA GCGGCATCGG GGCGCGCAGA CGCTGGTGAT CGGTGCGTAC ATCGATCAGT TGGAGGAGCT GGGCGCCGCG CTCGACGCCC CGGTGATCCA GGGCTCCACC AAGACGAAGG AGCGCGAGGC GCTCTTCGAC GCCTTCCGTC GCGGCGAGAT CTCCACGCTG GTGGTGAGCA AGGTGGCGAA CTTCTCCATC GATCTACCGG AAGCCTCGGT GGCCGTGCAG GTCTCGGGCA CGTTCGGGTC GCGGCAGGAG GAGGCGCAGC GCCTGGGCCG GCTGCTGCGC CCCAAGCACG ACGGTGGCAC GGCGCACTTC TACTCGGTGG TCTCGCGCGA CACCCTGGAC GCCGAGTACG CGGCACACCG GCAGCGCTTC CTCGCCGAGC AGGGCTACGC CTACCGGATC GTCGATGCCG ACGATCTGCT CGGCCCCGCT GTCGGCGAAA CCGCGGACTG A
|
Protein sequence | MTDGPLIVQS DKTLLLEVDH DQADAARAAI APFAELERAP EHVHTYRVTP LALWNARAAG HDAEQVVDAL VTFSRYPVPQ PLLVDVVDTM SRYGRLQLVK SPVHGLTLVS LDRAVLEEVL RHKKIAPMVG ARIDDDTVVV HPSERGHLKQ LLLKVGWPAE DLAGYVDGES HPIALDTETD PWELRDYQKT AADSFWLGGS GVVVLPCGAG KTMVGAAAMA RAQATTLILV TNTVAGRQWK RELLARTSLT EEEIGEYSGE KKEIRPVTIA TYQVLTRKSK GEYKNLDLFD SRDWGLMIYD EVHLLPAPVF RMTADLQSRR RLGLTATLVR EDGREGDVFS LIGPKRYDAP WKDIEAQGWI APADCVEVRV TLTENQRMQY ATAEPDERYK LASTAPAKSA VVKAILERHR GAQTLVIGAY IDQLEELGAA LDAPVIQGST KTKEREALFD AFRRGEISTL VVSKVANFSI DLPEASVAVQ VSGTFGSRQE EAQRLGRLLR PKHDGGTAHF YSVVSRDTLD AEYAAHRQRF LAEQGYAYRI VDADDLLGPA VGETAD
|
| |