Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0331 |
Symbol | |
ID | 9154466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 340258 |
End bp | 341997 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003645313 |
Protein GI | 296138070 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACG GTGCAACGAA CGTTCACCTG CCTTTTGCAG AGCAGAACAA GACTGAGGGC ACACCGGTGC CCACCACGCC GCAGTCCGTC GCGGGCGCGC TGGCGAGTAA TGCGGCGGCC GTGCGACCCG TGGTGACCCC CGGGCCGGTG TCCGACGAGG GCACGGATTC GGCACCGTCG ACCACGGAGT GGGAGGAACC CGAGCTCCCC GTCGTCGACG ACGCGGGCGA GGCCGGCCCC GTGCCCACCG CCGCCGAGGA AGCCCCCGCC GCCGCGGTCG CCGACGCCGC CGCGGTCGCC GACGCCGTCG CGTCCGACGA CGACCGACCC ACCCGGGAGC AGCCCGTGAT CGTCGACGAG CCGGCCGGCC GGCCGGAGGC GGAGGCCGCC CCCACCGCCG ACACGGCGGC AGCGGGCGGG CAGGCCTCGC CGCAGGGAGG CCCGCAGGTC ACCGGTCCGC AGCCGGCGGC GCAGGCCCCA CAGCAGGCGC GGACCGTGCC CGTTCCGGCG CAGCAGCCCC AGCAGCAGTG GCAGGGCCAG GCCCCGCAGG CGCAGCCCGG CCAGCATCAG TTCGCGCAGC CGCACACCGG TCAGTTCCAG CAGCCGCAGT TCGCCGGCCA GCCGCCCACC GGGCAGTTCC AGGCTCCCGT GCAGGGCCGC CCGCCGTCAC CGCTGCCGTT CGGCCACGCC GCACCCGGCC AGATCGCACC GCCGCAGCCC CAGCACGCGG GTATGCCGAT GCGGCCGATG GGGGAGTCGG TCTCCGCGCC GATGTCCACG GCCTCGTCAT CGTTCCAGCA GCACATCCGG ATGTCCGATT CGGCGATGCG GGTCACCGGC TGGCGCGCGT GGCTCAAGCG CATGGGTATC GACGTGGGCC CCAGCACCTC CGATCAAGAG CATGCGGAGC GGGTGCACCG GATCCGCGTC CCGAAGAACA AGTTCCACGT CACCAGCGTC TTCGCCGATA GCGCGGGCGG CACCCTGCTG GTGGGCGTGC TGGGCCAGAT TCTGGAGCGC ACCCGCGCCG ACAACGTGGT CGCGCTCGAC CTCGATCCCG ACGGCGGCGA TCTCGACCAG GTGACCGCGT GGCATCAGGG CGGTTCGACC GCGCGCACCC TGATCCAGCA GAGTGATCTG TCGGACCGCA ACCAGGTGGA CAAGCACCTG GCCGTCACGT CGACGAATCT GCACGTGCTG CCCACCCCGT GGCGGTTCAA CGGCCGCGAC GTCGCCGATT ACGATGACGT GCTCGATCTG TACTCGATAT TCCGCCCGCA CTACAGCCTG GCGCTCGTCG ATGCGGGCCG AGGACTCCAG ACCGTCACGG GCACCGGCGT TCTGGAGATC TCTTCCGCGT TGATCCTTCC CGCCTCGGCC ACGACCCGCG GCGTGCGCAA GGTGGCCGCG ACCATCGATT GGCTGCGTCA CCACGGGTGG CACGGATTGC TCGCCAACAC CATCGTGGTC ATCAATCACA CCAAGAAGCG CGGCAGTGTC ACCGTCGAGC AGTTCGACGA GCTGTTCCGC GCCGGCCAGA AGCTGCGGGT CCACGAGATC CCCTACGACC CGCACCTCGA TGCCGACACC CCGATCGACC TCGATCTGCT CAAGCCGCGC ACCGTGCGCG CGTTCGAGCT GCTCGCGGCC GACCTCGCCG ACACCTTCAA TTCCGGATAC GAGCCGCCGG CCGCCACCAA GCTGGCCGAA CTGCAGGTGC GCTCGCACGA GGGACGCTGA
|
Protein sequence | MSDGATNVHL PFAEQNKTEG TPVPTTPQSV AGALASNAAA VRPVVTPGPV SDEGTDSAPS TTEWEEPELP VVDDAGEAGP VPTAAEEAPA AAVADAAAVA DAVASDDDRP TREQPVIVDE PAGRPEAEAA PTADTAAAGG QASPQGGPQV TGPQPAAQAP QQARTVPVPA QQPQQQWQGQ APQAQPGQHQ FAQPHTGQFQ QPQFAGQPPT GQFQAPVQGR PPSPLPFGHA APGQIAPPQP QHAGMPMRPM GESVSAPMST ASSSFQQHIR MSDSAMRVTG WRAWLKRMGI DVGPSTSDQE HAERVHRIRV PKNKFHVTSV FADSAGGTLL VGVLGQILER TRADNVVALD LDPDGGDLDQ VTAWHQGGST ARTLIQQSDL SDRNQVDKHL AVTSTNLHVL PTPWRFNGRD VADYDDVLDL YSIFRPHYSL ALVDAGRGLQ TVTGTGVLEI SSALILPASA TTRGVRKVAA TIDWLRHHGW HGLLANTIVV INHTKKRGSV TVEQFDELFR AGQKLRVHEI PYDPHLDADT PIDLDLLKPR TVRAFELLAA DLADTFNSGY EPPAATKLAE LQVRSHEGR
|
| |