Gene Information |
Locus tag | Tpau_2117 |
Symbol | |
ID | 9156272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2209924 |
End bp | 2211171 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | secreted protein |
Protein accession | YP_003647067 |
Protein GI | 296139824 |
COG category | |
COG ID | |
TIGRFAM ID | |
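The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpau_2117.
length = gene_length(2209924, 2211171)
print(length, protein_length(length))  # prints: 1248 415
```

Under these assumptions the result agrees with the reported Gene Length (1248 bp) and Protein Length (415 aa).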
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.512527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTGA TGACGACGAC TGTATCCCTC GCCGCCGTGC TCGCCCTGGC AGGCGGCGTG CTCGCGCCCG GAACCGCGTT CTCTGCCCCC GCCCCCGGCC CGACACCGGA GGAACGGATG TACGACGCGG GCGACGTGCC CGGCCGACTG GTGACGGGCG CCGGTTTCGA CCGGTTCGCC CGCGACCTCG CCTCGGCACT GGACCGGGCG CGCACCGCCG ATGAGGCGGA CCGGGAGGCC TCGCGCGTGT CGTCCGCGTT GTTCCGCGCG GCGGTACAGC GGGCGCAGGG CGCCGGTCCG GGTTTCACCG CCGACGATCG GCCGCTGTAC TGGGCGCGGA TCGCGCTGAT CCGCACCGTC CGGAACTGGC AACCGGCGAC CGCGGTGACG GAGTCGGAGC GCGACCGGAT CGTCGGCACC ATCGATGAGG TCTCCCGTGG CCAACGGCGC GCGCCGGTCG GCCCGACGGT GCTGGTGACG GGTTTCGACC CGTTCCGTCT CACTCGCGAT ATCCGCCAGG GCAATCCGTC CGGCGCGATC GCGCTCGCCC TGGATGGAAC ACAGGTCGAT ACGCCGGCGG GCCGCGTGAC GGTGGTGGCG ATGCTGTTCC CGGTGCGGTG GCGCGATTTC GGTGCGGGCA TGGTCGAACG TGCCGTGACT CCTTTCCTGG CGCCCGGATC GTCCCGCGTG GTCGGGTTCA CCACGGTCAG TCAGGGACGG CCCGGCAAGT TCGACCTGGA GGCGATCAAC GGAGCCTGGC GCGGCGGCGC CGTCGATAAC GAGGGGGCCT GCTACCGCGG CCCGGCACCG GTGGCCGGAG CCGCCCCGCA GTGGACCCGG AGCACGCTGC CGATGGATGC GATCGTCTCG GCGGCACGGG GCACGTACCC GGTGGTACGC AACACCGAGG TCAGCTATGC CACAGGCTCC GATCCTGCAC CGAGCACCGT CTGCGACCTG CCGAAGCTGC CGTCCACCAC CACGACGGCG GAACCGCCGA TCGATGCCCG GGCGCGCCAG GGAGCGGGCG GGGACTATCT GTCCAACGAG ATCGGCTACC GCGTGACACT GGTTCGCGAC CGGCTCGCGG CACCGATTCC GGGCGGTCAC CTGCACACGC CGGTGCTCGA TGGGCTACCC GCCGAACGCA GTGCTCTCGA ATCGCCGCAA TACCGGGCGA ACCTCGCGGC GATCGTCACG CAGGCACGGG CCGTGGTGAG CGTGGTGGCG CACGGCCAGA GAAACTAG
|
Protein sequence | MRLMTTTVSL AAVLALAGGV LAPGTAFSAP APGPTPEERM YDAGDVPGRL VTGAGFDRFA RDLASALDRA RTADEADREA SRVSSALFRA AVQRAQGAGP GFTADDRPLY WARIALIRTV RNWQPATAVT ESERDRIVGT IDEVSRGQRR APVGPTVLVT GFDPFRLTRD IRQGNPSGAI ALALDGTQVD TPAGRVTVVA MLFPVRWRDF GAGMVERAVT PFLAPGSSRV VGFTTVSQGR PGKFDLEAIN GAWRGGAVDN EGACYRGPAP VAGAAPQWTR STLPMDAIVS AARGTYPVVR NTEVSYATGS DPAPSTVCDL PKLPSTTTTA EPPIDARARQ GAGGDYLSNE IGYRVTLVRD RLAAPIPGGH LHTPVLDGLP AERSALESPQ YRANLAAIVT QARAVVSVVA HGQRN
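The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that cleaning step plus two simple checks (GC fraction and a terminal stop codon). The helper names are hypothetical, and the stop codon set assumes translation table 11, as listed in the record.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks of the Gene sequence field, used only as a short illustration.
fragment = clean_sequence("ATGCGCTTGA TGACGACGAC")
print(len(fragment), fragment.startswith("ATG"))  # prints: 20 True
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the reported GC content, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.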