Gene Information |
Locus tag | Tpau_2317 |
Symbol | |
ID | 9156473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2409023 |
End bp | 2410375 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | YP_003647263 |
Protein GI | 296140020 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
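The length fields in the record above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the listed gene length, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2409023
END_BP = 2410375


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1353 bp) and Protein Length (450 aa) fields.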
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGCA CGCTGATCAC GATCGCTGCG GTCGTCGGAT TCCTCGCGCT CACGTTGGGC ACCGCACTGT TCGTGGCGGC CGAGTTCTCA CTGACCGCCC TCGAGAAGTC GACTATCGAC GCCGATGTCC GCGACCGGGG TGACCGCCGG TCGAAACAGG TGCAGAGTGC GCACCGCACC CTGTCGTTCC AGCTGTCCGG CGCCCAGCTC GGCATCACCA TCACCACGCT GGTGACCGGA TACCTCGCCG AACCGGTGTT GTCGAAGTTC CTGCGCCCGG CGCTGGAATG GACCGGTATG CCCGGCGTCT GGTCGTCGGC CCTGACGCTG ATCCTGGCGC TGGTGATCGC GACCTCGCTG TCGATGGTGC TCGGCGAGCT GGCGCCGAAG AACCTCGCCA TCGCCCGGCC CCTGAACACC GCCCGCCTGA CCGCCGGTGC GCAGGCGGCG TTCTCCACGG TCTTCCGCTG GGCGATCACC TTCCTCAATA CGACGGCCAA CCGATTGGTG CGCCGGCTTG GCATCGAACC CGCCGAGGAG CTGAGCTCGG CGCGGTCCGC ACAGGAATTG AGCGCGCTGG TCCGCAATTC CGCGGCACAG GGCGCGATCG ATGAGATGAC GGCCGAACTG GTGGGCCGCT CGCTGGAGTT CGGTGAGCTC ACCGCCGAGG AGCTGATGAC CCCGCGACAG CGGGTGCACT CGGTGGGGGT CGACGACACG GTGGCGGATC TGGTGGCGCT GGCGATCGAC TCGGGGCATT CGCGGTTCCC GGTGATCCGC GGCGACCTCG ACGACACCGT GGGCTTCATT CACGTCAAGC AGGCCCTGAC CGTGGCCGCG GACCGTCGCG CGTCCACCAC CGTGGGCTCG ATCGCCACGG CACCCCCGGT GGTCCCGGCG GCGCTGGACG GCGACGCACT GATGGAGCAG CTGCGGGCGA ACGGCCTGCA AATGGCGCTG GTCGTGGACG AGTACGGCGG CTCCGCGGGG ATCGTCACGG TGGAGGATCT GATCGAGGAG ATCGTCGGCG ACGTGCGCGA CGAGCACGAC GACAACGAGC CGGTCGACGT GCAGCGCACC GACACCGGCT ACCTGTGCGC AGGCCTGCTC CGCATCGACG AACTCGAACG CGATACCGGG TACCGCGCAC CGGAGGGCGA CTACGACACT CTCGGTGGCC TGGTGATGTT CCTGCTGGGC CGGATCCCTG AGGTGGGTGA TCAGACCGAG CTGCCGCCGC ATCGTGTGGA GGACGACGAA TCCGGTACCG ATCGCAGCTG GATCGCGCGG GTGGCCCGGA TGGACGGCCG CCGCGTCGAC CTCGTCGAGT TGGTGGAGGT GACCGATGAA TGA
|
Protein sequence | MSSTLITIAA VVGFLALTLG TALFVAAEFS LTALEKSTID ADVRDRGDRR SKQVQSAHRT LSFQLSGAQL GITITTLVTG YLAEPVLSKF LRPALEWTGM PGVWSSALTL ILALVIATSL SMVLGELAPK NLAIARPLNT ARLTAGAQAA FSTVFRWAIT FLNTTANRLV RRLGIEPAEE LSSARSAQEL SALVRNSAAQ GAIDEMTAEL VGRSLEFGEL TAEELMTPRQ RVHSVGVDDT VADLVALAID SGHSRFPVIR GDLDDTVGFI HVKQALTVAA DRRASTTVGS IATAPPVVPA ALDGDALMEQ LRANGLQMAL VVDEYGGSAG IVTVEDLIEE IVGDVRDEHD DNEPVDVQRT DTGYLCAGLL RIDELERDTG YRAPEGDYDT LGGLVMFLLG RIPEVGDQTE LPPHRVEDDE SGTDRSWIAR VARMDGRRVD LVELVEVTDE
|
| |
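The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the pipeline that produced this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 bases of the gene sequence are used here, to keep the example short.

```python
# Codon table built in TCAG order; amino acid assignments are those shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First three 10-base groups of the gene sequence, spacing as in the record.
FRAGMENT = "ATGTCTAGCA CGCTGATCAC GATCGCTGCG"


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


print(translate(FRAGMENT))
print(f"GC fraction of fragment: {gc_fraction(FRAGMENT):.2f}")
```

Passing the full gene sequence to `translate` and `gc_fraction` instead of the fragment would allow a comparison with the Protein sequence and GC content fields. The fragment's GC fraction is not expected to equal the whole-gene value.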