Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3669 |
Symbol | |
ID | 9157849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3781181 |
End bp | 3782581 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | transcriptional regulator, XRE family |
Protein accession | YP_003648586 |
Protein GI | 296141343 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAGA CCTTCGTCGG TGCCCGGCTC CGCGGTCTGC GCAAGGAGCG CGGATTGTCG CAGGCGTCGC TGGCCGAGGC CCTGGAGATC TCTCCGTCGT ACTTGAACCA GATCGAGCAC GACGTGCGAC CGCTCAGCGT GCCGGTGCTG CTCAAGATCA CCGACGTCTT CGGCGTGGAC ACCAGCTTCT TCAATTCGCA GGACCAGACC CGGCTCATCG CCGAGCTGCG TGAGGTCACG ATGGATGTGG ACGCTCCGAC CAGCACCGAG GAACTGTCCG ACCTCGCCCG GGACCACCCC GGCTTCGCCC GCGCCATGGT CGCGTTGCAC CGCCGCTACC TCGGTGCCGC CGACCAACTG GCCCAGGTCA CCGACGGCCG CAACGATCCC GGCGCGCGCG GCGCGATCCC CAACCCGCAC GAGGAGGTGC GCGACTTCTT CTACCAGCAG CAGAACTACT TCCACGACCT CGACACCGCC GCCGAGGAGC TCACCGCCCG GATGCGGATG CACAGCTCGG ATGTGCGCAC CGAGATCGTC ACCCGTCTCG AGGCCGTGCA CAACGTCTCC ATCCGACGCC GCGTCGACCT GGGCGAGACG GTGCTGCACC GGTACGACCC GCGCACCCGG GTGCTGGAGA TCAACCTGCA CCTGTCCCCC GGGCAGCAGG TCTTCAAGAT GGCGGCCGAG CTCGCGTTCC TGGAGTGCGG ACGCGAGATC GACGCCCTGA TCGACGGTGC CGGGTTCGGC TCCGACGAGG CGCGCAGCCT CGCGCGACTG GGGCTCGCGA ACTATTACGC CGCCGCGGTC GTCCTGCCGT ACACCCAGTT CCATGGCGCC GCCGAGGAGT TCCAGTACGA CATCGAGCGA CTCTCCGCGT TCTTCTCGGT GAGCTACGAG ACCATCGCGC ATCGGCTGTC CACCTTGCAG CGCCCGAACC TGCGCGGGGT GCCGCTGTCG TTCGTCCGCG TCGACCGCGC CGGGAACATG AGCAAGAGGC AGAGCTCCAC GGGCTTTCAC TTCTCGGCAT CCGGCGGAAC CTGCCCGCTG TGGAACGTCT ACGAAACGTT CGCCTGGCCG GGCAAGATCA TCACCCAGAT CGTGGAGATG CCCGACGGGC GCAACTACCT GTGGGTGGCG CGCACCGTCG AGCGGAGGGC GGCGCGGTAC GGACAGCCCG GCAAGACCTT CGCGATCGGC ATCGGCTGCG AACTGCGGCA TGCACACCGG CTGGTGTATG CCCGTGGTCT CGACCTCTCC GATGCCAACG CCACCCCGAT CGGCGCCGGT TGCCGGGTGT GCGAGCGCGC CGGCTGCTCG CAACGCGCCT TCCCCGCCAT CGGTAAAGCA CTCGATATCG ACGAGCACCG CTCGACGGTG AGCCCCTACC TGGTCAAGTA G
|
Protein sequence | MSKTFVGARL RGLRKERGLS QASLAEALEI SPSYLNQIEH DVRPLSVPVL LKITDVFGVD TSFFNSQDQT RLIAELREVT MDVDAPTSTE ELSDLARDHP GFARAMVALH RRYLGAADQL AQVTDGRNDP GARGAIPNPH EEVRDFFYQQ QNYFHDLDTA AEELTARMRM HSSDVRTEIV TRLEAVHNVS IRRRVDLGET VLHRYDPRTR VLEINLHLSP GQQVFKMAAE LAFLECGREI DALIDGAGFG SDEARSLARL GLANYYAAAV VLPYTQFHGA AEEFQYDIER LSAFFSVSYE TIAHRLSTLQ RPNLRGVPLS FVRVDRAGNM SKRQSSTGFH FSASGGTCPL WNVYETFAWP GKIITQIVEM PDGRNYLWVA RTVERRAARY GQPGKTFAIG IGCELRHAHR LVYARGLDLS DANATPIGAG CRVCERAGCS QRAFPAIGKA LDIDEHRSTV SPYLVK
|
| |