Gene Tpau_3669 details


Gene Information

Locus tag: Tpau_3669
Symbol:
ID: 9157849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 3781181
End bp: 3782581
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: transcriptional regulator, XRE family
Protein accession: YP_003648586
Protein GI: 296141343
COG category:
COG ID:
TIGRFAM ID:
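
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3781181
end_bp = 3782581
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

length_ok = span_bp == gene_length_bp and span_bp % 3 == 0
protein_ok = expected_protein_aa == protein_length_aa
print(span_bp, expected_protein_aa, length_ok, protein_ok)
# prints: 1401 466 True True
```

Under those assumptions, the start, end, gene length, and protein length fields agree with one another.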


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAAGA CCTTCGTCGG TGCCCGGCTC CGCGGTCTGC GCAAGGAGCG CGGATTGTCG 
CAGGCGTCGC TGGCCGAGGC CCTGGAGATC TCTCCGTCGT ACTTGAACCA GATCGAGCAC
GACGTGCGAC CGCTCAGCGT GCCGGTGCTG CTCAAGATCA CCGACGTCTT CGGCGTGGAC
ACCAGCTTCT TCAATTCGCA GGACCAGACC CGGCTCATCG CCGAGCTGCG TGAGGTCACG
ATGGATGTGG ACGCTCCGAC CAGCACCGAG GAACTGTCCG ACCTCGCCCG GGACCACCCC
GGCTTCGCCC GCGCCATGGT CGCGTTGCAC CGCCGCTACC TCGGTGCCGC CGACCAACTG
GCCCAGGTCA CCGACGGCCG CAACGATCCC GGCGCGCGCG GCGCGATCCC CAACCCGCAC
GAGGAGGTGC GCGACTTCTT CTACCAGCAG CAGAACTACT TCCACGACCT CGACACCGCC
GCCGAGGAGC TCACCGCCCG GATGCGGATG CACAGCTCGG ATGTGCGCAC CGAGATCGTC
ACCCGTCTCG AGGCCGTGCA CAACGTCTCC ATCCGACGCC GCGTCGACCT GGGCGAGACG
GTGCTGCACC GGTACGACCC GCGCACCCGG GTGCTGGAGA TCAACCTGCA CCTGTCCCCC
GGGCAGCAGG TCTTCAAGAT GGCGGCCGAG CTCGCGTTCC TGGAGTGCGG ACGCGAGATC
GACGCCCTGA TCGACGGTGC CGGGTTCGGC TCCGACGAGG CGCGCAGCCT CGCGCGACTG
GGGCTCGCGA ACTATTACGC CGCCGCGGTC GTCCTGCCGT ACACCCAGTT CCATGGCGCC
GCCGAGGAGT TCCAGTACGA CATCGAGCGA CTCTCCGCGT TCTTCTCGGT GAGCTACGAG
ACCATCGCGC ATCGGCTGTC CACCTTGCAG CGCCCGAACC TGCGCGGGGT GCCGCTGTCG
TTCGTCCGCG TCGACCGCGC CGGGAACATG AGCAAGAGGC AGAGCTCCAC GGGCTTTCAC
TTCTCGGCAT CCGGCGGAAC CTGCCCGCTG TGGAACGTCT ACGAAACGTT CGCCTGGCCG
GGCAAGATCA TCACCCAGAT CGTGGAGATG CCCGACGGGC GCAACTACCT GTGGGTGGCG
CGCACCGTCG AGCGGAGGGC GGCGCGGTAC GGACAGCCCG GCAAGACCTT CGCGATCGGC
ATCGGCTGCG AACTGCGGCA TGCACACCGG CTGGTGTATG CCCGTGGTCT CGACCTCTCC
GATGCCAACG CCACCCCGAT CGGCGCCGGT TGCCGGGTGT GCGAGCGCGC CGGCTGCTCG
CAACGCGCCT TCCCCGCCAT CGGTAAAGCA CTCGATATCG ACGAGCACCG CTCGACGGTG
AGCCCCTACC TGGTCAAGTA G
 
Protein sequence
MSKTFVGARL RGLRKERGLS QASLAEALEI SPSYLNQIEH DVRPLSVPVL LKITDVFGVD 
TSFFNSQDQT RLIAELREVT MDVDAPTSTE ELSDLARDHP GFARAMVALH RRYLGAADQL
AQVTDGRNDP GARGAIPNPH EEVRDFFYQQ QNYFHDLDTA AEELTARMRM HSSDVRTEIV
TRLEAVHNVS IRRRVDLGET VLHRYDPRTR VLEINLHLSP GQQVFKMAAE LAFLECGREI
DALIDGAGFG SDEARSLARL GLANYYAAAV VLPYTQFHGA AEEFQYDIER LSAFFSVSYE
TIAHRLSTLQ RPNLRGVPLS FVRVDRAGNM SKRQSSTGFH FSASGGTCPL WNVYETFAWP
GKIITQIVEM PDGRNYLWVA RTVERRAARY GQPGKTFAIG IGCELRHAHR LVYARGLDLS
DANATPIGAG CRVCERAGCS QRAFPAIGKA LDIDEHRSTV SPYLVK
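
Both sequences above are printed in blocks of ten residues separated by spaces, so the spacing has to be removed before measuring length or base composition. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the GC calculation assumes the DNA contains only unambiguous A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    # str.split() with no arguments splits on any run of whitespace.
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Assumption: the input holds only A, C, G, and T (no ambiguity codes).
    """
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCCAAGA CCTTCGTCGG TGCCCGGCTC CGCGGTCTGC GCAAGGAGCG CGGATTGTCG"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence through the same two helpers gives a length and GC fraction that can be compared with the Gene Length and GC content fields.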