Gene Tpau_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_4179 
Symbol 
ID9158367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4301899 
End bp4303089 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content70% 
IMG OID 
Producttranscriptional regulator, TetR family 
Protein accessionYP_003649087 
Protein GI296141844 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCAGG CGCGGAGGAA GCGGCCCAAG GACCGGCGCG AGCAGATCGC GCGGGTGGCT 
GCAGAGGACT TCTCCCGGCG CGGCTACCAC GGTGTGGGCA TCGAGCAGAT CGCGGCATCC
CTCGACATCT CGGGTCCGGC TGTGTACCGG CACTTCCCGA ACAAGTACGC ACTGCTCGAA
CACGCGATCA CCTCGGCGTC GGACGCCCTG AGCGCGGCTG TCGCACAGGC GGCGCAGGAG
ACGGAGGGAC AGGACGCGGA AGCACGCATC GATGCCATCG AGGACGCGGC GCTGCGAGTG
GCCTTCGAGC GTCGCAACAC CGGTGGCCTG TTCCGCTGGG AGCAGCGACT CCTGGAGCGG
CAGGATCGGA ATCGGATCCG GGACCAGTTG ACGCAGATGA TCGGCACCTT CGCGGATGCC
GCTGAACCCG CCCTTCCGGG GGTGGGCCGT GCGGAGCGCG AGCTCCGTTG CGTCGGTGTC
ATCAGTGTCA CCGGAAGCGT GACCGCACAC CGCACGGTGC TCGCGCAGAG GCGCGCCGAA
GCAGTGCTCA GGACCGCGGG CCGGAGGTTG CTGGCGCTGC CCGCCCCGCC CACCGAGTAC
GTCCTCGCTC CGCCGCCGTC CTCCGTGGCG GGGACGGACG CGGGCCGTCA GGAGCAGGTC
CTCGACGAGG CCGTGGAGCT GATCTTCAGC CACGGATTCC ACAACGTGAG CATGGGGCAG
ATCGGTCAGG CGGCCGGCAT CGTTCCGTCC GGGATGTACC GGTACTTCCC GAACAAGGCC
GGGATCCTCG TGCGCGCGCT CGAACGATCC GGCGCGGCGA TGGTCGATGC GATCGCCGCG
GTGGTCGAGG CGAACCCCGA ACCCCGGGCC CGGCTCGCCG CCCTCGCGCA GGCCTACGTC
CAACTGTCCT TCGGGCAGTC GAAGTTGATG ACGGTCTACT TCCGCGAGAT CGGCAACGTG
CCGGACTCCG ACCGCAGCCG TCTCGCGAGC GTGCAGCGCG CCAACATCGC CGCGTTCGCC
GATGCCGTGA TGGCCGTGCG TCCGGATCTG GGTGCGGCCG AGGCCACGTT CCTCGTCCAC
GCGGCCTTCG CCGTGGTCTT CGACGTCGGA CGCACCCGCC GCTTCGACGC CGACCCGCAC
TTCCAGGCCG AGGTCTTCGC GATGGTGTGC GCGGTGCTCT TCGATTCCTA G
 
Protein sequence
MTQARRKRPK DRREQIARVA AEDFSRRGYH GVGIEQIAAS LDISGPAVYR HFPNKYALLE 
HAITSASDAL SAAVAQAAQE TEGQDAEARI DAIEDAALRV AFERRNTGGL FRWEQRLLER
QDRNRIRDQL TQMIGTFADA AEPALPGVGR AERELRCVGV ISVTGSVTAH RTVLAQRRAE
AVLRTAGRRL LALPAPPTEY VLAPPPSSVA GTDAGRQEQV LDEAVELIFS HGFHNVSMGQ
IGQAAGIVPS GMYRYFPNKA GILVRALERS GAAMVDAIAA VVEANPEPRA RLAALAQAYV
QLSFGQSKLM TVYFREIGNV PDSDRSRLAS VQRANIAAFA DAVMAVRPDL GAAEATFLVH
AAFAVVFDVG RTRRFDADPH FQAEVFAMVC AVLFDS