Gene Information |
Locus tag | Tpau_1519 |
Symbol | |
ID | 9155669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1589791 |
End bp | 1590858 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003646482 |
Protein GI | 296139239 |
COG category | |
COG ID | |
TIGRFAM ID | |
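The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpau_1519.
length = gene_length(1589791, 1590858)
print(length)                  # 1068
print(protein_length(length))  # 355
```

Both results match the Gene Length (1068 bp) and Protein Length (355 aa) fields in the record.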
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0248055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACGCC TGGGACAACA GGGCCCGCTG CCGCCCGGTG CACGGGACCA CACGGCAGCC AGGCCCGCGC CCGTGGTCGC CTACCTCATC GAACGTCTAG GGCGCGAGGG ACATCCGGTA CGGGCATGGG CCGACCGCGC GGGAATCGCG CGGATCGATC GGCTGTCCGG GCTGCGCCTG ACCTTCCCCC AGACGGTGGC GTTCATCACC GAGGCGGTCC GTGCCGCGCC CGATCGTCCG CTCGGACTCC AGGTGGGTGC CCGGCCGTTA CTCCAGTCTT TCGGGATGGT CGGCGTCGCG GTGCAGACCG CCGACGGTCT CGCCGCGGCA GTCGGGATCG GGCTGCGCCT GCACGAGGAG GCCGGCAGCC TGGTCGACTT CACCGTCGCC GAGGACGCCC GGACGGTCAG CGTCGGCGTG CTGCCCCGGT CCGATGTCGC CGAGATCCTG CCCTTCCTCT GCGAGGAGAC GCTCCTCAGT TCCCTCACCC TGGTTCGATC TGCGCTGGCG GACGAAGATC TCGCGCCCAT CGGCGTCGAG CTGGCGTACC CCGCGCCGTC GTACGCCGAT GTCTATGACG ATGTGTTCGG GTGCCCCGTG CGGTTCGACG CGCCCCGCAC CGCAGTGACG ATTCCCGGGC ACCTGCTCGA CCTCCCGCTG CCGGGCCGCC AGCCCGCCGT GCACGCGGCG GCGGTGGCCG CGTGCCGCGG GCTCATCGGA ACGGATGACG CCGATGAGCT CGACCACGTC TGGGCGGTGG AACAGTTGCT TCGCGCCGAT CTGGCGCGGG CCGCAACCAT CGCGACAGTC GCCCGCCACT TGCGGACCAC GGAGCGCACG CTGCGCCGGC GTCTGTCCGA CGGCGGTGAG TCGTTCCGGT CGATCCACAA CCGGGTCCGG CGTGAGCGTG CCGAATCCCT GCTGCGCAGC ACCTCGATGC CGATCGGCGA GGTCGCAGCT GCGGTCGGTT TCGCCGATGC CCGGGACTTC CGGCGGGCGT TCCGGGCCTG GACCGGCCGG ACCCCGGCCG ACCTGCGCGG GAGCGGAGAC GGCTCGGTGA GCGGATGA
|
Protein sequence | MLRLGQQGPL PPGARDHTAA RPAPVVAYLI ERLGREGHPV RAWADRAGIA RIDRLSGLRL TFPQTVAFIT EAVRAAPDRP LGLQVGARPL LQSFGMVGVA VQTADGLAAA VGIGLRLHEE AGSLVDFTVA EDARTVSVGV LPRSDVAEIL PFLCEETLLS SLTLVRSALA DEDLAPIGVE LAYPAPSYAD VYDDVFGCPV RFDAPRTAVT IPGHLLDLPL PGRQPAVHAA AVAACRGLIG TDDADELDHV WAVEQLLRAD LARAATIATV ARHLRTTERT LRRRLSDGGE SFRSIHNRVR RERAESLLRS TSMPIGEVAA AVGFADARDF RRAFRAWTGR TPADLRGSGD GSVSG
|
| |
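The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before they can be used with other tools. The sketch below shows one way to do that and to recompute a GC percentage such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the record rounds its own figure is an assumption left unchecked here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input, not the gene sequence from the record.
example = "ATGC GGCC"
print(clean_sequence(example))  # ATGCGGCC
print(gc_percent(example))      # 75.0
```

To apply this to the record, pass the full Gene sequence field to `gc_percent` and compare the rounded result with the GC content field.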