Gene Information |
Locus tag | Tpau_3990 |
Symbol | |
ID | 9158172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4119668 |
End bp | 4121074 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transcriptional regulator, GntR family with aminotransferase domain |
Protein accession | YP_003648900 |
Protein GI | 296141657 |
COG category | |
COG ID | |
TIGRFAM ID | |
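The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values fit that convention) and that the final codon is a stop codon not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tpau_3990.
length = gene_length(4119668, 4121074)
print(length)                  # 1407
print(protein_length(length))  # 468
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.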
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCAACG ATAGCAGCGA CGGCCGCTTG GCGACGACGC TTCGCACCTG GATCGAGTCG GCACCGCCCG GTGCACAGCT CCCGTCCACC CGAAGCCTGG TGACGGCACA CCAGGTCAGT CCCGTCACGG TGCAGCGCGC CCTGCGTACA CTCAGCGCGC AAGGACTGAT CGAATCTCGA CCCGGTGTAG GAACTTTCGT GCGCAGAGAG CGAGCTACCC GGCCTGTCGA TTTCGGTTGG CAGACCGCGG CACTCGGCGT ACAGGGGCGG CGCACACCTG GTACGCCCAC CGCGCTCCGG CCGAACGACG GCGACGCCAT CGCCCTTCAG TCGGGATACC CCGCTCGCGA ACTGCTACCC GAACGACTCG TGCGTACTGC CTTCGCCCGA GCGGCGCGCA ATGAGGCCTT GCTCGATCGC ACGCCCAGCG CGGGGATCGA CGGATTGCGC TCCTGGTTCG CCACCGAATT GCAGTCGCTC ACACCCGCCC GAGCGGCGCC CGCGGTGGCC CGTGACGTGG TGGTGACTCC CGGCAGCCAG GGCGGCCTCC TCACAGCCTT CCGCGCCCTC GTCGGCGCGG GCCGGCCGCT CCTGATGGAG TCGCCCACCT ATTGGGGCGC CATCTTGGCG GCCGAGAGCG CCGCTGTCGA CGTGATTCCG GTTCCCAGCG GACCCGACGG CCCTGACCCC GACGTGGTCG AGCGCATCCT CGCCGAGACC GGGGCTCGCG CCTTCTACGC GCAGCCCGCC TTCGCCAGCC CGACCGGCGC GGTGTGGAGT ACCGAACGGT CGGAACGCAT CCTGGACGCG GTACGCGCCG CCGGGGCATT CCTCATCGAG GACGACTACG CCCGGGATTT CGGGATCGAC GCGGAGCCGA CGCCGCTGGC CGCCCGCGAT GATTCCGGTC ATGTCGTGTA CATCCGGTCG CTGAGCAAAT CGGTAGCGCC GGCGGTCAGG GTGGCGGCGA TCATCGCGCG AGGCCCTGCC CGCGACCGGA TCCTGGCCGA TATCCAGGCC GACGCGATGT ACACGAGCGG CGTTCTGCAA ACCGTGGCAC TCGATGTGAT CACGCAGCCT GCGTGGCGCA CTCATGTCCG GCGGCTAGGG CGACAACTCG GCGAGCGCCG AGACCTCTTG CTGCGCTCCC TCCGCGACCA CGCACCGTCG GTCACGGTGA ACCGGGTGCC ACGCGGCGGG CTCGCCCTCT GGGCACGACT GCCCGACGAC GTGGACCTGG CTGCTCTCGT GGACGAGTGC CGGCGGAACG GGCTCATCGT GGGTTCCGGC GACGAATGGT TCCCCGCCGA ACCCACGGGG AATCATCTGC GGCTGGCGTA CGCCGGACCT GAGCCCTCCC GGTTCGAGGA GGCGGCGAAG ATCCTGGGGG CCGCCCTCAC CCGGTGA
Protein sequence | MSNDSSDGRL ATTLRTWIES APPGAQLPST RSLVTAHQVS PVTVQRALRT LSAQGLIESR PGVGTFVRRE RATRPVDFGW QTAALGVQGR RTPGTPTALR PNDGDAIALQ SGYPARELLP ERLVRTAFAR AARNEALLDR TPSAGIDGLR SWFATELQSL TPARAAPAVA RDVVVTPGSQ GGLLTAFRAL VGAGRPLLME SPTYWGAILA AESAAVDVIP VPSGPDGPDP DVVERILAET GARAFYAQPA FASPTGAVWS TERSERILDA VRAAGAFLIE DDYARDFGID AEPTPLAARD DSGHVVYIRS LSKSVAPAVR VAAIIARGPA RDRILADIQA DAMYTSGVLQ TVALDVITQP AWRTHVRRLG RQLGERRDLL LRSLRDHAPS VTVNRVPRGG LALWARLPDD VDLAALVDEC RRNGLIVGSG DEWFPAEPTG NHLRLAYAGP EPSRFEEAAK ILGAALTR
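The gene and protein sequences above are shown in space-separated blocks of 10 characters. As a rough sketch of how the two relate, the snippet below strips that grouping, computes a GC fraction, and translates codons until the first stop. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons, not in codon meanings). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGTCCAACG ATAGCAGCGA CGGCCGCTTG"
print(translate(prefix))    # MSNDSSDGRL
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first block of the listed protein sequence. Pasting the full gene sequence in place of `prefix` applies the same steps to the whole CDS.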