Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2064 |
Symbol | |
ID | 9156219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2149777 |
End bp | 2153139 |
Gene Length | 3363 bp |
Protein Length | 1120 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003647015 |
Protein GI | 296139772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.203107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGCGAC GCCACCTCGG CCAGTACCGC CTCACCCGCC TGCAGGTGGT GAACTGGGGC ACCTTCGACG GCTACAAGGA CTTCCCCATC GACGAGCGCG GGGTGATCCT CACGGGCCCC TCGGGTTCGG GCAAGAGTTC GCTGATGGAC GCGCACTCGG TGGTCCTGCT CCCCACCTAC GACCAGTCCT TCAACGCCTC GGCGGATATG ACGGCGAAGG GCGCCAAACG TGCCGCCCGA TCGATGGCCG ACTACGTGCG CGGCGCCTGG TCGGTCAACG ATGACGAGCA CCAGCAGTCC AAGGTGCAGT ATCTGCGCGG AGATTCGGCC ACCTGGTCAG CGGTCGCCGC GACCTATGAC AACGGCGCGG GCACCGTGAC CACCGCGGTG GCGGTGAAGT GGTTCGCCGG TTCCGGCACC GACGGTTCCT CACTGAAGTC CTGGTACCAC CTGCACTCGG GCCCCTTCGA TCTGCTCGAC TTGCAGGACT GGGCGACTGC GGGCTTCGAC ACCCGCGGCT GGCGATCACG CCATGCCGAC GTCGAATCCT TCGACACCCA GGCTGCCTAC CAGGAGGCGC TGCGCCGGCG CGTGGGCATC GGTACCTCGT CGGCCGCGCT CGCCCTGCTG GGCAAGGCCA AGGCGATGAA GAACGTCGGC GATCTCGACG CCTTCATCCG CACCTACATG CTGGACCGGC CGCAGACCTA CGCCCGCGCC GAGCGGCTGG TGGAGAACTT CACCCAGCTC GACGAGGCCT ACCGTGCCGC CGCCCGCGCC GAGGCGCAGG AGAAGGTGCT GCGCCCCATG CCCGAGGCGT ACGAGAAGTA CCGCACCGCC ACCGTCGGGA CGACGCGCGC GGCCGATCTC CTGGGCGCCC CGGTGCGCTC GTATCTGCGT GGTCATCGGC TCCGGTTGCT CCAGGAGGAG ATGGACCGGA TCGAAGGCGA GCGCGCGGGT ATGGACGCGC AGATCGCCCG CTGGGAGGAC AAGGCCGAAG AGCGCAAACA GCGCTACAAC GATCTGCTGC ACCGGCTGCG CCAGGAGCAG GGCGAAGTGG GTGTCCTGGA GGCCGAACTC CAGTCCCGGG AATTGCAGAC CCAGGCCCGC CAGCGCGCCT ATGAGCGGTT CGCCACCGCA GTCGAGAAGC TCGGAGAGCG GGCGCCGGAG TCCGCGGAGG AATTCGCGGC CCTGCGCGAA CAGGTCCCGG GCATCCGTGA TCGCGCCACC GAGGCCGCCG AGGCGCTCGC CGGTCGCGTC CACAGCGCCT ACACCGACGA ATCCGACGCC CGGCGCGCCG TCGAATCCGT CGACGACGAG CTGGCCGTGC TGGATCGGCG CGCATCGCTG CTGCCACCCG CGCTACTCGA CCAGCGCGAC GTGATCGCCC GCTCCACCGG TGTCCCCGCC GAGGAGCTCC CGTACGCGGC GGAGTTGATC GATGTGAAGC CCGCGGAGCA GGCGACGTGG GCCGCCGCCG CCGAGCGCGT GCTGCGCCCG CTCGCCACCA CGCTGCTGGT GCCCGCGCGC CACCAGCGCG CGGTGGCCGA TCACGTGAAC GCCCATCGTG TGCAGGGCGT CCTCACCTAC CAGGTCGTCG AGGGGCCCGA CCGCACTGCC GCGGTCGAGG GTTCGCTGGC CACCAAGCTC ATCGTCGACG AGCGCACCGA CGCCGGACAG TGGCTCGCGG GCGCGGTGGC GCGGGCGGCC CGCCACATCT GCGTCGACTC CCCCGCTGAG CTCGAGGACC ACGACCAGGC GGTTACGCCG GAGGGCCTGG TCAAGGGAGC GGGCGGGCGG TTCCGCAAGG ACGACCGCCG CGCCGTCGCC GACCGGTCGC AGTGGGTGCT GGGAACAAAT ACCGGGTCCA AGCGCGAGGC GCTGCAGCGC CGCCGCGACG AATTGGCGTC GGTGCACGTC AAAGCCGCCG AGGCGTCGGC GACGCTGCGT GATGACCTCG CCGAGTACAA GGCCGTGGCC GACGGGGCCA CGGGGGTGCT CGGCTACGCG TCCTGGTCGG AGCTCGATCA CTGGGCGGCG AAAGCCGAGG CCGAGGACCT CGCCGACCGG ATCTCCGACG CCCGGTCGGG CAATGCCGAC CTGAGCGAGC TGGAGCTGCA GGCCGACAAC GCCTACGGCG AACTCGCCGA GGCGCAGACC CGGATCGGCG CGCTCACCGA GACGATGAAC CGTGCCGGGA GCGAATTCGA TGATCTGCTG GCCGAATTCG AGAAGCTCGA CGGTGAGAGC GCCCCGAAAC TGACGGACGA GGACCGGGAG TTCCTGGACA CCGCGCTGTG GCATCTGCTC ACGGCCGACG AAACCCGGCG GCGCCGCCTG ACACTGGACA CCTACGGCGA GATCCGCGCG GATCTGCGAG CCGAGTTGGA ACGCCAACAG GCCGCCGCGA CCGCCGAGCG CGATGCGGCG GAGACCCGCA TCGTGCGCAC CGCCGAACAA TTCCTGCGCG AGTGGCCGGA TGCCTCCACC GAGCTCCGGG CCGCTGTGGC GTCGGCACCG GACTTCGTCG CGGTGCACGA GAACCTGATC GCACACGGGC TGGCCGCCGC GACGGAGAAG TTCCGGCGGC TCATCACCAC CGACGTCAGC CACTCGGTCT CGAACCTTTT CAAAGAAGTC GATGACACGC ACCGGGCGAT CACCCGCGGT ATCGCGGACG TCAACGCCGG CCTGCGCCGG GTGGAGTTCA ATGAGGGCAC CTACCTGCAG ATCGCGTACG CCGCACGGCC GACGCCGGAG GCCACGGAGT TCGCGACTCT GGTCGACGAG ATGGTGCGCG ACGCGCCGGC CGCGAAACGG GCCGAACCCG AGGCGATGGC CGCACAGTTC AAGCGGATCC GCGGCCTGGT GTTGCGCCTG ACCGGCGACG ACCCCGAGTC GCGCAGGTGG ACCGAGAACG TATTGGATGT GCGCACCGGC TATTCGTTCT ACGGCCGGGA GAATGCGGTC GGCGCCGAAC CCGACGCGCC GGCGGTGGTG ACCTACCGCA ATACCGCCAC CAATTCCGGT GGTGAACAGG AGAAATTGGT GGCCTTCTGC CTGGCCGCGG CGTTGAGCTT CGCACTCGGC AGCCATGGCA TCGACGGATC GAACGAGCCG GCGTTCGCGC CGCTCATGCT CGACGAGGCC TTCAGCAAAT CCGATGAACG GTTCTCGGCG CAGTCCCTGC GCGCGTTCGA GCAGTTCGGT TTCCAGTTGA TCATCGCCGC CCCGATCCGC ATGGTGGGCA TCGTCGAACC CTTCATCGGG CAGGTCATCC TGGTCGACAA GCACACCGGG GCGGACGGTG CGCGCTCCGA CGCCCGGTAC GCCACCTTCG GTGAGCTGAC GACGGCGCGC TAG
|
Protein sequence | MTRRHLGQYR LTRLQVVNWG TFDGYKDFPI DERGVILTGP SGSGKSSLMD AHSVVLLPTY DQSFNASADM TAKGAKRAAR SMADYVRGAW SVNDDEHQQS KVQYLRGDSA TWSAVAATYD NGAGTVTTAV AVKWFAGSGT DGSSLKSWYH LHSGPFDLLD LQDWATAGFD TRGWRSRHAD VESFDTQAAY QEALRRRVGI GTSSAALALL GKAKAMKNVG DLDAFIRTYM LDRPQTYARA ERLVENFTQL DEAYRAAARA EAQEKVLRPM PEAYEKYRTA TVGTTRAADL LGAPVRSYLR GHRLRLLQEE MDRIEGERAG MDAQIARWED KAEERKQRYN DLLHRLRQEQ GEVGVLEAEL QSRELQTQAR QRAYERFATA VEKLGERAPE SAEEFAALRE QVPGIRDRAT EAAEALAGRV HSAYTDESDA RRAVESVDDE LAVLDRRASL LPPALLDQRD VIARSTGVPA EELPYAAELI DVKPAEQATW AAAAERVLRP LATTLLVPAR HQRAVADHVN AHRVQGVLTY QVVEGPDRTA AVEGSLATKL IVDERTDAGQ WLAGAVARAA RHICVDSPAE LEDHDQAVTP EGLVKGAGGR FRKDDRRAVA DRSQWVLGTN TGSKREALQR RRDELASVHV KAAEASATLR DDLAEYKAVA DGATGVLGYA SWSELDHWAA KAEAEDLADR ISDARSGNAD LSELELQADN AYGELAEAQT RIGALTETMN RAGSEFDDLL AEFEKLDGES APKLTDEDRE FLDTALWHLL TADETRRRRL TLDTYGEIRA DLRAELERQQ AAATAERDAA ETRIVRTAEQ FLREWPDAST ELRAAVASAP DFVAVHENLI AHGLAAATEK FRRLITTDVS HSVSNLFKEV DDTHRAITRG IADVNAGLRR VEFNEGTYLQ IAYAARPTPE ATEFATLVDE MVRDAPAAKR AEPEAMAAQF KRIRGLVLRL TGDDPESRRW TENVLDVRTG YSFYGRENAV GAEPDAPAVV TYRNTATNSG GEQEKLVAFC LAAALSFALG SHGIDGSNEP AFAPLMLDEA FSKSDERFSA QSLRAFEQFG FQLIIAAPIR MVGIVEPFIG QVILVDKHTG ADGARSDARY ATFGELTTAR
|
| |