Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1267 |
Symbol | |
ID | 9155411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1303820 |
End bp | 1306765 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003646237 |
Protein GI | 296138994 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGACA TCCTGGGAGG ACTGCCGGGC GCGAGGGCAC TGGAGGCCTC GGTGAACCGG CTCGCGGCTA CCGCGCGCAA CGGCATGGAG GTGCTGCGGC AGGGCGGCCT GGAGACCGGA GCCAAGCCCT CGCCGTTCAC CGTCGTCGAG CGCAGCCCGA TGTACCGGCT GCGGCGCTAC TACGCCGAGA CCGCGCCGGA CCCCGCCGAC GACGATCGTC CGGCGATCCT GCTCGTCCCG CCGATGATGG TGGACGCCAA CGTCTTCGAC GTCACCGCAG ATAAGGGCGC GGTCTCCGTG CTGCACCGAG CCGGTCTCGA CCCGTGGGTG ATCGACTTCG GTTCGCCCGA CCGGGAGGAG GGCGGTCTCG AGCGCAACCT CGCCGACCAT GTGGTCGCGA TCTCGCGCGC CATCGACCAG ATCGTGGCGC TGCGCGCCCG CGATGTGCAC CTCGGCGGGT ACTCGCAGGG CGGGATGTTC TGCTATCAGG TTGCCGCCTA CCGGCAGTCC CGGAGTCTCG CCAGCCTCAT CACCTTCGGC AGCCCCGTCG ACATCTCGGC CGGACTCCCG CTCGGCGCGC CGCCCGCTCT GGTGAACAAG GGCGCCGAGT TCGTCGCCGA CCACGTCTTC AACCGCTTCT TCCTCCCCGG CTGGATGGCG CAGCGCGGTT TCGAACTGCT CAATCCCGTC AAGGCGGTCC GCTCACGGGT TGATTTCGTG CGCCAATTGC ACGACCGGTC CGCACTGCTG CCCCGGGAGG ACCAGCGCCG GTTCCTCGAG TCCGACGGTT GGGTGGCCTA CTCCGGTCCC GCCGTCGCCG ACCTGCTCAA GCAGTTCGTG GTGCACAACC GAATGCTCAC CGGTGGCTTC TCCATCGACG GCGAGGCCGT TTCGCTGGCC TCGATCACCT GTCCCGTGCT CGCCTTCGTC GGACTGTCCG ACCAGATCGG ACGGCCCAGC GCGGTGCGCG GGATCCTGCA GGCGGCACCG TCGGCGCCGG TCTACGAGGC GCAGGCCTCC GCCGGACACT TCGGGCTCGT GGTGGGCAGT ACTGCCGGTA GCGTCACCTA TCCGACGGTG GCCGAATGGG TGAAGTGGCG CGAGGGCCGC GGCACCGAGC CGGCGAATAT CGACCGGATG CAGCCCCATG ACGGCCTCGA CGCGCCGGTG AATCCGCTCG TCGCCGGTGT CGCAGGCCTC GCGACCGTCG GTTTCGTCGC CGCCCGCGAC GTGATCGACG CTGCGGCCGG CGCCGCTGCG GGCGCGAGCG CGGTGGCCCG CGAAGTGACC CGCGGCCTGC CCAAACTGGC CCGCTTGGAC CAGCTTCAGG CGCATACCCG CGTTTCGCTG GGCAAGCTGC TCGCCGAGCA GGCCGCGAAG GACCCGTACG GCGAGCTGTT CCTCTACGAG GACCGGGTGC ACACAAAGCA GGCGGTGAAC GAGCGCATCG ACCGCGTGGT GAGCGGACTG CTGCAAGTGG GCGTCCGACA GGGCGAGCAC GTGGGCGTGC TCATGCACAC CCGTCCCTCG GCGCTGGTCA CCATCGCTGC GCTGTCCCGG CTCGGTGCGG TGTCCGTGCT GCTGTCGCCG GGCACCGACT ACGCGGCCTC GCTGAAACTG GGCGAGGCCA CCTCCGTCAT CACCGACCCG GAGCACGCCG AGGAGGCGAG TGCGGTCGCC GAGCGCGTCT TCGTCGTCGG CGGCGGTGAC CGTCGCGACC GGCCGGGTGC ATCGCGCCAG GACCTGGCCG CCGAACTGGT CGACCTCGAG CAGGTCGACC CGGACAACGT CCGGATCCCG CGCTGGTACC GCCCCGATGC GGGGCTCGGC CGCGACCTGG CCTTCGTCAT GTTCTCCGCA GTCGGCGGTA CGCTGCGCGC GAAGCGGGTC ACCAACGGCC GGTGGGCGCT GTCGGCCTTC GGTACGGCCT CGGCCACCCG GCTCTCGGAG AGCGATACCG TCTACTGCCT CACCCCGCTC AGCCACAGCT CCGGCCTGAT GACCAGCCTC GGCGGCGCGC TCGCCGGTGG TTCCCGGATC GCGCTCACCC GCGATTTCGA TCCGGCGCGG TTCATGATCG AGGTGCAGCG CTACGGCGTC ACGGTGGTCT GTTACACCTG GAACCTGATG CGGGCCGTGC TCGACGAGCC CGACCTGCAG ATCCCGCGCT ACCACCCGAT CCGCGCGTTC ATCGGCTCCG GCATGTCGTC CGAACTCTCC CGCCGCGTGA GCGAAGCCTT CAGCGCCCGC GTCGTCGAGT TCTACGCCTC CACCGAGGGA GAGATCGTGC TCGCCAAGGT GGGCGGCGGC AAGCCGGGCG CCAAGGGGCG CCGGCTGCCC GGCAGCGCCG AGGTGACACT GGTGGACTTC TACATCGACT CCGGACGGTT CGTCGAGACC GACGACGGTT ACCTCCAAGA GGTCGAGCGG GGTCAGGTGG GCGTCCTGAT CGGCCGTGCC GACCCGGATA CCACCCAGCG CGATGTGCTG CTGCGGGGTG CGTTCCGGCC GGGCGACGCC TGGTTCTCCA CGGGACATCT GTTCCGGCAG GACGACGACG GTGACTACTG GCTCGTCGAC GATGTCCGCA CCGTCGCCCT GACCGAGCGG GGCCCGGTGT ACTCGATCCC GATCGCCGAC GTCCTCGAGC AACTCGGCCA GGTGGACCAG GCGGTCGTGT ACCGAGTACC GGGGGAGACC GAGGCGGCAC CGCCCCGGGT GGTCGCCGCG GTCACGCTGC GCCCCGGCGG AGCGCTCACC GCTCATGAGG TGACCGACGC CTTCGCCGGG CGGCACGAGC AGTATCCCGA TGCGGTCCAG GTGGTCGACG AGGTGCCGCT CACCAGTTGG TACCGGCCCC GTGGCGGCGA ACTCGCGGCT CGGGGCATGC CTGAGCCGGG TCCCGCGTCC TGGCGTTACG ACGCCGTCGC GGCTCGGTAC GTGTAG
|
Protein sequence | MVDILGGLPG ARALEASVNR LAATARNGME VLRQGGLETG AKPSPFTVVE RSPMYRLRRY YAETAPDPAD DDRPAILLVP PMMVDANVFD VTADKGAVSV LHRAGLDPWV IDFGSPDREE GGLERNLADH VVAISRAIDQ IVALRARDVH LGGYSQGGMF CYQVAAYRQS RSLASLITFG SPVDISAGLP LGAPPALVNK GAEFVADHVF NRFFLPGWMA QRGFELLNPV KAVRSRVDFV RQLHDRSALL PREDQRRFLE SDGWVAYSGP AVADLLKQFV VHNRMLTGGF SIDGEAVSLA SITCPVLAFV GLSDQIGRPS AVRGILQAAP SAPVYEAQAS AGHFGLVVGS TAGSVTYPTV AEWVKWREGR GTEPANIDRM QPHDGLDAPV NPLVAGVAGL ATVGFVAARD VIDAAAGAAA GASAVAREVT RGLPKLARLD QLQAHTRVSL GKLLAEQAAK DPYGELFLYE DRVHTKQAVN ERIDRVVSGL LQVGVRQGEH VGVLMHTRPS ALVTIAALSR LGAVSVLLSP GTDYAASLKL GEATSVITDP EHAEEASAVA ERVFVVGGGD RRDRPGASRQ DLAAELVDLE QVDPDNVRIP RWYRPDAGLG RDLAFVMFSA VGGTLRAKRV TNGRWALSAF GTASATRLSE SDTVYCLTPL SHSSGLMTSL GGALAGGSRI ALTRDFDPAR FMIEVQRYGV TVVCYTWNLM RAVLDEPDLQ IPRYHPIRAF IGSGMSSELS RRVSEAFSAR VVEFYASTEG EIVLAKVGGG KPGAKGRRLP GSAEVTLVDF YIDSGRFVET DDGYLQEVER GQVGVLIGRA DPDTTQRDVL LRGAFRPGDA WFSTGHLFRQ DDDGDYWLVD DVRTVALTER GPVYSIPIAD VLEQLGQVDQ AVVYRVPGET EAAPPRVVAA VTLRPGGALT AHEVTDAFAG RHEQYPDAVQ VVDEVPLTSW YRPRGGELAA RGMPEPGPAS WRYDAVAARY V
|
| |