Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3836 |
Symbol | |
ID | 9158016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3959347 |
End bp | 3960879 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003648750 |
Protein GI | 296141507 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGGCA CCATTGCATC GGTCCTGCAC ACCGCCGTCG AGCGGTTCGC GGACCACCCC GCAGTGGTCG ACGGGGATCT GCGCATCAGC TACGCGTCGC TCCGCGATCG GTCGCGGTCA GTGGCGAAGG CATTGCTCGC GGAGGGAGTG CGACCGGGAG ACCGGGTGGC GATCTGTGCG CCGAACGGCC ACGAGTGGAT CGAGGCCGCC CTCGGCCTCG CCACCATCGG TGCGGTACTC GTACCGGTGA ACACCCGCTA CACGGGTCCG GAGATCGTTG ATCTGCTCGA GCGCACCCGC GCCCGCGCCT TCGTCGTCGC CGGCACGTTC CTCGGCGTCG ACCGCCTGAA CCTAGTACAC CAGAGCGCCG GGGGACTGCC CGAAAACGTC GCGACGGTGC TGCGCCTGCC CGAGTGGGAC TCGTTGGCCG CGCGCGGATC CGGCATCACC GACACGGAAC TCGACGTTAT CGGCGCCGAG GTCACTCCGC AGTCACCATC GGACATCTTC TTCACTTCCG GTACGTCGGG CCGGAGCAAG GGCGCGGTGA GCACGCATGC GCAGACGCTC GCGAATGCGG CGAACTGGGC CGAACTCGTC GGCGTCACCG ACACCGACCG TTACCTGATC CTGAGCCCGT TCTTCCACAT CTTCGGATAC AAGGCGGGCA TCCTGGCGGC ACTGCAGCGC GGCGCAACGA TGTATCCCGC GCAGACCTTC GATGTGGTTC AGGCCTTCGA TCTCATTCAC CGCGAACGCA TCTCGGTTCT CCCGGGCGTA CCGACGATCC ATCAGATGAT GCTCGACCAT CCGGACCGTG AGAAGTACGA CCTGTCCAGT CTGCGCGCCG CGACCACCGG CGCCGCCACG ATCCCGGTGG TACTCATCGA GCGGATGCGC GACGAGCTGC GGTACGACCG CGTGCTCACC GCGTACGGGC TCTCCGAGGC ACCCGTGGTC ACGATGTGCC GGGCCGACGA CGATCCCGCC GTCATCGCCA CCACATCGGG CCGGGCGGTG CGCGACATGC AGGTGCGGAT CGCCGACGAT GGCGAAATCC TGGTACGCGG TCCCAACGTG ATCGCGGAGT ACTTCGAAGA TCCGGATGCC ACCGCCCGAG CCTTCGACGC CGACGGTTGG TTCCACACCG GCGACGCCGG GAGCATGGAC GAGGCGGGCA ATCTCCGGAT CACCGACCGG ATCAAGGACA TGTTCACCAA CGGCGGCTTC AACGTCTATC CGGCCGAGGT GGAGCAGGTG ATCGCGCGGA TCGAGGGGGT TGCCGAGAGC GCCGTCGTCG GCGTCCCCGA GCCACGGCTC GGCGAGGTGG GCAAAGCCTT CGTGGTACTC ACCGGGAGCC GCGAGCTCAC CGAGGATGCG GTGATCGCGC ACTGCCGCGA GTCGCTCGCC AACTTCAAAG TCCCTCGCAG CGTGGAATTC GTCAGCGAAC TTCCCCGCAA CGCAACCGGA AAGGTGCTCA AACGCGTTCT CCGCGGCGAG CCCGATCCCG CACCAGGAGA GAAGAATCGA TGA
|
Protein sequence | MSGTIASVLH TAVERFADHP AVVDGDLRIS YASLRDRSRS VAKALLAEGV RPGDRVAICA PNGHEWIEAA LGLATIGAVL VPVNTRYTGP EIVDLLERTR ARAFVVAGTF LGVDRLNLVH QSAGGLPENV ATVLRLPEWD SLAARGSGIT DTELDVIGAE VTPQSPSDIF FTSGTSGRSK GAVSTHAQTL ANAANWAELV GVTDTDRYLI LSPFFHIFGY KAGILAALQR GATMYPAQTF DVVQAFDLIH RERISVLPGV PTIHQMMLDH PDREKYDLSS LRAATTGAAT IPVVLIERMR DELRYDRVLT AYGLSEAPVV TMCRADDDPA VIATTSGRAV RDMQVRIADD GEILVRGPNV IAEYFEDPDA TARAFDADGW FHTGDAGSMD EAGNLRITDR IKDMFTNGGF NVYPAEVEQV IARIEGVAES AVVGVPEPRL GEVGKAFVVL TGSRELTEDA VIAHCRESLA NFKVPRSVEF VSELPRNATG KVLKRVLRGE PDPAPGEKNR
|
| |