Gene Information |
Locus tag | Tpau_4103 |
Symbol | |
ID | 9158291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4227826 |
End bp | 4229463 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003649011 |
Protein GI | 296141768 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
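The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4227826
END_BP = 4229463
GENE_LENGTH_BP = 1638
PROTEIN_LENGTH_AA = 545


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record: 1638 bp and 545 aa.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print(span_length(START_BP, END_BP), protein_length(GENE_LENGTH_BP))
```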
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACGA GCCTGGCCTC CGATCTGCTT CCTCAGGGTC ACGCACTCTT GCGCATGATC CAGCGACGGG TCGTGGACCC CGCTCGCCCG GACGTCGCTC TGCGTGCCCT CGGCTACAAC CGGCAGTACG GCCCGCAGGC CGCGCTGGTG ATCAAGGGGG CCGCGGAGAA CCCGGACCGG GCCGCGATCG TCGACGAGCA CGGCACCCTC ACCTACGCCC AGTACGAGGC GCAGTCGAAT GCGCTGGCCC GCGGCCTGCG GTCGACCGGC CTCAAGGCGG GCGATGTGAT CGCGGTGCTG GCGCGCGATC ACCGCGGGCT CATGCTGATC ATCAGCGCCG CGGCTCGGGC CGGGCTACGG CTGGCCATGA TGAACACCGG CTTCGCCAAG CCCCAGTTCG CCGAGGTGTG TGCGCGCGAG AAGGTGCAGG CGGTGTTCCA CGACAGCGAG TTCACCTCGC TGCTGGACGC GCTGCCCGAC GATATGCCCC GTTACCTCAC CTGGGTCGAC GACACCGACA CGATCCCCGA GGGCGCGCAG ACCATCGACC AGCTCGCCGC GGGCCGCGAG ACCAAGCGGG TGCCCCCGCC GGCGCAGCAG GGCGGCTTCA TCATCCTCAC CTCCGGCACC ACCGGCCTGC CGAAGGGCGC CACCCGGAGC AAGGTGCCCT CGCTCGCGAC CGCGATGCTG GTCGATCGCA TTCCGTTCCA GCGCCGCGGC ACCGTGGTGA TCGCCTCGCC GATCTTCCAC TCCACCGGCT TCGCGATGTG GTCGGCGGGA ATGTCGGTGG GCTGCACCAC CGTCACCATG CGCCGTTTCG ATCCCGAGAA CACGCTCAAG CTGATCGCCG ACAACAAGGC CGACATGCTG GTCGCGGTAC CCACGATGCT GACCCGCATG CTCTCGCTCC CCGCCGAGAC CCTGGCGAAA TACGACACCA GCTCGCTGAA GTCGGTAGTG GTCGCGGGTT CGGCTGTCTC ACCGGAGCTT TCGGAGCGAT TCCAGGACAC GTTCGGTGAC GTGCTCTACA ACGTCTACGG TTCCACCGAG GTCGCCGTGG CCACCGTGGC GACGCCGCAG AACCTGCGGA CCGCGCCGGG CACCGTCGGT AAGCCGCCGG TCCTGACCAC GGTGCGGCTG TACGACGAGA ACGATCGCCT GGTCGAGGGA GTCGGTGTGC GCGGCCGCGT GTTCGTCCGC GCCGGTGCGC CCTTCGAGGG CTACAGCGAC GGCCGCACCA AGCAGATCAT CGACGGCCAT CTCTCGTCGG GCGATATGGG CCACTGGGAC GGAAACGGCC TGCTGCACAT CGACGGTCGT GATGACGACA TGATCGTCTC CGGCGGCGAG AACGTGTATC CACTCGAGGT GGAGAACCTT CTGGTGACCC GCGACGACGT CGTCGAGGCT GCGGTGATCG GTGTGCCCGA TGAGGAGTTC GGTCAGCGGC TGCGCGCCTT CGTGGTGCTG TCCGACGGTG CACCCGAGGG CGATGGCGAG GAGCTGACCA AAGACCTCAA GGACTTCGTC CGCGGGAATC TGGCGCGGTT CAAGGTGCCG CGCGACGTCG TCTTCCTCGA CACGCTCCCC CGCAACCCCA CCGGCAAGAT CGTGCGCCGG GAACTCCCCA AGGACTGA
|
Protein sequence | MGTSLASDLL PQGHALLRMI QRRVVDPARP DVALRALGYN RQYGPQAALV IKGAAENPDR AAIVDEHGTL TYAQYEAQSN ALARGLRSTG LKAGDVIAVL ARDHRGLMLI ISAAARAGLR LAMMNTGFAK PQFAEVCARE KVQAVFHDSE FTSLLDALPD DMPRYLTWVD DTDTIPEGAQ TIDQLAAGRE TKRVPPPAQQ GGFIILTSGT TGLPKGATRS KVPSLATAML VDRIPFQRRG TVVIASPIFH STGFAMWSAG MSVGCTTVTM RRFDPENTLK LIADNKADML VAVPTMLTRM LSLPAETLAK YDTSSLKSVV VAGSAVSPEL SERFQDTFGD VLYNVYGSTE VAVATVATPQ NLRTAPGTVG KPPVLTTVRL YDENDRLVEG VGVRGRVFVR AGAPFEGYSD GRTKQIIDGH LSSGDMGHWD GNGLLHIDGR DDDMIVSGGE NVYPLEVENL LVTRDDVVEA AVIGVPDEEF GQRLRAFVVL SDGAPEGDGE ELTKDLKDFV RGNLARFKVP RDVVFLDTLP RNPTGKIVRR ELPKD
|
| |
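The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how they might be cleaned up and sanity-checked follows: it joins the blocks, computes the GC fraction, and translates the nucleotide sequence. The helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in the start codons it permits). Only the first two blocks of the gene sequence are used here; pasting the full sequence in place of `FIRST_BLOCKS` would let the result be compared with the GC content and protein sequence listed in this record.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# First two 10-base blocks of the gene sequence above.
FIRST_BLOCKS = "ATGGGAACGA GCCTGGCCTC"


def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    dna = join_blocks(FIRST_BLOCKS)
    print(len(dna), gc_fraction(dna), translate(dna))
```

Translating only the leading codons gives a quick check that the gene and protein sequences are in the same reading frame before running the full comparison.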