Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0465 |
Symbol | |
ID | 9154601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 490277 |
End bp | 493672 |
Gene Length | 3396 bp |
Protein Length | 1131 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003645445 |
Protein GI | 296138202 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACACCG ACACATTGGC CGCCGGTGCG GCTGAGACCA ACGCGTTGAC CCTGACTCTG GGCACCATCG GCGCGATCGC TTTCGCTCTC GGGTGGGTGG TGTTCTTCCG CGGGGCACTG CGCATGGTGC GGATCATCGC CAAGGGGCAG CCCGCGCCCG ACCGCTGGAC GCCGTTCATC CCGCGCGCCG CGGAGGTGGT GGTCGAGTTC CTCGCCCACA CCAAGATGAT CAAGAACCGC ACCGTGGGCA TCGCCCACTG GCTGGTGATG ATCGGCTTCC TCGCGGGCGC CATCCTGTGG TTCGAGGCCC TCCCGCAGAC CTTCGACCCG TCGTTCCACT GGCCCGTCAT CGGCTCGTGG CCGGTGTACC GCCTGATGGA CGAGCTGCTC GGTATCGGCA CCGTCGTCGG CATCGTCGTG CTGATCGTCA TCCGCCAGAT CAAGCACCCC CGCAACCCCG AGCTGTTCTC CCGGTTCGCC GGCAGCAAGT TCCTGCCCGC GTACACGATC GAGGCCATCG TCTTCGTCGA GGGCTTGGGC ATGATCCTGG TGAAGTCGAG CAAGATCGCC ACCTACGGCG ATGCCGACCC GATCACCGAC TTCTTCACCC GCCAGGTGGC CACGCTGCTG CCCGCCAGCC CGAACCTGAT CACAGTCTTC GCCTTCATCA AGCTGATGTC CGGCGTGATG TTCCTGCTTC TGGTGGGGAG CAACCTCACC TGGGGCGTCG CATGGCACCG GTTCTCCGCG TTCTTCAACA TCTACTTCAA GCGCGAGCTG GGCAGCCGCC GCGCGCTCGG TGCCGCGAAG CCGATGATGT CGGACGGCAA GGTGCTCGAG ATGGAGACCG CCGACCCCGA CACCGATGCC TTCGGCGCCG GCAAGATCGA GGACTTCTCC TGGAAGGGCT GGCTCGACTT CACGACCTGT ACCGAGTGCG GTCGCTGCCA GAGTCAGTGC CCCGCCTGGA ACACCGGAAA GCCGCTGAGC CCGAAGCTGC TCATCATGGG TCTGCGCGAC CACGGCTTCG CCAAGGCTCC CTACCTCCTG GCCGGTGGCA GCACCGATAT GGGAGGCGAC GAGACCGGCC TCGTGGACGC CGACGGCAAC CCGGACGAGA CCAAGCTGGG CAAGATCCCC GCGGAGGCGC GCGCCGAGGC GGCCCGTCCG CTGGTCGGAC CGGCCGGTGA GGGCGACGAC CTCGGCGGCG TCATCGACCC CGAGGTGCTG TGGAGCTGCA CCACCTGCGG CGCCTGCGTC GAGCAGTGCC CCGTGGACAT CGAGCACGTG GACCACATCG TCGATATGCG CCGCTACCAG GTGCTCATCG AATCGGAGTT CCCGCACGAG CTGGCCGGCC TGTTCAAGAA CCTCGAGAAC AAGGGCAACC CCTGGGGCCA GAACTCCAGC CAGCGCACCG CGTGGATCGA CGAGATGGAC ATCGACATCC CGGTCTACGG CGAGCACACC GATTCCTTCG ACGGATACGA ATACCTGTTC TGGGTCGGCT GCGCCGGTGC CTACGAAGAC CGCGCCAAGA AGACCACCAA GGCCGTGGCC GAGCTGCTCG ACGTGGCCGC GGTGAACTTC CTGGTCCTGG GCCAGGGCGA GACCTGCACC GGCGACTCGG CCCGCCGCGC GGGCAATGAG TTCCTGTTCC AGATGCTGGC GCAGCAGAAC ATCGAGCAGC TCAACGAGGT GTTCGACGGC CTGCCGGTGA ACCAGCGCAA GATCGTGGTC ACCTGCGCGC ACTGCTTCAA CGCCCTCGGA AACGAGTACC CACAGGTGGG CGGCCAGTAC GAAGTGGTCC ACCACACCCA GTTGTTGAAC AAGCTGGTCC GCGAGAAGCG CCTCGTTCCG GTGGCGCCCC CGAGCGAGGA CGTGACCTAT CACGACCCGT GCTACCTGGG CCGCCACAAC CAGGTCTACG AGGCTCCGCG CGAGCTGATC GGCGCCTCCG GCGCGACGCT GAAGGAGATG CCGCGGCACC TCGAACGGTC CATGTGCTGT GGCGCCGGCG GCGCGCGCAT GTGGATGGAG GAGCAGCTCG GCAAGCGCAT CAACATCGAC CGGGTGGACG AGGCGCTCGA GACACTGTCC TCGACGCCCG AGGCGTCGCC GAAGAAGATC GCGACCGGCT GCCCCTTCTG CCGCGTGATG CTCTCCGACG GTGTGACCGC GCAGACCGCG GATTCGGACG CTCCGGCGCC CGAGGTGGTC GATGTGGCGC AGATGCTGCT CGAGTCGGTC AAGCGCGGTC TGCCGGACGG ACTGGTGCGG GGGAACCCGA ACCGCAAGAC CGACGGTGCC GCCCCCGCGG AGGTGCCCAC CGAGGCACCG GCCGAGGTCG AGGAGGCTCC GGCGGCGACC GCGGCGGCAA CCACGGCCGA GCCTGCGGAG AAGAAGGACG CCAAGCCGAC GGTGGGCCTG GGCCTGGCCG GGGGCGGCAA GCGCCCCGGA GCTCCGAAGA AGCCGGGCGG CGCCCCCAAG CCGGGCGGCG CGGCTCCCGC GGCGGAGGCG GCCGAACCCG CAGCGCCCGC CGCACCGGCC AAGCCGGCCG TCGGGCTCGG CCTCGCCGGT GGCGCCAAGC GCCCCGGTGC CCCGAAGCCG GGTGGCGCCA AGCCTGCGGC CGCGGCCCCC GCTGCGGAGA CACCGGCCGA GACGGCCGCT GAGACCCCCG CGGCTCCCGC CGCACCGGCT AAGCCGGCCG TGGGCCTCGG GCTGGCCGGC GGCGCCAAGC GTCCGGGAGC GAAGAAGCCG GGAGCTCCTA AGTCCCCGGC AGCAGCGGCT CCGGCCGAGG CACCCGCAGC CGAGCCCACC GCGGCTCCGT CCGCCGCCCC CGCAGCGGAT GCTCCCGCCG AGGACCGCAC GCTGAGCACC GCCGGTGAGT CCAAGGCCGC GGGCTTCGGC ATCGCCGCGG GCGCCAAGCG TCCGGGACCG AAGAAGCCGG GGGCGCCCAA GCCCGCGGCC GCGGCCCCGG CCGCACCGGA GACCGCAGCT CCTGAGGCAG CCGCACCCGA AGCTCCGGCT GCTCCCGCCG CCGAGGCTCC CGCTGCTCCC GCCGAGGACC GCACGGTGAC CACCGCCGGT GAGTCCAAGG CCGCGGGCTT CGGCATCGCC GCGGGCGCCA AGCGCCCCGG AGTGAAGAAG CTGGGCGGCG AGATCAAGGA GAAGGTCGCC GAGGCACGTA CCGAGGTGCG CGAGCACACC GAGCACAAGA CCGAGCCCGA GGCACCGTCG GCCGCGGAGC CCGAGCAGCC TGAGGCACCG GAAACTCCCG CGGAGGCCCC GAAGGCCTCG TCGGGCGACG ACCGGAGCCT GGCGGAAAAG GGTGAGTCGA AGGCGAAGGG CTTCGGCATC GCCGCGGGCG CCAAGCGCCC CGGCGGACGC AAGTGA
|
Protein sequence | MDTDTLAAGA AETNALTLTL GTIGAIAFAL GWVVFFRGAL RMVRIIAKGQ PAPDRWTPFI PRAAEVVVEF LAHTKMIKNR TVGIAHWLVM IGFLAGAILW FEALPQTFDP SFHWPVIGSW PVYRLMDELL GIGTVVGIVV LIVIRQIKHP RNPELFSRFA GSKFLPAYTI EAIVFVEGLG MILVKSSKIA TYGDADPITD FFTRQVATLL PASPNLITVF AFIKLMSGVM FLLLVGSNLT WGVAWHRFSA FFNIYFKREL GSRRALGAAK PMMSDGKVLE METADPDTDA FGAGKIEDFS WKGWLDFTTC TECGRCQSQC PAWNTGKPLS PKLLIMGLRD HGFAKAPYLL AGGSTDMGGD ETGLVDADGN PDETKLGKIP AEARAEAARP LVGPAGEGDD LGGVIDPEVL WSCTTCGACV EQCPVDIEHV DHIVDMRRYQ VLIESEFPHE LAGLFKNLEN KGNPWGQNSS QRTAWIDEMD IDIPVYGEHT DSFDGYEYLF WVGCAGAYED RAKKTTKAVA ELLDVAAVNF LVLGQGETCT GDSARRAGNE FLFQMLAQQN IEQLNEVFDG LPVNQRKIVV TCAHCFNALG NEYPQVGGQY EVVHHTQLLN KLVREKRLVP VAPPSEDVTY HDPCYLGRHN QVYEAPRELI GASGATLKEM PRHLERSMCC GAGGARMWME EQLGKRINID RVDEALETLS STPEASPKKI ATGCPFCRVM LSDGVTAQTA DSDAPAPEVV DVAQMLLESV KRGLPDGLVR GNPNRKTDGA APAEVPTEAP AEVEEAPAAT AAATTAEPAE KKDAKPTVGL GLAGGGKRPG APKKPGGAPK PGGAAPAAEA AEPAAPAAPA KPAVGLGLAG GAKRPGAPKP GGAKPAAAAP AAETPAETAA ETPAAPAAPA KPAVGLGLAG GAKRPGAKKP GAPKSPAAAA PAEAPAAEPT AAPSAAPAAD APAEDRTLST AGESKAAGFG IAAGAKRPGP KKPGAPKPAA AAPAAPETAA PEAAAPEAPA APAAEAPAAP AEDRTVTTAG ESKAAGFGIA AGAKRPGVKK LGGEIKEKVA EARTEVREHT EHKTEPEAPS AAEPEQPEAP ETPAEAPKAS SGDDRSLAEK GESKAKGFGI AAGAKRPGGR K
|
| |