Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0364 |
Symbol | |
ID | 9154499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 379474 |
End bp | 382467 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | large membrane protein |
Protein accession | YP_003645346 |
Protein GI | 296138103 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.149912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTCGCAG CGATAGGGCG CTTCTCGTAC GCGTTCCGAT ACACGATCAT CGCGGTCCTG TTCGTCTCGA TCGCCGCCAC CGGCGTCTGG GGCTTCCTCG GTCTCGGCTC GAAGACCAAG ATGAACGGCC TCTACGACGA GAAGAGCGAT TCGGTCGCCG CGTCGAAACT GACGGACGAG GCCAATGGCC GGTTCGCGCA GTCCGACATC CTGGTGCTGG TGAAGGCCGC CGACGGCAAG AAGGTCACCG ACGCAGAGGT GAAGGACTCC GTCGAAAGCG CACGGACCAA GATCCTCGAT CAGTTCGCCA AGGGACCGGA CGCGAAGCTG TACGGCGCGG ATGCAGAGAA CACCTCGATG CGCCCGGCCA GCTACACGAC GCTCCCCGCG GCCGACAATC CGTTCGTCCG CAACGAGGCC AACGGCGAGT CGTGGGGCTT CTTCACGCTG CCGCTCAAAT CGAACGACGA CACGCAACGC GGCCTTGATT GGGCGGATAT CCGGGAGCCG ATCCGGGACA TTCTGAAGAC CCAGGCCGAC GGGAAATTCG AAGCCAGATT CGCCGGCCTC GTACCGGTGA CATCCGAGGT CACCGAGGGC GTTCAGCGCG ATCAGAAGAA GGCGGAGCTC ATCGCCCTGC CGCTCGTCGC AATCGTGTTG TTCTTCGTCT ACGGCGGCAT CGTGGCGGCG GTGCTGCCGA TGATCATCGG TGTGATGACG ATCGGCGCAT CGATGGCGAT CGCCTCCGTC CTCGCACAGT TCATGGAAGT CAACCTGTTC GTCGGACCGA TCATCTCGCT GATCGGCCTG GGCATCGCGC ACGACTACGG CTTGTTCATG GTGTCCCGGT TCAGAGAGGA ACTGGACGAG GGGTACGACA CACCGACCGC GGTCAGACGC ACGGTGATGA CGGCGGGCCG CACCATCGTG ATGTCCTCCC TGCTGATCGT GGGCGCGCTC GCGTGTCTGT ACATCTCGCC GCTGGGCCTG CTCACCTCGA TGGCCACCGG CGCGCTGCTG GGCGTGTTCA TCGCGGCGCT CCTGTCGCTG GTGCTGCTGC CCGCGATGCT CGCCGTGCTC GGCCCGCGCG TGTTCGCGCT GAGCTTCCAC TGGTTCCTCA CCATCTTCGA CAAGTGGAAG CTGCCGCTGA TCGGCGGCAT CATCCACTGG GCCGCGGAGA AGACCGGTAA GCCGAAGACC AAGGCCAACA TCGAGAACGG CATGTGGGGC AAGGTGACCG ATGCGGTCAT GCGCAAGCCC ATCGCCTACA CGGTCGGTGT GGTCACCCTC CTGCTGGCGA TCACCATTCC GGTGATCGGC CTGCAGTTCG GCGGTATCTC GCAGACCTAC CTGCCGCCCA CCAACGAGAC GCGGCAGGCG CAGGACAAGT TCTTCGAGAC CTTCCCGGCG TATACCGGTC AGTCGATTCA GATCGTGTTC ACCAATGCCG ATGACGCCCA GATCAAGCAG GTGCTCGACG AGGCGAACAA GATTCCCGGA TTCGTCCACA CCGATCCGGC CGATCGCAAT TCAAGCCGCT TCAGCCCCCC GACTGCAGCG GTCGCGACCG CCAAGTCCGA TGGCTCCCCG CTCCAGATCC GAGGCGAAGG GCCCGAGGTC AAGGTCCGTA CCTCCAACGG CAACCTCCAG GATCCGAACG ACACCAAGGC CGTCAGTCAG GCGGTGGAGC AGATCCGCGG CATCACGATG GACGGTGCGG ACGCCAGCAA GGGCGTCGGG TTCATGGTCG GCGGCATGCC GACGCTGCAG AACGACTCGA TCGACTCGAT GCTCGGGAAG TTGCCGCTAC TCATGGGGCT GTTGGTCGTG GTGACCACGC TGATGATGTT CCTGGCGTTC GGATCGCTGG TGCTGCCGAT CAAGGCGGTG CTGATGTCGA TCCTGTCGCT GGGTTCCACC CTGGGCTTCC TCACCTGGAT GTTCGTGGAC GGGCACGGCG GCTCGATCTT CAACTTCTCC GGCGGCCCAC TGATGGCGCC GGTGCTCGTG CTGGTGATCG CGGTGGTCTT CGGTCTCAGT ACCGACTACG AGGTGTTCGT GATCTCCCGG ATGATCGAGG AGAGAGCCGC GGGAGCCACC ACGGTGCAGA GCATCCGCGC CGGCACCGCG AACACCGGCC GGATCATCAC CGCGGCTGCG ATTCTGCTGG CCTGCGTCGC CGCGGCCTTC GCCACATCCG ACCTGGTGAT GATGAAGTAC GTCTCGTTCG GCCTGATGTT CGCGCTGCTC ATCGACGCCA CCGTGGTCCG CATGGTGCTG GTGCCCGCGG TGATGAAACT TCTCGGCGAC GACTGCTGGT GGGCTCCCGA ATGGATGCGC AAGATCCAGC ACGCGATCGG CCTGGGCGAG ACGGCGCTGC CCGCGGAGAC GCGCGACGGC CGTCCGCTGG CGTCGGTGCG TCCTCCGTCG GCGGTGCCCG GCCGTGCAGC GCCGCAGCTG GTGGGCGCGG GCGCGCCCGT GGTCCCCGGA TCGTTGGGGA GTGCCGAGCA GCGTCCGATC CCGGACAGCG CGGCCGATCC CACCGCACCG TCGAACAACG AGCGTCCGTC GGGCCAGATC CGGATGGGCA ATCGTCGTCC GGGACCGCCG CCCGCCGGCC AGCCGCCGCG CCCGTCGGCG CCGATGCAGC GTCCCCCGCA GGACCCGCGG AGCTATCAGC CGCCGGTCCC GGGGCAGCGG CGCCCCGGGC CTCCGCCGCA GGGGCCGGGC GGTCCGCGCG GGCCGCAGGG GCCCTACGGT CCCCCCGGAC AGGGCGGCGG TCCGCGCCCG AGCGGCCCGA TGCCGCGCCA GCAGCGACCC GGCGGCCCCG CCGGACCGGG TGGTCCCGGT GGGCCGCGCC CGCCGTACCC GCCCCGGCCA CAGGGCCCGG GTGGGCCCGG TGGACCGGGC GGACGAGGGC CGGGTCAGCG CCCCTTCCCC GGCCCGTCCG GTGACGGCCC CCGCCCGCAC CAGCAGCGCG ACCGGGACGA GTAG
|
Protein sequence | MFAAIGRFSY AFRYTIIAVL FVSIAATGVW GFLGLGSKTK MNGLYDEKSD SVAASKLTDE ANGRFAQSDI LVLVKAADGK KVTDAEVKDS VESARTKILD QFAKGPDAKL YGADAENTSM RPASYTTLPA ADNPFVRNEA NGESWGFFTL PLKSNDDTQR GLDWADIREP IRDILKTQAD GKFEARFAGL VPVTSEVTEG VQRDQKKAEL IALPLVAIVL FFVYGGIVAA VLPMIIGVMT IGASMAIASV LAQFMEVNLF VGPIISLIGL GIAHDYGLFM VSRFREELDE GYDTPTAVRR TVMTAGRTIV MSSLLIVGAL ACLYISPLGL LTSMATGALL GVFIAALLSL VLLPAMLAVL GPRVFALSFH WFLTIFDKWK LPLIGGIIHW AAEKTGKPKT KANIENGMWG KVTDAVMRKP IAYTVGVVTL LLAITIPVIG LQFGGISQTY LPPTNETRQA QDKFFETFPA YTGQSIQIVF TNADDAQIKQ VLDEANKIPG FVHTDPADRN SSRFSPPTAA VATAKSDGSP LQIRGEGPEV KVRTSNGNLQ DPNDTKAVSQ AVEQIRGITM DGADASKGVG FMVGGMPTLQ NDSIDSMLGK LPLLMGLLVV VTTLMMFLAF GSLVLPIKAV LMSILSLGST LGFLTWMFVD GHGGSIFNFS GGPLMAPVLV LVIAVVFGLS TDYEVFVISR MIEERAAGAT TVQSIRAGTA NTGRIITAAA ILLACVAAAF ATSDLVMMKY VSFGLMFALL IDATVVRMVL VPAVMKLLGD DCWWAPEWMR KIQHAIGLGE TALPAETRDG RPLASVRPPS AVPGRAAPQL VGAGAPVVPG SLGSAEQRPI PDSAADPTAP SNNERPSGQI RMGNRRPGPP PAGQPPRPSA PMQRPPQDPR SYQPPVPGQR RPGPPPQGPG GPRGPQGPYG PPGQGGGPRP SGPMPRQQRP GGPAGPGGPG GPRPPYPPRP QGPGGPGGPG GRGPGQRPFP GPSGDGPRPH QQRDRDE
|
| |