Gene Information |
Locus tag | Tpau_4226 |
Symbol | |
ID | 9158414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4358976 |
End bp | 4362635 |
Gene Length | 3660 bp |
Protein Length | 1219 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | virulence factor MVIN family protein |
Protein accession | YP_003649133 |
Protein GI | 296141890 |
COG category | |
COG ID | |
TIGRFAM ID | |
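The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4358976
END_BP = 4362635
GENE_LENGTH_BP = 3660
PROTEIN_LENGTH_AA = 1219


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 4362635 - 4358976 + 1 = 3660 bp, and 3660 / 3 - 1 = 1219 aa.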
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.105203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGCG CCGAGCCTCG GCACATTCCG CCGCGCAAAG GCACCGTCGC CACCGACGCC GGTGGCAGCG ACGACGGTGG CACGTCGAGT CTGCTGCGTT CGACCGGCTC CGTCGCCATC GCCACTCTCA CCTCACGCCT GACCGGCTTC CTGCGGACGG TACTGCTGGC CGCGATCCTC GGCGGGGCCG TCTGGTCGTC GTTCACCGTC GCCAACCAGA TGCCCCAGCA GGTCTCCGAG CTGGTGCTGG GCCAAGTGCT GGCGGCGCTG GTGATCCCGG TGCTGATCCG CGCCGAGATG GAGGACAAGG ACCGCGGTCA GGCCTTCTTC GAGCGGCTGT TCACCATGTC GCTGGTGATC CTCGGCGGGG CGCTGATCAT CGCGATGCTG ATCTCGCCGC TGCTGGTCGG CTGGCTGGTG GGCAAAGCCG ACAGCCAGGT CAACGCTCCG CTCACGCAGG CACTCGTCTA TCTGCTGCTC CCGCAGCTGG TTTTCTACGG GCTGTCTGCG CTGTTCACGG CCGTGCTCAA CACCCGCGCG GTGTTCCGGC CCGGTGCCTG GGCGCCGGTC GCGACCAACG TCATCCAGAT CGGCACGCTG GTGCTGTTCT ACCTGATGCC GGGCGAGCTC ACCCTGAACC CGGTCGAGAT GAGCGATCCC AAGCTCCTCG TCCTCGGGCT CGGAAGCACT CTCGGCGTGA TCGTGCAGGC CTGTATCCAG CTGCCTGCAC TGAAGCGCTC CGGCATCAAA CTCCGGCTGC GTTGGGGTGT CGACGACCGG CTCAAGCACT TCGGCGGTAT GGGCGTGGCG ATCATCGCCT ACGTCTTCAT CTCGCTGCTC GGCTCGTACC TGGTGACCCC GGTCGCCGCG GCGGCCTCGG AGACGGGGCC CGGTGTTTAC GCCAACGTCT GGCTGGTGTT GCAGCTGCCC TACGGTGTCC TCGGCGTCGC GCTGCTGACG GCCGTGATGC CGCGCCTGTC CCGGCATGCG GCGGAGGGGA ACCGCACCGC CGTCGTCGAC GACCTCTCGC TGGCCACCCG GATCACCATG GTCGCGCTCG TCCCGGTCGT GGCTTTCGCC ACCGCGTTCG GCCCGTCCAT CGGGCGCGCA CTGTTCAACT ACGGCCAGAT GTCCGTCGCC GAGGCCAACC ACCTCGGTAC GGCCATCTCG TTCGAGGCAT TCGTCCTGAT TCCCTACGCG ATGGTGTTGA TCCATCTTCG GGTGTTCTAC GCCCAGGAGC GGCCCTGGAC ACCCACGTTC ATCGTGCTCG CGATCACCGG GGTCAAGACC GGTCTGTCCT ATCTGGTGCC GCAGTTCGTC GATGACGGCA ACCGGGTGGT GGAGTTGCTC GGAACCGCGA CCGGTCTCGC GTACGCGGCC GGAGCACTCG TGGGCTGGAT CCTGCTCCGC CGGAATCTCG GCCGGATGCA GCTGACCAAT GTGGCCCGCA CCCTGCTCCA GACCACCGCG GTATCCGCCC TGGTCGTGGT CACCGTGTAC GCGATCATGC ACGTGAGTGT GCTGCAGAAG CTGGACAAGA GCGGACCGCT CGGCGCGCTG ATCTACCTCG CCCTCGCGGG CGTGCTCTCC ATGACGCTGA TCTACGCGCT GCTGGCTCTG TGGCGTGTGC CGGACGTGCT GGCGATCCTG GCACCTCTGC GCCGCATCGC GGGCCGGTTC GTGCCCGCCC TGCGGCCGGC GGCACCGTCG GCGCCCGCAC CCGCGCGCGA ACCCGCCGAG GCGGTCACCG CCGAGTTCCG GCTCGAGGAA CTCGAGCGCT TCGCCAGCCT CCGGCAGGAG 
GAGATCACCG CCCAGATGCC GAGGATCGCC GACGATATCG GCCTCCCGTA TGCTGGCCAA AGCCATGTTC CCCGGCGGTC GGAGGCACGT GCCGACAATC CGACCACTGG TCTCAGGTAC CGCAGAAGAG GAGCATTCGC AGTGACCGAG GACGACTCCG CACGCCCCAC CGGACCCGCA GCCCCGGGCA CCGGTCCGAT GCCGCTGCCG TCCACCCCGC AGCCCGCCGC CGGGCAGCCC GGAGCGCCCG CGCCGATCGA CGGATACGAC GATGCGCCGC GCGGCCCCCA GCTGATCCCC GGTGCCGTTG TGGCCGGCGG CCGCTACCGG CTCACGGAGC ATTACGGCGG CATCCGCGGT CTCCAGTTCT GGCAGGCCCG CGATATCAAC CTCGACCGCG ATGTGGCACT CACCTTCGTC GACTCCGAAC AGCGCGAGCC GGTGCCCGAG CGCGGCGCAC AGATCAGTAT GCGCGGCGAG GGCCCGCAGT CGATTCTCTC CCGCACGCTG CGCCTGGGCC GCGTGCATTC CAACGGCCTG GCCCGCGTGC TCGACGTGGT ACGCGGTAGC TCCGGCGGCA TCGTGGTCGC GGAGTGGATC CCCAGTTACT CCCTGGCCGA CGTGGCCGGA ACCCACCCGT CGGCGATCGC CGCCGCGAAG GCGGTGCGCT CACTCGCCAG CGCCGCCGAG GGCGCCCACC GGGCAGGTGC GGCGCTGTCG ATCGACCATC CCGACCGGGT CCGGATCTCG CAGGACGGCA ACGCCTTCCT GGCCTTCCCC GGCACCCTGG CCGATGCCAC CAAGGAGTCG GATGTCCGCG GCCTCGGTGC GGTGCTCTAC GCGCTGCTGC TGGAGAAGTG GCCGCTGGAC GAGACCACGG GCCGCATCGT GACCACCGGC TCGGGCACCG TATCCGGCCT GCAGCAGGCC GACGCGGATG CCGCCGGTAA CCCCGTCGAA CCCCGGGATG CCAAGAGCGA CATCCCGTTC GAGATCTCCG CCGTCGCCAC CCGCACCCTC GAGGGCGGGC AGGGCATCCG CACCGCGGCC ACCGTGCAGC AGTTGTTGGA TCAGGCCTCG GTGGTCGATG TGAAGACCGA CCTGCTGGCC GCAGTGCGCG ACGATGCGCC CGCCGCGGCC CGGCCCGCCG CCGCGACCGC CGCCCCCGCC GCACAGCAAT CGCTGCAATC GCGGCCCACA CCGCCGAAGC GGGTGCAGAC CGGCAAGCGG CCGAAGAACG TTCCCCTGCT CATCATCGCG GCCTGCGCCG TGGTGCTCCT GTTGATCATC GGCCTCACGC TGCTGATCAG CAGTCTGACC AGTGAGAAGG ACACGCAGTC CGGTGTGGGT TCGTTGTACC CGGCGGGACA GAGTTCGTCC ACCATCGCGG CGCCGGCGGC GCCCGGGATC GGTTCGCCGG TGAAGGCGAC CGCCGTCTCA CTGGTGGACT TCTCGTCGAA CAAGGACTCG GCGGTGAACA TCGGCAACGT CCTCACCGGC CAGGAGCCCG CGTGGAAGAC CGACACCTAC AAGGCCGGAC CCGTGTTCGG AAACCTGAAA AAGGGCGTGG GACTGTTGAT CACGCTGGAC AAGCCGGTCT CGTTGACCGG TGGCGCGATC ACCTCGCCGA GTGCGGGATC CACGATCGAG GTACGCACCT CGTCGAAGGA GACGGTGAAG TCGATCGACG AGACCTCGGT GGTCTGGTCG GGCACGCTGC GTCCGGGCGC CAACAACTTC CACGTCAGTG CGGTGGCACC GCGCACGAAG TACGTGGTCG TGTGGATCAC CGGCCTCACC AAGGTCGAGG 
GCAACGACTG GTACACCACG ATCAACCAGG TGTCCTTCCA GGGCACGTAG
|
Protein sequence | MSRAEPRHIP PRKGTVATDA GGSDDGGTSS LLRSTGSVAI ATLTSRLTGF LRTVLLAAIL GGAVWSSFTV ANQMPQQVSE LVLGQVLAAL VIPVLIRAEM EDKDRGQAFF ERLFTMSLVI LGGALIIAML ISPLLVGWLV GKADSQVNAP LTQALVYLLL PQLVFYGLSA LFTAVLNTRA VFRPGAWAPV ATNVIQIGTL VLFYLMPGEL TLNPVEMSDP KLLVLGLGST LGVIVQACIQ LPALKRSGIK LRLRWGVDDR LKHFGGMGVA IIAYVFISLL GSYLVTPVAA AASETGPGVY ANVWLVLQLP YGVLGVALLT AVMPRLSRHA AEGNRTAVVD DLSLATRITM VALVPVVAFA TAFGPSIGRA LFNYGQMSVA EANHLGTAIS FEAFVLIPYA MVLIHLRVFY AQERPWTPTF IVLAITGVKT GLSYLVPQFV DDGNRVVELL GTATGLAYAA GALVGWILLR RNLGRMQLTN VARTLLQTTA VSALVVVTVY AIMHVSVLQK LDKSGPLGAL IYLALAGVLS MTLIYALLAL WRVPDVLAIL APLRRIAGRF VPALRPAAPS APAPAREPAE AVTAEFRLEE LERFASLRQE EITAQMPRIA DDIGLPYAGQ SHVPRRSEAR ADNPTTGLRY RRRGAFAVTE DDSARPTGPA APGTGPMPLP STPQPAAGQP GAPAPIDGYD DAPRGPQLIP GAVVAGGRYR LTEHYGGIRG LQFWQARDIN LDRDVALTFV DSEQREPVPE RGAQISMRGE GPQSILSRTL RLGRVHSNGL ARVLDVVRGS SGGIVVAEWI PSYSLADVAG THPSAIAAAK AVRSLASAAE GAHRAGAALS IDHPDRVRIS QDGNAFLAFP GTLADATKES DVRGLGAVLY ALLLEKWPLD ETTGRIVTTG SGTVSGLQQA DADAAGNPVE PRDAKSDIPF EISAVATRTL EGGQGIRTAA TVQQLLDQAS VVDVKTDLLA AVRDDAPAAA RPAAATAAPA AQQSLQSRPT PPKRVQTGKR PKNVPLLIIA ACAVVLLLII GLTLLISSLT SEKDTQSGVG SLYPAGQSSS TIAAPAAPGI GSPVKATAVS LVDFSSNKDS AVNIGNVLTG QEPAWKTDTY KAGPVFGNLK KGVGLLITLD KPVSLTGGAI TSPSAGSTIE VRTSSKETVK SIDETSVVWS GTLRPGANNF HVSAVAPRTK YVVVWITGLT KVEGNDWYTT INQVSFQGT
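The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the amino-acid assignments of NCBI translation table 11 (the table listed in this record), treats the sequence as already in frame on the + strand, and stops at the first stop codon. The helper name `translate` is hypothetical, and only the first two 10-base blocks and the final nine bases of the gene sequence are used here, not the full 3660 bp.

```python
from itertools import product

# Codon order used by NCBI genetic-code tables: first, second, third base over T, C, A, G.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 in that codon order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Trailing bases that do not complete a codon are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence.
    print(translate("ATGAGTCGCG CCGAGCCTCG"))  # MSRAEP
    # Final nine bases of the gene sequence (in frame, ending in TAG).
    print(translate("GGCACGTAG"))  # GT
```

The two outputs correspond to the first six residues (MSRAEP) and the last two residues (GT) of the protein sequence listed above. Special handling of alternative start codons is omitted because this gene begins with ATG.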