Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5419 |
Symbol | |
ID | 4646612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 5797954 |
End bp | 5800179 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639808894 |
Product | phage integrase family protein |
Protein accession | YP_956195 |
Protein GI | 120406366 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.244002 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCCAC CGCCGCCCCG GGGCGCGGCC TCATCCCCGG TCGCGGTCAG TTCCTGGCAG TCCCAATGGG CCCGAGTGCC CGGGCAGTGG CGTAAGCCCG TCTATCCGAT CGACACCGCA CCCTTCCAGG AGGTGTTCCT GCGCAACCAG TTCTACCTGC GGGGCAACCG CGCCGGGGCC GCCCACGACT TCACCCCCGC CGCACCACCG CGATTCGCCG AGGAAATCGC TTGGTGGGTG TGGTGGTGCT GGGATCAGCA GCTGCGCAAG ATCGAGCCGT CGCTGCTGGC GTGGCTGGTG CGCACCCTGC CCGCGGCCAT CTCCGAGCAC ACCACCCGCA CCGGCGCTGC GCCCACTAGT ATCGCCGAGC TCGATCCCAC CGAGCTGATC CGCCAGGCCG CGTTGAGCTT TCAGCGCCGC AACAACCGGC TGCCCTCACC CGGATCGCGG CGCAACATCA GCCACCTCAT CGAACACCTG CACTTGAACG TGTCGGTATC GTGCACCGAC ACCCCGTGGT GGGCCCACGA CATCTGGGAT CTGCGCGCCG ATCCCCGCAT CCCGCAACGC CCGCACGAAC CCTGCCACGA CCAGACGGTG CGGCTGCGCG GGATCACCCC AGACTGGCTG CGCGAAGGCC TGCGGTTCTG GCTGCGCAGC GCCCTGACCT ACGACCTGCT CACCTGGTCC TCGGTCGTTG ACCGGGCCCG CAACCTCGGC TCGCAGCTGG GCCACTTCGC CACCACCGCA GGTCATCTGC AAGACCCCTT GATCAGCACC GACCCCGACC AGCTGCGCAC GGTGTTCCTC GACTACCTCG ACTACCTGCG CTCACCGCAG GCCGCCACCC ATTCCGAGCG GCTCACCTCC GACACGGTCG CTAGCCTGCA AGCCCAAACC CAGTCGTTCT ACACGTTCAT GCACGACCAC GCCGCCGAGG CAGCGACGGC CACCGCCACG GCACGCTGGC GCGACATCAC CCTGACCCAC ACGGCGTTAT GGTCGCCCAT CAACGCCCCC AAACACCGCC GCCGCGCACG CGAACTCACC TGGCATTCCA CCGCCGACCT GCAACGCATG CTCGCCTACC TCGACGTTTT GGCCGCAGAA CCCAAACAGA AAGTGGTGCT CACCGGCCCC GACGGGGACC TCTCGGTCCT CGCAGGCCTC GGTGACCCCC AAGCCGCCCG GATCTGGCTG CTGCAAGCAC TCACCGGCAG GCGGGCCTCG GAGATCTTGA TGCTCGATTA CGACCCGCTC GAGGCGATCC CGGGCCAGGA CCGGCCCGTC GGCACCGAAC CCGACAACGG GGCGTTCGTG GCGCGGCTGC GCTATCAACA GACCAAAGTC GACGGGATCG TCCCCACCAT CCTGGTCGAG CAAGCCATCG TCGACATCAT CGGTGAGCAA CAACGATGGC TCACCACCAA ATACCCTCAG CTGCAATCCA AATACCTGTT TCTCGGGCTG AAGAATCAGC ACCGCGGGCA ACGGCCCCGC TCCTACACCA CCTACCGGGC CATGCTCGAC AAACTCGACA AGTGCCACAC CCTCACCGAC AGCGCCGGGC GGGCACTGCG ATTCACCCAA ACTCACAGGC TGCGTCACAC CCGGGCCACC GAACTGCTCA ACGATGGCGT TCCGTTCCAC GTCGTGCAGC GCTACCTCGG CCACAAAAGC CCCGAAATGA CCGCCCGCTA CGCCGCCACG CTGGCCGCGA CCGCCGAAGC GGAATTCCTC AAACACAAGA AGATCGGGGC ACACGGTGCC GACATCGACA TCACCCCGCA CGACATCTAC GAGATGACCC AGCTGGCCGC CCGCACCGAC CGCGTCCTGC CCAACGGGGT CTGTCTGTTG CCCCCGCTCA AACAATGCGA CAAAGGCAAC GCCTGCCTGG GCTGCGGGCA CTTCGCCACC GACACCACAC ACCTGGACGA ACTGCGCGCC CAGCTCGCCG CGACCGAGGC GCTCATCGCG ACACGGCGCG ACCAATACCG GCAACGCGCC GGCCGCGAAC TTGGTGACGA CAACATCTGG ATCATCGAAC GACACCGCGA AATCGACTCG CTGCATGCCA TCATCGACCG CCTCGCCGCC ACCGCCGACA ACTCCGTCGC CGGCCCAGGC ACAGGCAAAC GACTGCCGCT GCTGCAGATC CAAACCCGCG GAGCCCACCA ATCCGCCCTC GACAAGGCCA GCCGACCCCG CACCGGTGAG CAATGA
|
Protein sequence | MDPPPPRGAA SSPVAVSSWQ SQWARVPGQW RKPVYPIDTA PFQEVFLRNQ FYLRGNRAGA AHDFTPAAPP RFAEEIAWWV WWCWDQQLRK IEPSLLAWLV RTLPAAISEH TTRTGAAPTS IAELDPTELI RQAALSFQRR NNRLPSPGSR RNISHLIEHL HLNVSVSCTD TPWWAHDIWD LRADPRIPQR PHEPCHDQTV RLRGITPDWL REGLRFWLRS ALTYDLLTWS SVVDRARNLG SQLGHFATTA GHLQDPLIST DPDQLRTVFL DYLDYLRSPQ AATHSERLTS DTVASLQAQT QSFYTFMHDH AAEAATATAT ARWRDITLTH TALWSPINAP KHRRRARELT WHSTADLQRM LAYLDVLAAE PKQKVVLTGP DGDLSVLAGL GDPQAARIWL LQALTGRRAS EILMLDYDPL EAIPGQDRPV GTEPDNGAFV ARLRYQQTKV DGIVPTILVE QAIVDIIGEQ QRWLTTKYPQ LQSKYLFLGL KNQHRGQRPR SYTTYRAMLD KLDKCHTLTD SAGRALRFTQ THRLRHTRAT ELLNDGVPFH VVQRYLGHKS PEMTARYAAT LAATAEAEFL KHKKIGAHGA DIDITPHDIY EMTQLAARTD RVLPNGVCLL PPLKQCDKGN ACLGCGHFAT DTTHLDELRA QLAATEALIA TRRDQYRQRA GRELGDDNIW IIERHREIDS LHAIIDRLAA TADNSVAGPG TGKRLPLLQI QTRGAHQSAL DKASRPRTGE Q
|
| |