Gene Mvan_5419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5419 
Symbol 
ID4646612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5797954 
End bp5800179 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content67% 
IMG OID639808894 
Productphage integrase family protein 
Protein accessionYP_956195 
Protein GI120406366 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.244002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCAC CGCCGCCCCG GGGCGCGGCC TCATCCCCGG TCGCGGTCAG TTCCTGGCAG 
TCCCAATGGG CCCGAGTGCC CGGGCAGTGG CGTAAGCCCG TCTATCCGAT CGACACCGCA
CCCTTCCAGG AGGTGTTCCT GCGCAACCAG TTCTACCTGC GGGGCAACCG CGCCGGGGCC
GCCCACGACT TCACCCCCGC CGCACCACCG CGATTCGCCG AGGAAATCGC TTGGTGGGTG
TGGTGGTGCT GGGATCAGCA GCTGCGCAAG ATCGAGCCGT CGCTGCTGGC GTGGCTGGTG
CGCACCCTGC CCGCGGCCAT CTCCGAGCAC ACCACCCGCA CCGGCGCTGC GCCCACTAGT
ATCGCCGAGC TCGATCCCAC CGAGCTGATC CGCCAGGCCG CGTTGAGCTT TCAGCGCCGC
AACAACCGGC TGCCCTCACC CGGATCGCGG CGCAACATCA GCCACCTCAT CGAACACCTG
CACTTGAACG TGTCGGTATC GTGCACCGAC ACCCCGTGGT GGGCCCACGA CATCTGGGAT
CTGCGCGCCG ATCCCCGCAT CCCGCAACGC CCGCACGAAC CCTGCCACGA CCAGACGGTG
CGGCTGCGCG GGATCACCCC AGACTGGCTG CGCGAAGGCC TGCGGTTCTG GCTGCGCAGC
GCCCTGACCT ACGACCTGCT CACCTGGTCC TCGGTCGTTG ACCGGGCCCG CAACCTCGGC
TCGCAGCTGG GCCACTTCGC CACCACCGCA GGTCATCTGC AAGACCCCTT GATCAGCACC
GACCCCGACC AGCTGCGCAC GGTGTTCCTC GACTACCTCG ACTACCTGCG CTCACCGCAG
GCCGCCACCC ATTCCGAGCG GCTCACCTCC GACACGGTCG CTAGCCTGCA AGCCCAAACC
CAGTCGTTCT ACACGTTCAT GCACGACCAC GCCGCCGAGG CAGCGACGGC CACCGCCACG
GCACGCTGGC GCGACATCAC CCTGACCCAC ACGGCGTTAT GGTCGCCCAT CAACGCCCCC
AAACACCGCC GCCGCGCACG CGAACTCACC TGGCATTCCA CCGCCGACCT GCAACGCATG
CTCGCCTACC TCGACGTTTT GGCCGCAGAA CCCAAACAGA AAGTGGTGCT CACCGGCCCC
GACGGGGACC TCTCGGTCCT CGCAGGCCTC GGTGACCCCC AAGCCGCCCG GATCTGGCTG
CTGCAAGCAC TCACCGGCAG GCGGGCCTCG GAGATCTTGA TGCTCGATTA CGACCCGCTC
GAGGCGATCC CGGGCCAGGA CCGGCCCGTC GGCACCGAAC CCGACAACGG GGCGTTCGTG
GCGCGGCTGC GCTATCAACA GACCAAAGTC GACGGGATCG TCCCCACCAT CCTGGTCGAG
CAAGCCATCG TCGACATCAT CGGTGAGCAA CAACGATGGC TCACCACCAA ATACCCTCAG
CTGCAATCCA AATACCTGTT TCTCGGGCTG AAGAATCAGC ACCGCGGGCA ACGGCCCCGC
TCCTACACCA CCTACCGGGC CATGCTCGAC AAACTCGACA AGTGCCACAC CCTCACCGAC
AGCGCCGGGC GGGCACTGCG ATTCACCCAA ACTCACAGGC TGCGTCACAC CCGGGCCACC
GAACTGCTCA ACGATGGCGT TCCGTTCCAC GTCGTGCAGC GCTACCTCGG CCACAAAAGC
CCCGAAATGA CCGCCCGCTA CGCCGCCACG CTGGCCGCGA CCGCCGAAGC GGAATTCCTC
AAACACAAGA AGATCGGGGC ACACGGTGCC GACATCGACA TCACCCCGCA CGACATCTAC
GAGATGACCC AGCTGGCCGC CCGCACCGAC CGCGTCCTGC CCAACGGGGT CTGTCTGTTG
CCCCCGCTCA AACAATGCGA CAAAGGCAAC GCCTGCCTGG GCTGCGGGCA CTTCGCCACC
GACACCACAC ACCTGGACGA ACTGCGCGCC CAGCTCGCCG CGACCGAGGC GCTCATCGCG
ACACGGCGCG ACCAATACCG GCAACGCGCC GGCCGCGAAC TTGGTGACGA CAACATCTGG
ATCATCGAAC GACACCGCGA AATCGACTCG CTGCATGCCA TCATCGACCG CCTCGCCGCC
ACCGCCGACA ACTCCGTCGC CGGCCCAGGC ACAGGCAAAC GACTGCCGCT GCTGCAGATC
CAAACCCGCG GAGCCCACCA ATCCGCCCTC GACAAGGCCA GCCGACCCCG CACCGGTGAG
CAATGA
 
Protein sequence
MDPPPPRGAA SSPVAVSSWQ SQWARVPGQW RKPVYPIDTA PFQEVFLRNQ FYLRGNRAGA 
AHDFTPAAPP RFAEEIAWWV WWCWDQQLRK IEPSLLAWLV RTLPAAISEH TTRTGAAPTS
IAELDPTELI RQAALSFQRR NNRLPSPGSR RNISHLIEHL HLNVSVSCTD TPWWAHDIWD
LRADPRIPQR PHEPCHDQTV RLRGITPDWL REGLRFWLRS ALTYDLLTWS SVVDRARNLG
SQLGHFATTA GHLQDPLIST DPDQLRTVFL DYLDYLRSPQ AATHSERLTS DTVASLQAQT
QSFYTFMHDH AAEAATATAT ARWRDITLTH TALWSPINAP KHRRRARELT WHSTADLQRM
LAYLDVLAAE PKQKVVLTGP DGDLSVLAGL GDPQAARIWL LQALTGRRAS EILMLDYDPL
EAIPGQDRPV GTEPDNGAFV ARLRYQQTKV DGIVPTILVE QAIVDIIGEQ QRWLTTKYPQ
LQSKYLFLGL KNQHRGQRPR SYTTYRAMLD KLDKCHTLTD SAGRALRFTQ THRLRHTRAT
ELLNDGVPFH VVQRYLGHKS PEMTARYAAT LAATAEAEFL KHKKIGAHGA DIDITPHDIY
EMTQLAARTD RVLPNGVCLL PPLKQCDKGN ACLGCGHFAT DTTHLDELRA QLAATEALIA
TRRDQYRQRA GRELGDDNIW IIERHREIDS LHAIIDRLAA TADNSVAGPG TGKRLPLLQI
QTRGAHQSAL DKASRPRTGE Q