Gene Mvan_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1114 
Symbol 
ID4648549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1184203 
End bp1186116 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content65% 
IMG OID639804614 
Productphage integrase family protein 
Protein accessionYP_951957 
Protein GI120402128 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCG TGCTCGCTTA CGTTCCCGCT CCTGATGCTG ACCTGCTCAC CGATTACGAC 
GCGCACTGCA CGCGGCTGGG ATTGACCAGT AGGAACCTCC GGTCGGCAGC ACGAACGTTC
CTGCGATGCT GGCCCGATCC ACAGCAATGG GCCGGGGAAC CACTGCAGAC ACGGTTGTCG
GCGTCGCATA CCACCCGCTG CTTTGTGACG TTCTTGATGC TGGCTGGGCA CCTGCGGCCC
GGATACGACT ACCTGATCTG CCGCAAACTG TCCGTCTTTT GGCGCCACAT GCCGCCCGGG
CTGCTTGCGC AGGACCTTGC CCGGTTCCTG GGTGCCGCCG AAGAACTTGG CTTCACCGAA
CGCACCCGCA GCGCATCCGC GTCGCAGGTC ATCGGTCGGC TCCTCATTCA GACAGGGAGA
CGACTGGATG CGCTGACCAA CAACGACTTT GATGATCTAC TAGCTGCAAG TGCGGCTCGC
CGCGGCGCTG ATAAACCCAG CCGCCACTAC AGCAGCGGTG CCCACACCGC CCGGCAGGTG
ATGTTTCACC TTGGTGTGTT CACCGACCAG CCAGTCAACG CGACCAGCCT GCTGCGGCAA
AGCTTTGCCC AGCGGATGCG GGATGCGAGT CCGTCGCTGC GCGGGTCGTT TGTGGCCTAC
CTTGATCGGT TGACCGCCAC CCATAGCCGC GGCACGGTCG CTGGCACCGC AACCCGGCTC
AACCACTTCG CCGCCCACCT GTATGCGGTT GATCCGACAT TGACAACTCT GGCCAATTTG
GACCGACGCC GCCACATCGA GACTTACCTG ACCGCTACCG CCGAGGCAAC CAATTCGCGC
ACGGGCGCAC CGATTCAAGC CTCAGAGCGT CGCGGTCGGG TCTTGGCAGT GCACTGCTTT
CTCAACGACA TCGCTGAGTG GGGTTGGCCG GAGGCACCAC CTCGGCGGCT GGTATTCCGC
TCCGACATTC CGAAGCTGCC GCGAGCTCTG CCGCGCTACC TCACCCCGGA CCTGGATCGG
CGGCTGACTC AGGCACTGCA GGCCTGGCCA GACCGGCTGC CGGCTGACGC GCTGCTGTTG
CAGCGGGCGA CCGGATTGCG TATCGGTGAG CTCGTCGAAC TCGAACTCGA CAGCGTCCAC
GAAATCCCCG GCCAAGGCGC ATGGCTGAAA GTCCCACTGG GAAAGCTCAA TACCGAACGG
ATGGTGCCCC TCGATGACGA CACTGTTGCC CTCATCGATC GGATCGTGGC CCACCGCTCG
CTTGGGCGGC CGTTACCGCA CCCACGCACA GCACGCCCTA CCGACTACCT GTTCACCCAC
CACGGTCGGC GCCTGACCGT CGATCACATC CGCGACGTGC TGGGCCGCGT GACCACCGAC
GCCAACCTGC CGCATATCAC CCCGCACCAA CTGCGCCACA CCTACGCCAC TGCCTTGGTC
AACGCTGGTG TCTCGCTGCA GAGTCTGATG GCTCTGCTCG GGCACGTCTC GGCTGAGATG
AGTCTGCGCT ACGGACGCCT ATTCGACGCA ACCGTGCGCA CCGAATACGA ACGTGCGCTG
ACCGCGGCGA AGGCCCATCT GGGAACCCTG CCCACTGGTC CACCACAAGG CCGGATCTCG
CTGCCGATCG TCGACGGCGA CTGGAAAGAC GCACCAGCCA TCAAGGCCCG CCTGGCCGGC
GGATTCTGCA TCCGTGCCCA AGTCCAAGGC CCGTGCGCCT ACGCCAACAT CTGCGAACAC
TGCCCCAACT TCCGCACCGA CACCGGCTAC CTGCCCGTCC TGGCCGCTCA GCGCGCTGAC
ACCGAGACCC TGGCTCGCGA CGCCGAAGCC CGTGGCTGGA CCGAGGAAGC CGAACGCCAC
CGCCGACTCA TCGCACGCCT GGACGCCCAC ATCAGTCAGA CGCAGACCGG ATGA
 
Protein sequence
MPPVLAYVPA PDADLLTDYD AHCTRLGLTS RNLRSAARTF LRCWPDPQQW AGEPLQTRLS 
ASHTTRCFVT FLMLAGHLRP GYDYLICRKL SVFWRHMPPG LLAQDLARFL GAAEELGFTE
RTRSASASQV IGRLLIQTGR RLDALTNNDF DDLLAASAAR RGADKPSRHY SSGAHTARQV
MFHLGVFTDQ PVNATSLLRQ SFAQRMRDAS PSLRGSFVAY LDRLTATHSR GTVAGTATRL
NHFAAHLYAV DPTLTTLANL DRRRHIETYL TATAEATNSR TGAPIQASER RGRVLAVHCF
LNDIAEWGWP EAPPRRLVFR SDIPKLPRAL PRYLTPDLDR RLTQALQAWP DRLPADALLL
QRATGLRIGE LVELELDSVH EIPGQGAWLK VPLGKLNTER MVPLDDDTVA LIDRIVAHRS
LGRPLPHPRT ARPTDYLFTH HGRRLTVDHI RDVLGRVTTD ANLPHITPHQ LRHTYATALV
NAGVSLQSLM ALLGHVSAEM SLRYGRLFDA TVRTEYERAL TAAKAHLGTL PTGPPQGRIS
LPIVDGDWKD APAIKARLAG GFCIRAQVQG PCAYANICEH CPNFRTDTGY LPVLAAQRAD
TETLARDAEA RGWTEEAERH RRLIARLDAH ISQTQTG