Gene Mvan_5941 details

Gene Information

Locus tag: Mvan_5941
Symbol:
ID: 4644928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 6337524
End bp: 6339158
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 68%
IMG OID: 639809414
Product: phage integrase family protein
Protein accession: YP_956708
Protein GI: 120406879
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
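
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 6337524
END_BP = 6339158
GENE_LENGTH_BP = 1635
PROTEIN_LENGTH_AA = 544


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the reported gene length and protein length both agree with the coordinates.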


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.345865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGAAAC GACAACGAGA CTGCGTTGAC TGCGGCGCAC CGGTGGGATA CATCGGCCGC 
GAGCACTGCT GCCACTGCAC CCGCCGGTTG CGGGCGCAGG CCGCCAAGGC ACGATGCCCG
GGCTGCGGCT GGGACCGGAT CGTGCTGCCG GAGACAGACC GGTGCATGCT GTGCTCACGC
CGCTGCCGCG AATGCGGCGG TCCGATCAAA TTCCGCGGTG AAACGGTATG CCGGCCATGT
CAGAAGCGGG CCACGCTCGC CGCCGCGAAA TCGACATGCC CGCGCTGCGC AAAGCCCGGC
TATCTGCGCG AGTCGACGGG GTGGTGCGGC CCGTGCTCGC GGCCCCGCCC ACCAAAGTAT
CCGCCCCGCA TCTGTGTGTC CTGCGGTGAG CTGCGCCGCC ACGCCGCGCA CGGCATGTGC
GGGCGATGTT GGCAGCGCCA CCCGGACCGG CCGTTCGTGC GCGGTGAGAC CCTGGCCGCT
GCGCTCACTG AACCACCAGG CTGGCTCGGC GATTTCGTCG CGTATCTGGC CGCCCGTCAC
TGCCCCGCTC GCGCTTGCAG GATGATCGCC ACGCTGGCTC GACTCCTCGA AGACGAACAC
CCCAACCACC CGCAGAGCGT GCTCGAGCGG TCCCGCCGAC CTGGCCGGTC GATGGGATCA
CTGGCTCGCG CTCTGGAAGC GTTCTTCACC GAGCACGGCC TGGCGATGGC CACCGACCAG
GACCAGCGGC TGGCCGCCGG ACGACGGCAG AAACGGATCG ACGCCACACC GGGTCCGATG
CGCGTCGCTG TCGCGGCCTT CGCCGAATCC ATGCTGCACA ACCGCGACCG GGCCCGGCGT
GCAGGCACCC GCCCCCGCTC CGACCGCACC ATCGAGTCAG CGTTGACGAT AGTGCGAGAC
CTGGCTCACT TCCTCGAAGC GCAACGGGGC AAGCAGGATT GGGCACTCGT GGACGTCACG
GATATCGAGG CGTTCCTCGC CGAGTTGCCC AACACCCGAG CGCGACGTCT GACCGTGTTG
GGCCAGTTCT TCCGGTTCGC ACGCAATCGT CGCGTGGTGC TCGTGGACCC GACACACGGC
ATGTCAGCCA AGCGGCACAG GGGATTTCGT GGCCGCACAC TCACCATCAC GCAACAGCGC
GACCTATTTC GCCGCTGGAG CGCCGACCCC GCTGCGCATC CGCACGAAGC CCTCTTCGGG
CTCCTCGCCA TATTGCACGG TGTCTCCAGC CGTGAGCTGC GCCTGCTGCA GGTCGACCAC
ATCGACACCA GTGACCGCTC GATCCGCCTC GGCACGCGCC CCCACCCGGT GCCGCTGGAC
CCGGTCAGCT GGAACGCGCT GCAACGCAGC CTCGATCACC GCGACACCCA GCGCACCAAC
AACCCGCACG TGATCGTCAC GCGCGGTACC AAAGCCGACC GGCGACCGGC GTCGGAGGCC
TACATGAGTC ATCTGCTCGA TCCCTGCGGC CTGCCACCCA AGATGCTGCG CAGTACCCGG
CTGGCCGATC TGGTCAATAC GATCGACCCG AAACTCGTCG CGGCAGCCTT CGGGATGAGA
CCCGAAGGAG CGATGATCTA TCTCGCCGAT CACCTCGACG AGGGCCGACT GCCCGACCAC
CTCGCCCAAC CCTGA
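
The GC content field reported for this gene could in principle be recomputed from the gene sequence above. This is a minimal sketch in Python; the helper name `gc_fraction` is hypothetical, and how the database rounds its percentage is not stated in the record, so no specific output for the full sequence is claimed here.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (the blocks of ten and line breaks used above) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First block of ten bases from the gene sequence above.
    print(gc_fraction("GTGGGGAAAC"))
```

Pasting the full gene sequence into `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the reported value.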
 
Protein sequence
MGKRQRDCVD CGAPVGYIGR EHCCHCTRRL RAQAAKARCP GCGWDRIVLP ETDRCMLCSR 
RCRECGGPIK FRGETVCRPC QKRATLAAAK STCPRCAKPG YLRESTGWCG PCSRPRPPKY
PPRICVSCGE LRRHAAHGMC GRCWQRHPDR PFVRGETLAA ALTEPPGWLG DFVAYLAARH
CPARACRMIA TLARLLEDEH PNHPQSVLER SRRPGRSMGS LARALEAFFT EHGLAMATDQ
DQRLAAGRRQ KRIDATPGPM RVAVAAFAES MLHNRDRARR AGTRPRSDRT IESALTIVRD
LAHFLEAQRG KQDWALVDVT DIEAFLAELP NTRARRLTVL GQFFRFARNR RVVLVDPTHG
MSAKRHRGFR GRTLTITQQR DLFRRWSADP AAHPHEALFG LLAILHGVSS RELRLLQVDH
IDTSDRSIRL GTRPHPVPLD PVSWNALQRS LDHRDTQRTN NPHVIVTRGT KADRRPASEA
YMSHLLDPCG LPPKMLRSTR LADLVNTIDP KLVAAAFGMR PEGAMIYLAD HLDEGRLPDH
LAQP
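
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 (as listed in the record), in which elongation codons follow the standard genetic code and an alternative start codon such as GTG at the first position is read as methionine. The function name `translate_cds` and its parameter are hypothetical; this is not the database's own method.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same codon
# assignments and differs in which codons may act as start codons.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate_cds(sequence: str, starts_at_initiator: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    If starts_at_initiator is True, the first codon is emitted as 'M'
    regardless of its identity (assumption: it is a valid start codon).
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if i == 0 and starts_at_initiator:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = ("GTGGGGAAAC GACAACGAGA CTGCGTTGAC "
                  "TGCGGCGCAC CGGTGGGATA CATCGGCCGC")
    print(translate_cds(first_line))  # MGKRQRDCVDCGAPVGYIGR
    print(translate_cds("CTCGCCCAAC CCTGA", starts_at_initiator=False))  # LAQP
```

The first 60 bases of the gene sequence translate to the first 20 residues of the protein sequence, and the final 15 bases translate to its last four residues followed by the TGA stop codon.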