Gene Mvan_3747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3747 
Symbol 
ID4646812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3985746 
End bp3987161 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content60% 
IMG OID639807211 
Productphage integrase family protein 
Protein accessionYP_954535 
Protein GI120404706 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.51033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCGA CCACAGTACA AACACCAGCA GGAGCGTGGA TGACGACGAC CGATAAACCA 
CCCCCTAAAC GCACCCGCCG CACAGAGCCA ATTATGCGGC GCAACGCCAA GAACGGCGCA
GTGAGCTACA CATTCCAAGT GGACGTCGGG ACCCACGCTG ACGGCACCCG CGACAGACAG
CGGTTCACTT TCTCCACGTT TGCGGAAGCC CGCCGTGAGT ACCGAAAAAT CTCGACGGAG
GTTGCAGCGG GGACGTTCGT AAAAAGAGAC CTCACCACCG TCGCCGAGTA TTTGGGGAGG
TGGCTCGACG GAAGACGCGA CGTCAGGGTT AATACCCTTG CAGGGTACCG ACATTCGTTG
AAGCCCGTTA TCGACAATCT TGGCGGCTTG GGTTTGCAAC AACTACGAAC GTCCGACATC
GACGCCCTGG TAACACTGCG GCTTAATGGC ACGCCGATTT CCCAGCGTGA GAGGCGCGGA
CGGCGCGCTG CTGAGGTATT AGCGTACTTG CGGTCAAGAT CCGGCGGTGC CCAATACGCG
GATGTTCGCG ACGAACTCGG CAATCCTGGC GTCAAGGCCT TAGATCGGCT GGTCGCCTCG
GGCGAGGTGA TACGGCCTAG CCGCGGCCGA TATGTTGCCG TTCGGGACAG CGATCCCGTG
CAGTCCAAGA TTCCAGGCAA CGTTAGTTCC CGGACTGTCG TGACAATGCT CGTGGTGCTG
TCGTCGGCCC TCGACGACGC TATGAGGGAG GGCCTTGTAG CCCGAAACGT AGCGCGTTTA
GTGAAGAGGC CCGCGGTGGA GCATCACGAA ATGGCCACTT GGACAGCAGA ACAAGCGGTT
CGGTTTCGCG AACATGTGCG CGGCGATCGG CTCGCCGCAT GTTGGCTGCT GACCCTGGCA
GGGCTTCGCC GGTCGGAAGT TCTAGGACTA CGTTGGTCCG ATGTTGACCT CGATGGCGGA
ACGGTCACCA TCGCTCAGGG TCGTGTTGTA GCAGAAGGCC AAGGCACTAT CACCGGAGAC
CCGAAGTCCA AGCGGTCTCG CAGGGCACTA CCGATGCCCG CCGAAGTGCT CGCCGCACTG
CGCGTCTTTA GGCTCCGCCA GTCTGAAGAA CGCCTCGCCA TCGGCTCGGA GTATCCCGAC
ACCGGGTTGG TCGCCGTCAA CGTGATTGGC CTCCCTATCC GACCGGAGAC GTATTCAGGC
GAATTCATGC GGCACGCGAA AGACGCTGGT GTGCCTTTAA TTCGGTTACA CGACGTTCGA
CATACTGCAG CCACCATGCT GCTCGATCGC GGAACAACAC CCTCGGCTAC TGCAAAATGG
CTCGGCCATG ACCCAGCGAT TACGCTTCGG GTATACGGGC ACGTATATGA CGGCGCACTT
GCGGCGGCAG GCGACACACT GCTTCGGGGC CACTAA
 
Protein sequence
MQATTVQTPA GAWMTTTDKP PPKRTRRTEP IMRRNAKNGA VSYTFQVDVG THADGTRDRQ 
RFTFSTFAEA RREYRKISTE VAAGTFVKRD LTTVAEYLGR WLDGRRDVRV NTLAGYRHSL
KPVIDNLGGL GLQQLRTSDI DALVTLRLNG TPISQRERRG RRAAEVLAYL RSRSGGAQYA
DVRDELGNPG VKALDRLVAS GEVIRPSRGR YVAVRDSDPV QSKIPGNVSS RTVVTMLVVL
SSALDDAMRE GLVARNVARL VKRPAVEHHE MATWTAEQAV RFREHVRGDR LAACWLLTLA
GLRRSEVLGL RWSDVDLDGG TVTIAQGRVV AEGQGTITGD PKSKRSRRAL PMPAEVLAAL
RVFRLRQSEE RLAIGSEYPD TGLVAVNVIG LPIRPETYSG EFMRHAKDAG VPLIRLHDVR
HTAATMLLDR GTTPSATAKW LGHDPAITLR VYGHVYDGAL AAAGDTLLRG H