Gene Mvan_0759 details

Gene Information

Locus tag: Mvan_0759
Symbol:
ID: 4646794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 800100
End bp: 801800
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 70%
IMG OID: 639804259
Product: type III restriction enzyme, res subunit
Protein accession: YP_951603
Protein GI: 120401774
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
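
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 800100
end_bp = 801800
gene_length_bp = 1701
protein_length_aa = 566

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.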


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.523445
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.362489
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCGGGCTG ATGCAGCGCC CAGCACCCAG GCGTTGCGGG GCTGGCAACG ACGGGCTCTG 
GTGAAGTACC TGTCGGCCGC TCCTCGTGAT TTCCTCGCGG TGGCCACCCC TGGCGCCGGC
AAGACCACCT TCGCGCTGCG CATCGTCGCC GAACTGCTCG CCGAAGGCAC CGTCGAGGCC
GTCACGATCG TCGTGCCCAC CGAGCACCTG AAGATCCAGT GGGCCCAGGC CGCCGCCCGG
CACGGCATCG CGCTGGACCC GAAATTCTCC AACTCGAACT CGCAGACCTC CTCGGACTAC
CACGGCGTCG TCGTCACCTA CGCGCAGGTG GCCAGCCACC CGACCCGGCA CCGGGTGCGT
ACCGAGAACC GCAAGACGCT CGTCGTCTTC GACGAGATCC ACCACGGCGG CGACGCCAAG
AGCTGGGGCG ACGCCATCCG GGAGGCGTTC GACGACGCGA CCCGCCGGCT CGCGCTGACC
GGGACGCCGT TCCGCAGCGA CGACAGCCCG ATCCCGTTCG TCAACTACGA GACCGGACCC
GACGGCTTCG CCCGTTCCAA GGCCGACCAC ACGTACGGCT ACTCCGACGC GCTCGCCGAC
GGCGTGGTCC GGCCGGTCAT GTTCATGGCG TACTCCGGAG AGGCCCGCTG GCGCGACAGC
GCGGGCGAGG AACACGCCGC CCGCCTCGGC GAGCCGCTGA CCGCCGAGCA GACGGCGCGG
GCGTGGAAGA CCGCGCTGGA CCCGAAGGGC GAGTGGATGC CCGCGGTGAT CGCCGCGGCG
GACAAACGGC TGCAGGGACT GCGTCAGCAC GTGCCCGACG CCGGCGGCAT GATCATCGCC
TCCGACCAGA CGACCGCCCG CGCGTACGCG GACCTGCTGG TGAAGATCAC CGGTGAAGCG
CCGACGGTGG TGCTCTCCGA CGACAAGGGC GCCTCCGACC GGATCTCGGA GTATTCGGCG
GGAACGTCGC GGTGGCTGGT GGCGGTGCGG ATGGTGTCCG AGGGCGTCGA CGTGCCGCGG
CTGGCCGTCG GGGTGTACGC GACGAGTGCG TCCACGCCGT TGTTCTTCGC GCAGGCGATC
GGCCGGTTCG TGCGGTCGCG GCGGCCCGGC GAGACCGCCA GCATCTTCCT GCCGTCGGTG
CCGAATCTGC TGCTGCTGGC CAGTGAGATG GAAGCGCAGC GCAACCATGT GCTGGGCAAG
CCGCACCGCG AACCGCTCGA GGACCCGCTC GACGCCGAAC TGCGTGAGCA GAAGCGCGAC
GAACCGGGCG AGGAGGAGAA CAAGATCGAG TACCTCGGCG CCGACGCCGA ACTCGATCAG
GTGATCTTCG ACGGGTCGTC GTTCGGCACC GCGACGCCGG CGGGCAGCGA CGAGGAGGCC
GACTACCTCG GCATCCCGGG CCTGCTGGAC GCCGACTCGA TGCGAGACCT GTTGCGGCGC
AGGCAGGAAG AGCAACTCAC CAAGCGCACC GAATCAGGCT TGGCGGTCCC GAAGACGACG
CACGGGCAGT TGCGCGATCT GCGCAGCGAA CTCAACACCC TGGTGTCGCT GGCGCATCAC
CGGACCGGCC GTCCGCACGG CTGGATCCAC AACGAGTTGC GCCGCCGCTG CGGTGGCCCG
CCGGTGGCCG CCGCGACCCG CGAACAGCTT CAGGAGCGCA TCGAAGCGGT GCGGGTCCTG
CAACGCGAGT TGTCGGCGTA G
 
Protein sequence
MRADAAPSTQ ALRGWQRRAL VKYLSAAPRD FLAVATPGAG KTTFALRIVA ELLAEGTVEA 
VTIVVPTEHL KIQWAQAAAR HGIALDPKFS NSNSQTSSDY HGVVVTYAQV ASHPTRHRVR
TENRKTLVVF DEIHHGGDAK SWGDAIREAF DDATRRLALT GTPFRSDDSP IPFVNYETGP
DGFARSKADH TYGYSDALAD GVVRPVMFMA YSGEARWRDS AGEEHAARLG EPLTAEQTAR
AWKTALDPKG EWMPAVIAAA DKRLQGLRQH VPDAGGMIIA SDQTTARAYA DLLVKITGEA
PTVVLSDDKG ASDRISEYSA GTSRWLVAVR MVSEGVDVPR LAVGVYATSA STPLFFAQAI
GRFVRSRRPG ETASIFLPSV PNLLLLASEM EAQRNHVLGK PHREPLEDPL DAELREQKRD
EPGEEENKIE YLGADAELDQ VIFDGSSFGT ATPAGSDEEA DYLGIPGLLD ADSMRDLLRR
RQEEQLTKRT ESGLAVPKTT HGQLRDLRSE LNTLVSLAHH RTGRPHGWIH NELRRRCGGP
PVAAATREQL QERIEAVRVL QRELSA
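
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can serve as an initiation codon and is read as methionine at the start of a CDS. The sketch below illustrates how the two sequences relate. The function `translate_cds` and its `initiator` parameter are hypothetical helpers written for this example. The codon table is built from the standard 64-codon layout (bases ordered T, C, A, G), and the code assumes, without checking, that the first codon is a valid start codon whenever `initiator` is true.

```python
# Codon-to-residue assignments of the standard code, which translation
# table 11 shares; the two tables differ in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a DNA fragment read in frame from its first base.

    Whitespace (the ten-base grouping used in this record) is ignored.
    If `initiator` is true, the first codon is assumed to be a start
    codon and is emitted as M regardless of its usual meaning.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[seq[index:index + 3]]
        if residue == "*":
            break
        residues.append("M" if initiator and index == 0 else residue)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate_cds("GTGCGGGCTG ATGCAGCGCC CAGCACCCAG"))
print(translate_cds("CAACGCGAGT TGTCGGCGTA G", initiator=False))
```

The first call yields MRADAAPSTQ and the second yields QRELSA, matching the first ten and last six residues of the protein sequence listed above.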