Gene Mvan_5056 details

Gene Information

Locus tag: Mvan_5056
Symbol:
ID: 4644793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5411709
End bp: 5413358
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 67%
IMG OID: 639808526
Product: type III restriction enzyme, res subunit
Protein accession: YP_955833
Protein GI: 120406004
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
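
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5411709
end_bp = 5413358

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1650 bp) and Protein Length (549 aa).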


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGACG GCCCACTGAT CGTCCAGTCC GACAAGACCG TGCTGCTCGA AGTCGACCAC 
GAGCAGGCCG GGGCCGCCCG CGCGGCCATC GCGCCGTTCG CCGAGCTCGA ACGCGCGCCT
GAGCACGTCC ACACCTACCG CATCACGCCG CTGGCGCTGT GGAACGCCCG GGCCGCCGGA
CACGACGCGG AGCAGGTCGT CGACGCGCTG GTGACGTTCT CCCGCTATGC CGTACCGCAG
CCTCTTCTGG TCGACATCGT CGACACGATG GCCCGGTACG GGCGGCTGCA ACTCGTCAAG
CATCCGGCGC ACGGTCTCAC GCTGGTCAGT TTCGACCGTG CCGTGCTCGA AGAGGTGCTG
CGGAACAAGA AGATCGCGCC CATGCTCGGC GCCCGCCTCG ACGACGACAC CGTCATCGTG
CACAACAGCG AACGTGGCCG CGTCAAGCAG ATGCTGCTCA AGATCGGCTG GCCGGCCGAG
GACCTGGCCG GGTACGTCGA TGGCGAAGCG CACCCGATCG AACTCGCACA GGACGGTTGG
CAGCTGCGCG ACTACCAACA GATGGCCGCC GACTCCTTCT GGGACGGCGG GTCTGGCGTG
GTGGTGCTGC CCTGCGGGGC GGGCAAAACA CTGGTGGGGG CGGCCGCCAT GGCCAAGGCC
GGCGCGACGA CCCTGATCCT CGTGACCAAC ACCGTCGCGG GCCGGCAGTG GAAGCGCGAA
CTGGTGGCGC GGACGTCGTT GACCGAGGAG GAGATCGGCG AATACTCGGG CGAGCGCAAG
GAGATCCGGC CGGTCACCAT CGCGACCTAT CAGGTCATCA CCCGCCGGAC CAAGGGCGAG
TACAAGCACC TGGAGCTCTT CGACAGCCGG GACTGGGGCC TGATCATCTA CGACGAGGTG
CACTTGCTGC CGGCGCCGGT GTTCCGGATG ACGGCCGACC TGCAGTCGCG CAGGCGGCTC
GGTCTGACGG CAACCCTGAT CCGCGAGGAC GGCCGGGAGG GTGACGTGTT CTCGCTGATC
GGTCCGAAGC GCTACGACGC GCCGTGGAAG GACATCGAGG CCCAGGGCTG GATCGCGCCC
GCCGAATGCG TCGAGGTTCG CGTCACGATG ACCGACAACG AGCGAATGAT GTACGCAACC
GCCGAACCCG ACGAGCGCTA CAAGCTGTGT GCGACCGCGC ATACGAAGAT CGCCGTGGTG
AAGTCCATTC TGGAGCGACA CCCCGATGAA CCGACGCTGG TGATCGGCGC CTACCTCGAC
CAACTCGACG AGCTCGGCAC CGAGTTGAAC GCACCGGTGA TCCAGGGATC GACGAAGAAT
GCCGAACGTG AGGCATTGTT CGACGCCTTC CGCCGCGGCG AGATCCGCAC TCTGGTGGTG
TCCAAGGTCG CGAACTTCTC CATCGACCTT CCCGAAGCGA GTGTGGCCGT ACAGGTTTCA
GGGACGTTCG GTTCCAGACA AGAAGAGGCA CAGCGGTTGG GCCGGTTGTT GCGCCCCAAG
GCCGACGGCG GCGGCGCCGT CTTCTACTCG GTGGTCTCGC GTGACAGCCT CGACGCCGAG
TACGCCGCGC ACAGGCAGCG GTTTCTTGCC GAGCAGGGCT ACGGCTACGT CATCAAGGAC
GCCGACGACC TGCTCGGTCC GGCGATCTGA
 
Protein sequence
MTDGPLIVQS DKTVLLEVDH EQAGAARAAI APFAELERAP EHVHTYRITP LALWNARAAG 
HDAEQVVDAL VTFSRYAVPQ PLLVDIVDTM ARYGRLQLVK HPAHGLTLVS FDRAVLEEVL
RNKKIAPMLG ARLDDDTVIV HNSERGRVKQ MLLKIGWPAE DLAGYVDGEA HPIELAQDGW
QLRDYQQMAA DSFWDGGSGV VVLPCGAGKT LVGAAAMAKA GATTLILVTN TVAGRQWKRE
LVARTSLTEE EIGEYSGERK EIRPVTIATY QVITRRTKGE YKHLELFDSR DWGLIIYDEV
HLLPAPVFRM TADLQSRRRL GLTATLIRED GREGDVFSLI GPKRYDAPWK DIEAQGWIAP
AECVEVRVTM TDNERMMYAT AEPDERYKLC ATAHTKIAVV KSILERHPDE PTLVIGAYLD
QLDELGTELN APVIQGSTKN AEREALFDAF RRGEIRTLVV SKVANFSIDL PEASVAVQVS
GTFGSRQEEA QRLGRLLRPK ADGGGAVFYS VVSRDSLDAE YAAHRQRFLA EQGYGYVIKD
ADDLLGPAI
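
The protein sequence corresponds to the gene sequence translated with translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The sketch below illustrates this under stated assumptions: the helper `translate_cds` and the constants are hypothetical names, the codon-to-residue mapping is the standard one (which table 11 shares for internal codons), and only the common bacterial start codons ATG, GTG and TTG are handled, although table 11 also permits a few rarer ones.

```python
# Standard codon-to-amino-acid mapping, built from the compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Simplification: only the common bacterial start codons are treated as starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # A start codon at position 1 is read as methionine regardless of its
    # usual meaning (GTG would otherwise encode valine).
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nucleotides of the gene sequence listed above.
first_block = "GTGACTGACG GCCCACTGAT CGTCCAGTCC"
print(translate_cds(first_block))
```

The example input is the first 30 nucleotides of the gene sequence, which should correspond to the first ten residues of the protein sequence. Passing the full gene sequence, with or without the spaces between 10-base blocks, works the same way.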