Gene Mvan_1531 details


Gene Information

Locus tag: Mvan_1531
Symbol:
ID: 4648264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1621712
End bp: 1622911
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 65%
IMG OID: 639805028
Product: integrase catalytic subunit
Protein accession: YP_952368
Protein GI: 120402539
COG category: [L] Replication, recombination and repair
COG ID: [COG2826] Transposase and inactivated derivatives, IS30 family
TIGRFAM ID:
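
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1621712
end_bp = 1622911

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1200 bp
print(protein_length, "aa")  # 399 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.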


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.205612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.437676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTCGG GAGTGCCGGT GCCGTTGTCG GTGAAGCGGG TGTTTTTCGA TTTGGTCTGT 
GGCGGGATGA CCACCACGGA AGCGTCGGTG CAGGTGGGTG TGTCGCGCCG GACCGGATGG
TCGTGGTGGC GTCACGCTAG GGGCATGAAA CTGCGCAAAG GTGGCGACGG ACTGGGTGGA
TTGGCGCAGG TCGGTGATCT GACCCGTACC GGCGGGCGTG GGCATCGGCT CAGCTTCACC
GAACGCTATG AGATCGACCG CGCCTTACAG GCCGGGCTCA GCTACGCCGC GATCGGCGAT
CAACTGGGGC GCGACCGCTC GGTGATCTGG CGTGAAGTGC AGCGAAACCG GCTGCCCGAT
GGCACGTATC ACGCGTTGAT GGCCCATGCC CGGGCCACCG AGAACGCCCG CCGGCCCAAA
GCGTTCAAAC TCGACGACCC TGGGCTGTGC GCGGACATCG AGGCGTGGAT GGACCAGGGC
TGGAGCCCGA AACTGATCTC CCAGGTATTG GCCCGCGACG CCGGCAGTGC CAAGGTCAAG
CGGGTGAGCC ACGAAACCAT CTACCAAAGC CTGTATGTCC AAACCCGTGG CCAGCTACGC
GCTGATCTGC ACAAATGTCT GTCCACCTCA CGAACTCAGC GCAAGCCGCG CGGTCAGACT
GAACGACGCG GCAGGTTCGG CGACGTGATG CGCATCAGCG CGCGGCCTGC CGAGGCCGCC
GACCGGGCGG TGCCCGGGCA TTGGGAAGGG GATCTGATCA TCGGCGCCCG TGGTGGCAGC
GCGATCGGGA CTTTGGTTGA ACGCAGCACC CGGTTCACCA TCCTGCTGCA TCTACCTGGC
GATCACACCG CCGAGACCGT GGCTGCGGCG ATGCTCAAAG CCATGGGCGA TTTGCCCGAC
CATTTACGCC GTTCGATCAC CTGGGACAGG GGCAGCGAAA TGGCCGGATG GCAAGACATT
TCGCTGCAGC TGCAATCACC GGTCTACTTC TGTGATCCGC ACTCGCCGTG GCAGCGCGGC
ACCAACGAAA ACACCAACCG GCTCCTGCGG TTCTGGTTCG AGAAAGGCAC CGACCTCAGC
GGCTACACCC CCGACGACCT CAAAGCCGTC GCCGACAAAC TCAACACCCG ACCCCGACCC
ACCCTCGATC TGGACACACC CGCCCAACGC ATGGCCCAAC TCCTTAGCCA AGCCGCCTAA
 
Protein sequence
MPSGVPVPLS VKRVFFDLVC GGMTTTEASV QVGVSRRTGW SWWRHARGMK LRKGGDGLGG 
LAQVGDLTRT GGRGHRLSFT ERYEIDRALQ AGLSYAAIGD QLGRDRSVIW REVQRNRLPD
GTYHALMAHA RATENARRPK AFKLDDPGLC ADIEAWMDQG WSPKLISQVL ARDAGSAKVK
RVSHETIYQS LYVQTRGQLR ADLHKCLSTS RTQRKPRGQT ERRGRFGDVM RISARPAEAA
DRAVPGHWEG DLIIGARGGS AIGTLVERST RFTILLHLPG DHTAETVAAA MLKAMGDLPD
HLRRSITWDR GSEMAGWQDI SLQLQSPVYF CDPHSPWQRG TNENTNRLLR FWFEKGTDLS
GYTPDDLKAV ADKLNTRPRP TLDLDTPAQR MAQLLSQAA
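
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is given on the coding strand in frame from its first base, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments
# of the standard code, which translation table 11 shares).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGTCGG GAGTGCCGGT GCCGTTGTCG GTGAAGCGGG TGTTTTTCGA TTTGGTCTGT"
print(translate(first_row))  # MPSGVPVPLSVKRVFFDLVC
```

The translated first row matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the Protein Length and GC content fields to be checked as well.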