Gene Mvan_3815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3815 
Symbol 
ID4645955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4065594 
End bp4066868 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content71% 
IMG OID639807281 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_954602 
Protein GI120404773 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.548938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.293858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCAC GACTGCAGGA CAGCTACGAC GAGTTCGACC GCCAGCGCCT GGTGGCCGAA 
CCGGCGAAGA GCGCCGGCCT GCCCGGGACC GACACCGAGC ACCGCTCGGA CTTCGCACGC
GACCGGGCCC GCGTCCTGCA CTGTGCCGCG CTGCGCCGGC TCGCCGACAA AACCCAGGTG
GTGGGCCCCC GGGACGGTGA GACGCCGCGC ACCCGATTGA CGCATTCGCT GGAGGTCGCC
CAGATAGGCC GTGGGATGGC GATCGGCCTG GGGTGCGACC CGGACCTCGT CGATCTGGCG
GGCCTGGCCC ACGACATCGG TCACCCGCCC TACGGTCACA ACGGGGAACG CGCCCTCGAC
GAGATCATCA AGGGCTTCGG CGGTTTCGAG GGCAACGCCC AGAACTTCCG CATCCTGACC
CGCCTTGAGC CCAAGGTGCT CGACGAGCAC GGGCGCAGCG CCGGCCTGAA CCTGACCAGG
GCGTCGCTCG ACGCGGTGGC GAAGTATCCG TGGCCGCGTC AGGAGGGCCG GCGGAAGTTC
GGGTTCTACG GCGACGACAT GGCTGCGGCG CAGTGGGTGC GTCACGGCGC ACCCGCCGCC
CGGCCGTGCC TGGAGGCACA GGTGATGGAC TGGGCCGACG ACGTGGCGTA CTCGGTGCAC
GATGTCGAGG ACGGCGTCAT CTCCGGCCGT ATCGACCTGC GTGTGCTGGC CGACGCCGAT
GCGGCGGCCT CCCTCGCCCA CGTGGGCGCC CAGTCGTTCC CGACGCTGAC CCCCGACGAT
CTGGTTGCGG CCGCCGAGCG GCTCTCCCAG GTTCCTGTGG TGGCGGCGGT GGGCAAGTTC
GACGGCACCC TGTCCGCATC GGTGGCCCTG AAAACGTTGA CCAGCGAGCT GGTCGGGCGG
TTCGCCAACG CCGCCCTCAC CGCGACCCGC GACGTCGCCG GACCGGGGCC GTTGCGTCGA
TTCGACGCCG AGTTGACGGT GCCGAGCCTG GTGCGTGCCG AGGTGGTGCT GCTCAAGACC
CTTGCGCTGC AGTTCATCAT GTCCGATCAC CGGCACCTGC AGATCCAGGC CGACCAGCGC
AACCGGATCC ACGAGGTGGC GCTGGCGCTG TGGGGCCAGG CGCCGGGGAG CTTGGACCCC
CAGTTCGCGG CGGAGTTCGC CGCGGCCCCC GACGACGGCG CGCGCCTGCG GGTGGTGATC
GACCAGATCG CCTCTTACAC CGAGAGCCGA CTGGAGCGAG TGCACGAGGC GCGCTCGCCC
CGGCCTCTAG ACTGA
 
Protein sequence
MSPRLQDSYD EFDRQRLVAE PAKSAGLPGT DTEHRSDFAR DRARVLHCAA LRRLADKTQV 
VGPRDGETPR TRLTHSLEVA QIGRGMAIGL GCDPDLVDLA GLAHDIGHPP YGHNGERALD
EIIKGFGGFE GNAQNFRILT RLEPKVLDEH GRSAGLNLTR ASLDAVAKYP WPRQEGRRKF
GFYGDDMAAA QWVRHGAPAA RPCLEAQVMD WADDVAYSVH DVEDGVISGR IDLRVLADAD
AAASLAHVGA QSFPTLTPDD LVAAAERLSQ VPVVAAVGKF DGTLSASVAL KTLTSELVGR
FANAALTATR DVAGPGPLRR FDAELTVPSL VRAEVVLLKT LALQFIMSDH RHLQIQADQR
NRIHEVALAL WGQAPGSLDP QFAAEFAAAP DDGARLRVVI DQIASYTESR LERVHEARSP
RPLD