Gene Mvan_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2154 
Symbol 
ID4649131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2298830 
End bp2300485 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content72% 
IMG OID639805639 
ProductDak phosphatase 
Protein accessionYP_952975 
Protein GI120403146 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCTC GGCGCCTCGA TGCATCCGCC CTGCGGCACT GGGCCCATGC GGCCGTCGCT 
GACCTGATCA GCCACACCGA CGAGATCAAC CAGCTCAACG TCTTCCCGGT CGCGGACGCC
GACACCGGGA CGAACATGCT GTTCACGATG CGTGCCGCCT GGGCGGAGGC CGACAGTTGC
GACGCCGGCG ACGACGTCAC CGCGGTCGCC GCCGCCCTGG CCGGCGGCGC GTTGCACGGT
GCGCGCGGCA ACTCCGGCGT GATCCTGTCG CAGATTCTGC GGGCGGTCGC CGAGGTCACC
GCCTCCGCCG CCGAGGACCG CGACGGAGTG CTGGCCGACA TCACCGGTCC ACTTCTGAGC
TCCGCGCTGC GGCACGCCGT CACCCTGGTC GTCACATCCA TGGGCGAGGC CGTCCCGGGC
ACCATCGTCT CGGTCCTGCA CGATGCGGCC GAGGCGGCCG AGGAGGCGGT GTCGGCGGAG
GCCGAGCTGG CCGACGTCGT CGCCGCATGT GCGCACGCCG CTGGGGCCGC GCTCGACCGC
ACGCCGACCC AGCTCGACGT GTTGGCCGAC GCCGGAGTCG TCGACGCCGG GGGCCGGGGC
CTGGTGGTCC TGCTGGACGC GCTGTGCGCC ACCCTGACCG GCCACGCGCC CCGGCGTGGG
GTCTATCAGC CGTCCCCGAA CAAGTCGTCG GCCCCGTCGT CGGCGGCACA CGCGACCCCG
AGCTTCGAGG TGATGTACCT GCTCGCCGAC TGCCACGCCG AGGACATCGA AAAATTGCGC
GCAGCGCTCG AGGAACTCGG CGACTCGATC GCCATCGCGG CGTCGGGCTT CGGCGAGGCC
GGGGGAGCCG GCCCTGGCCA CTACTCGGTG CACGTGCACG CGGACGACGC AGGTGCCGCC
GTCGAGGCCG GGCTGGCGGT CGGCAGCCTG AGCCGCATCC AGATCACGTC CCTGAGCGGC
GGCGGCGACC GCCACCCGGC CGGAGGCTGG AGCAGACAGC GTGCGGTGCT CGCCGTCGTC
GACGGTGACG GCGCGGAAAG CCTGTTCACC GGTGAGGGCG CCGAGATTCT GCGGCCGCAG
TCCGATGTGC CGGTCAGCGC CCAGCAGCTG CTGCACGCCC TGGTCAACAC CGGTGCCGCG
CAGGTGATGG TGCTGCCGAA CGGGTACGTG GCCGCCGAGG AGATCGTCGC CGGATGCGCC
GTGGCCTCCG ACTGGGGGAT CGACATGGTG CCCGTGCCGA CCGGGTCGAT GGTGCAGGGG
CTGGCCGCAC TGGCCGTCCA CGATCCCGAG CGTCGCCTGG TGGACGACGG CTACACCATG
GCCAGGGCCG CGGCGGGTGC CCGGCACGGG ACGGTGCGGG TGGCCGCCGA GGAAGCGCTG
ACCTGGGCCG GTGCCTGCAA ACCTGGCGAC GGACTCGGCA TCGCCGGCGA CGAGGTCGTC
ATCGTCGGCG CCGACGTCGT GGCTGCCGGT GCGGGACTGA TCGACCTGAT GTTGGCGGCG
GGTGGTGAGT TGGTCACCGT TCTCACCGGT GACGGCGTGG GACCCGAGAT CGGGGAGGCG
CTCGCCGAGC ACGTCCACCA CCACCACCCC GGTACCGACC TGGTCACCTT CCACACCGGG
CACCGTGTCG ACGCGCTGGT CATCGGGGTT GAGTGA
 
Protein sequence
MSARRLDASA LRHWAHAAVA DLISHTDEIN QLNVFPVADA DTGTNMLFTM RAAWAEADSC 
DAGDDVTAVA AALAGGALHG ARGNSGVILS QILRAVAEVT ASAAEDRDGV LADITGPLLS
SALRHAVTLV VTSMGEAVPG TIVSVLHDAA EAAEEAVSAE AELADVVAAC AHAAGAALDR
TPTQLDVLAD AGVVDAGGRG LVVLLDALCA TLTGHAPRRG VYQPSPNKSS APSSAAHATP
SFEVMYLLAD CHAEDIEKLR AALEELGDSI AIAASGFGEA GGAGPGHYSV HVHADDAGAA
VEAGLAVGSL SRIQITSLSG GGDRHPAGGW SRQRAVLAVV DGDGAESLFT GEGAEILRPQ
SDVPVSAQQL LHALVNTGAA QVMVLPNGYV AAEEIVAGCA VASDWGIDMV PVPTGSMVQG
LAALAVHDPE RRLVDDGYTM ARAAAGARHG TVRVAAEEAL TWAGACKPGD GLGIAGDEVV
IVGADVVAAG AGLIDLMLAA GGELVTVLTG DGVGPEIGEA LAEHVHHHHP GTDLVTFHTG
HRVDALVIGV E