Gene Mvan_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1066 
Symbol 
ID4648451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1120356 
End bp1121816 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content72% 
IMG OID639804567 
Productprotein kinase 
Protein accessionYP_951910 
Protein GI120402081 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCC CGAAGGGCGG CTCTCGGTTG GGCACCCGGT TCGGGCCCTA CGAGCTGCAG 
TCGGTGATCG GGGTCGGGGG TATGGGTGAG GTGTACCGCG CCTACGACAC CGCGCGCGAG
CGGATGGTGG CGATCAAACT GCTGCGCCCG GAGATGGCCG CCGACCACAG CTTCCAGGAA
CGTTTCCGCC GGGAGTCGCG GGTTGCGGCG CGGCTGCAGG AGCCACACGT CATCCCGGTG
CACGATTTCG GCGAGATCGA CGGCGTGCTC TACATCGACA TGCGCTTGGT CGAGGGCGCC
AGCCTCAAGG ACGTGCTGCG CGCCGAGGGG GCGCTGCCGC CGGCGCGGGC GGTGTCGATC
CTCCGGCAGG TCGCCGCGGC GCTGGACGCC GCGCACGCCA ACGGCCTGGT GCACCGGGAC
ATCAAGCCGG AGAACGTGCT GCTGACCCCG GACGACTTCG CGTATCTGGT CGACTTCGGG
ATCGCGCACG GCGGCGGTGA GGCGTCGGTG ACGTCGACGG GCCTGGTGGT CGGGTCGAGC
GCGTACATGG CGCCGGAGCG GTTCAGCGGG GAGCGTGGCG GTCCGGCCTC GGACGTGTAC
TCGCTGGCAT GCCTGCTGTA CGAGTCGCTG ACGGGTCGGG CGCCGTTCGA GGCGGCCGAC
GTGCGGCAGG TGTGGAGCGC GCACATGTTC GCACCGCCTC CGCGGCCGAG CATCATGCGC
CGCGGCGTCA GCAGGACGTT CGACGACGTC GTCGCGCGCG GCATGGCCAA GCAGCCGCAC
GACCGGTATC CGACGGCGGG GGAACTGGCG CGGGCGGCCT CTGCTGCCGC CGAGGGCGCA
CCTGCCGCGG AGACGGTGGC CCCGGTGACG CCGCCGTCGA CGCGGCAGTT CTCGACGGTG
TACCCGACCC CGCCGCCGAT GCCGCCGATG CCTGCTGTGG CGCCGCCGGC CCGGCGGTTC
AGCCGCGGTC AGGTGGGGCT GGTGGCCGCG ACGGTCGTGA TGTTCACCGC GGCGCTGGTG
CTGGCCGCGG TGCTGGTGTT CAGCGGCGGG GACACCGGGT CGCCGTCGCC TCGGATCGCG
GCACCGCCGT CGTCGTCGTC GGAGGCGCCC TCGACTTCTT CGGCGTCGGC GGTCGAGGGG
GTGTCGGGCA CGGATTCGCA AGGCTTCGTG GGGCATACGG CCCGATGCGA CTCGGGCAGC
ACGCCTGCGG CGCTGATCCG GACCTCGCTG TCGCTGGCGA TCATCTGCGA GACGAGCGAC
GGTGACTACT ACTACCGCGG GGAGCGGCTG CGCGACGGCG CGAACCGGGA GATCCAGGGC
GCACAGCGTT CCGGTGACGG GTTCGTCGTC ACCGGTTCGG ACGGGGCCCG CTACGACGTG
CAGCCGGACC AGTTGACGAT CTCGAGCAAC GGAAGTGTGG ACTCGGCGGA GCCGGCGTTG
GAGTACGGCT CGGCGCAATA G
 
Protein sequence
MSTPKGGSRL GTRFGPYELQ SVIGVGGMGE VYRAYDTARE RMVAIKLLRP EMAADHSFQE 
RFRRESRVAA RLQEPHVIPV HDFGEIDGVL YIDMRLVEGA SLKDVLRAEG ALPPARAVSI
LRQVAAALDA AHANGLVHRD IKPENVLLTP DDFAYLVDFG IAHGGGEASV TSTGLVVGSS
AYMAPERFSG ERGGPASDVY SLACLLYESL TGRAPFEAAD VRQVWSAHMF APPPRPSIMR
RGVSRTFDDV VARGMAKQPH DRYPTAGELA RAASAAAEGA PAAETVAPVT PPSTRQFSTV
YPTPPPMPPM PAVAPPARRF SRGQVGLVAA TVVMFTAALV LAAVLVFSGG DTGSPSPRIA
APPSSSSEAP STSSASAVEG VSGTDSQGFV GHTARCDSGS TPAALIRTSL SLAIICETSD
GDYYYRGERL RDGANREIQG AQRSGDGFVV TGSDGARYDV QPDQLTISSN GSVDSAEPAL
EYGSAQ