Gene Mvan_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1076 
Symbol 
ID4648461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1133098 
End bp1136079 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content72% 
IMG OID639804577 
Productprotein kinase 
Protein accessionYP_951920 
Protein GI120402091 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCG AGATCGCCTA CGAGCTGGCA GCTGCCGGAT TCGTCGACGC CGTCGAAGTG 
GGCAGGGGCG GTGGCGGCGT CGTCTACCGC TGCCATCAGC AGTCACTCGG GCGCACCGTC
GCCATCAAGG TGCTGGCGTC GTCTCTCGAC GAGGACGACC GCGAACGTTT TCTGCGCGAG
GGCTACGCGA TGGGCGGCCT GTCGGGGCAT CCGAACATCG TCAACATCCT GCAGGTCGGC
ATGACCGAGC AGGACAGGCC GTTCATCGTG ATGCCCTATC ACGCAAGGGG TTCGCTCGCC
GACCAGGTGC GCCGCGAGGG CCGCATCCCG TGGCCCGACG CGCTGCGCAT CGGCGTGAAA
CTGTGCGGGG CGCTGGAAAC CGCGCACCGC ACGGGCACTC TGCACCGCGA CATCAAACCG
GCCAACGTGC TCGTCAACGA CTACGGCGAC CCGCAGCTCA GCGACTTCGG CACCGCACGC
ATCGTCGGCG GATACAAGAC GGTCACCGGG TTCTTCACCG GAACGCTGTC CTACACCGCG
CCCGAGGTGC TCACCGGTAA ACCGCCGACG GTCGAGGCCG ACGTCTACTC GCTCGGCGCC
ACGCTGTACG CGATGATCGC CGGCAAGGCC GCCCACGAAC GCAACACCGA CGAGGAGCTG
ATCGCCCACT ACCTGCGGAT CACCTCGCAG CCGGTGCCGG ACCTGCGCCA CCTCGGCATC
CCGTCCGACG TGTGCTCGGC GATCGAGAAG GCGATGTCGC TCGAGTCCGC CGCCCGGTTC
GACTCCGCCG CCGAATTCGG CCGCGCCTTA CAGGAAGCCC AGCGGCACAA CGGGTTGACC
GCAGACACGA TGGCGATCGG CGAGCCCACC CCTGCTGCCG CGCCACCCGA AGGGACGCAA
GCACTTCCGC TGTCCGTACC GTCTGATTCC GTTGTCCTGC CGCCGATCCC GGCCGCCGCG
CCACCAGCAT CGCCGCCGGC CCCACCGCCG CCGCCACCGC CCGACATGTT CGCCCGCAGC
CCCTCGAACA GCCCACCCAT CACGCCGTGG CCCGTCGCCC CTCCCCCTTA CCCGGCATCC
GTCGCGCCGG CCAAGAACCG CAAGAAGACC TTGATCGCGC TCGCCGCCGC GGCGGCCGCG
GTGCTGCTGG TAATCGGCAC CGTCGTCATC GTGGCGTCGC GGGACAACAG AAGCGGCGAG
GCCGGCGGCG TGACGACCGC GGCGCGGCCG ACCGCCGAAG CGCAGCCCGG GTGGCGGCCG
ATCGCCGACT CCCGCATCCC CCTCGCCGCG GCGGCGGCCA CCGAGGCCGA CGGCACCATC
TGGATCTTCG GCGGCATGGG CGCCGACAAC CGGGTCAGCG GCGCCCACGA GGGGTACGAC
CCGGCAATCG ACAGCTGGAA GGGCGGCGAG GCCCTGCCCG TTCCGGTGCA GCGGGCGATG
TCGGTGACGT GGCAGGACAC CCCCGTCGTG CTAGGCGGCT GGCGTTCCGA GGGCGCCGAC
ACCAAGGTCG CCACCGACCA GGTGTGGCGG GTGGTCAACA GTCGCTGGGT GCAGTTGCCG
CCGCTGCTGC AGCCGCGGGC CGCGGCCGCG GCCGCCGTGG TGGGCGACCG CATCGTCGTC
ACCGGCGGTG TGGACGCCGC CGGAAAGGTG CTCGACACCA CCGAGGTGTA CGACGGCAGC
GGGTGGACGC AGGCCGCGCC GATGCCGACA CCGCGGCAGC TGCTGGCCGC GGCGTCGGAC
GGCGAGCTCG TCTACGCGAT CGGCGGCACG AACGGGACCG CGGACCTGGC GACGGTCGAG
GCGTACGACC CGGCCGCCGA CACCTGGACG GCCATGCCCG CGCTTCCCGA GCCGCGCAGC
GACTTCGGGG TCGCGGTCAC CGACGCCCGA CTGGTGGCGG TCGGCGGCAC CGCCGCGGGG
CGGCCCCTGA AAACGGTTAC CGCGCTTGAC CTGACGACGT CGACGTGGTC CGACCTGCCG
GACCTGGGCA CCGCACGCCG CGGAGCGGCC GTCGCCGCTG TGGGCAAGTC GGTGTACGTG
ATCGGCGGGT CGACCGGCGC CGGCGACGGC CAGGCCACGT CGTCGGCCGA AGCACTGAAA
CTGGCGCCGC GCACGCCGCA ACCCGCGGCG CAATGGCGCT CGCTGCCGGA CGCGCCCACC
GCCCGGCTGA TGATGGCGTA CACCGTGCTT GACGACCAGA TCTGGATCGC CGGCGGAATC
CGGGAGGGCG AGACGCTGGA CACCGTCGAG ACCTACGACA CCCGCACCCA ACAGTGGCAG
TCGCAGCCGT CGCTGCCGAT TCCGCTCAAC CATGCGGTGG CGGCGACCTA CCGCGGCGAG
GTCGTGGTGA TCGGCGGCGC CACCGACACC ATCACGCAGG CCTCAGACAA GGTGTTCGCG
TTCCGCGACG GCACCTGGGT CGAGTTGGCG AGCCTGCAGC ACGCGCGGGC GGCGCCCGCG
GCGGCGGTGG TCGACGACAA GCTCGTCGTG GTCGGCGGGC AGAACGACAA ACAGGTCGTG
CCCCAGACCG AGGTGTTCGA CGGCCAGTCG TGGACACAGG CAGCCGACAT GCCCACTCCC
CGTGAGCATC TCGCGGCGGT GTCCGACGGC GTGTACGTGT ACACGGTCGG CGGCCGGCTC
CTGAGCGCCG ACGAGAACTT GGCGGCCTTC GAGCGCTTCG ACCCGGAGTC CGGGAACTGG
GAGAAGCTGC CGGACATGCC GACTCCGCGC GGCAGCTACG GCGCGGCGTA CCTCGACGGT
CGCATCGTGG TCGTCGGCGG CGAGGAGCCG ACGCGGGTGC TGCCCACCGT CGAGATCTAC
GACATCGCCA ACCGAAAATG GAGCACCCAG GCACCGGTCA ACACGCCCGT GCACGGGCAG
GCGGTCGCGG CGGTCGGCTC CACGGTGTAC TGCATCGGGG GCGCCGACCG GCCGACTCAC
GAAGGGCCGG TGGCCACGGT CGAGGCGCTG GACTTCACGT AG
 
Protein sequence
MAGEIAYELA AAGFVDAVEV GRGGGGVVYR CHQQSLGRTV AIKVLASSLD EDDRERFLRE 
GYAMGGLSGH PNIVNILQVG MTEQDRPFIV MPYHARGSLA DQVRREGRIP WPDALRIGVK
LCGALETAHR TGTLHRDIKP ANVLVNDYGD PQLSDFGTAR IVGGYKTVTG FFTGTLSYTA
PEVLTGKPPT VEADVYSLGA TLYAMIAGKA AHERNTDEEL IAHYLRITSQ PVPDLRHLGI
PSDVCSAIEK AMSLESAARF DSAAEFGRAL QEAQRHNGLT ADTMAIGEPT PAAAPPEGTQ
ALPLSVPSDS VVLPPIPAAA PPASPPAPPP PPPPDMFARS PSNSPPITPW PVAPPPYPAS
VAPAKNRKKT LIALAAAAAA VLLVIGTVVI VASRDNRSGE AGGVTTAARP TAEAQPGWRP
IADSRIPLAA AAATEADGTI WIFGGMGADN RVSGAHEGYD PAIDSWKGGE ALPVPVQRAM
SVTWQDTPVV LGGWRSEGAD TKVATDQVWR VVNSRWVQLP PLLQPRAAAA AAVVGDRIVV
TGGVDAAGKV LDTTEVYDGS GWTQAAPMPT PRQLLAAASD GELVYAIGGT NGTADLATVE
AYDPAADTWT AMPALPEPRS DFGVAVTDAR LVAVGGTAAG RPLKTVTALD LTTSTWSDLP
DLGTARRGAA VAAVGKSVYV IGGSTGAGDG QATSSAEALK LAPRTPQPAA QWRSLPDAPT
ARLMMAYTVL DDQIWIAGGI REGETLDTVE TYDTRTQQWQ SQPSLPIPLN HAVAATYRGE
VVVIGGATDT ITQASDKVFA FRDGTWVELA SLQHARAAPA AAVVDDKLVV VGGQNDKQVV
PQTEVFDGQS WTQAADMPTP REHLAAVSDG VYVYTVGGRL LSADENLAAF ERFDPESGNW
EKLPDMPTPR GSYGAAYLDG RIVVVGGEEP TRVLPTVEIY DIANRKWSTQ APVNTPVHGQ
AVAAVGSTVY CIGGADRPTH EGPVATVEAL DFT