Gene Mvan_6044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_6044 
Symbol 
ID4644020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6453473 
End bp6455548 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content68% 
IMG OID639809509 
Producthypothetical protein 
Protein accessionYP_956803 
Protein GI120406974 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.173775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCG TGCCGCTGAA TCTGCTCGTC ACCCACAACG GAAAGTCCAA GCGCCAGCAT 
GTGACCTGCG TGCACAAGTG CGCCGACGCA TGCTCGAAGC CGGTGCCGAA CAAGACCGAC
AACGAGTACT TCGGCGACAT CGCCAAGGCC GTCTCCCGAC GCTCGCTCCT GCACGCCGGC
GGGGTGGCCG TCCTCGCCGT CGGAGCGGGC TCCGCGCTGG CCGCCTGCTC GAACACCACC
GAGCCGGCCC CGACGTCGTC GTCGCCGACC GCGGCCGCCA CCGAACCGCC GGCAGGGATG
AACTTCGCGT CCGTCGCGCC CAACAGCGAG GACGCCGTCG TCGTCGCCGA CGGCTACCAG
CAGGCCGTCG TGATCAGCTG GGGCGACCCG GTGCTGCCCG ACGCACCGAA GTTCGACGTC
AACAACCAGA CCGGCGCCGC GCAGCGCGGC CAGTTCGGCT TCAACAACGA CTTCGCCGGA
CTGCTGCCCA TCGACGGACA GCCCGGCCGC TTCCTGCTCG TCACCAACTT CGAGTACGCG
ACGCCGCAGT TCATGTTCCC CGGCTATGAC GCCGAGGCCC CGACCCGCGA CCAGTTCGAC
GTCGAGATCG CCTCGATGGG GATGGGTGTG GTCGAGGTCG AGCGCACCCC CGACGGCGGG
CTCCGCCCGG TCATGGGCCG CTACAACCGG CGCATCACCG CCGACAGCCC CTTCGCGATC
ACCGGACCCG CGGCGGGGAC CGACTTCGTC AAGACCCAGG CCGACCCCGA GGGTCGCACC
GTGCTGGGTA CCATCGCCAA CTGCGCGGGA GGTGTAACAC CCTGGGGCAC AGTTCTTTCC
GGCGAGGAGA ACTTCCACGG CTATTTCGGC GCCCCGGAAG GATCCCCCGC CCCCAACCCC
GTCGACGCGG ACCGCCACGA CCGCTACGGG GTCTCCCTGG AGCCCTCCGA ATTGCGCTGG
GAGACTTTCG ATCCGCGCTT CGACCTGGCC AAGACGCCCA ACGAGGTCAA CCGGTTCGGC
TACATCGTGG AGCTCAACCC GTGGGACCCC ACCTCGACAC CGGTCAAGCA CTCCGCACTG
GGCCGGTTCA AGCACGAGGG CGCCAACATC CACGTCACCG ATGACGGCAC CGTGGTCGCC
TACACCGGTG ACGACGAACG CTTCGACTAC ATGTACAAGT TCGTGTCCAG TCGCAAGGTG
CAGCCCGGTA AGGATCCCGC GGCGATGGCC AACAACATGG CGATCCTCGA CGAGGGCACC
CTCTACGTCG CCAAGCTCTC CAGCGACATC CCCGCCAACG AGATCGACGG GTCCGGCAAG
CTGCCCACCG CAGGGGCTTT CCGCGGCACC GGCACCTGGC TTCCGCTGCT GCGCTCGGGC
CCGAACGGAC GGGCCGAATC CCTCGTCGAC GGCATCACCG CGCAGGAGGT CGCCGTGTTC
ACCCGCATGG CCGCCGACAA GGTGGGCGCC ACCAAGATGG ACCGGCCCGA GGATTTCGAG
GCGAACCCCG CGACCGGGAA GGTCTACGTC GCGCTGACCA ACAACTCCAA GCGCGGCGCC
GAGGGCGAAG CCGCCGCAGA TGCCTCAAAC CCCCGCAACG ACAACAAGAG CGGCCAGATC
CTGGAGATCA CCGACAACCA CGCCGGCACC GACTTCACCT GGGATCTGCT GCTGGTCTGC
GGAGACCCGC AGGCCGCCGA CACCTACTAC GGCGGTTTCG ACAAGACCAA GGTGAGCCCG
ATCTCCTGCC CGGACAACCT CGCCTTCGAC AGCCACGGCA ATCTGTGGAT CTCGACCGAC
GGCAACGCGC TCGACTCCAA CGACGGCCTG TTCGCGGTGG CACTCGACGG ACCCAACCGC
GGTGAGACGA AGCAGTTCCT GACCGTGCCG CTGGGGGCGG AGACGTGCGG ACCGGTGGTC
ACCGACGATC TGGTGACGGT GTGCGTGCAG CATCCGGGCG AGAACGACGA GAACAGCATC
GACAGCCCGC AGTCCCGGTG GCCCGAAGGC GGCAACGGCA CGGCGCGGCC GTCGGTCGTG
GCGGTGTGGA AGAACGGCGG CCAGATCGGC GTCTAG
 
Protein sequence
MALVPLNLLV THNGKSKRQH VTCVHKCADA CSKPVPNKTD NEYFGDIAKA VSRRSLLHAG 
GVAVLAVGAG SALAACSNTT EPAPTSSSPT AAATEPPAGM NFASVAPNSE DAVVVADGYQ
QAVVISWGDP VLPDAPKFDV NNQTGAAQRG QFGFNNDFAG LLPIDGQPGR FLLVTNFEYA
TPQFMFPGYD AEAPTRDQFD VEIASMGMGV VEVERTPDGG LRPVMGRYNR RITADSPFAI
TGPAAGTDFV KTQADPEGRT VLGTIANCAG GVTPWGTVLS GEENFHGYFG APEGSPAPNP
VDADRHDRYG VSLEPSELRW ETFDPRFDLA KTPNEVNRFG YIVELNPWDP TSTPVKHSAL
GRFKHEGANI HVTDDGTVVA YTGDDERFDY MYKFVSSRKV QPGKDPAAMA NNMAILDEGT
LYVAKLSSDI PANEIDGSGK LPTAGAFRGT GTWLPLLRSG PNGRAESLVD GITAQEVAVF
TRMAADKVGA TKMDRPEDFE ANPATGKVYV ALTNNSKRGA EGEAAADASN PRNDNKSGQI
LEITDNHAGT DFTWDLLLVC GDPQAADTYY GGFDKTKVSP ISCPDNLAFD SHGNLWISTD
GNALDSNDGL FAVALDGPNR GETKQFLTVP LGAETCGPVV TDDLVTVCVQ HPGENDENSI
DSPQSRWPEG GNGTARPSVV AVWKNGGQIG V