Gene Mvan_0415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0415 
Symbol 
ID4645893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp452093 
End bp453667 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content72% 
IMG OID639803923 
ProductPPE protein 
Protein accessionYP_951269 
Protein GI120401440 
COG category[N] Cell motility 
COG ID[COG5651] PPE-repeat proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.284434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCC CCGTCTGGAT GGCGCTGCCG CCCGAGGTGC ACTCGGCGCT GCTCAGCAGC 
GGCCCCGGTC CGGGGTCGAT GCTGGCCGCA GCAGCGACAT GGCAGTCGCT GAGCGCCGAG
TACGCCGCTG CCGCAGCAGA ACTCAGCTCG ATTCTGGCCG ACGTGCAGGC CGGGGCGTGG
GAGGGCCCGA GCTCGGAACA GTACGTGGCC GCGCACACGC CCTACCTGGC GTGGCTCGCG
CAGCAGAGCG CGGCCGGCGC CGCCGCGGCG GCGCAGCACG AGGCCGCCGC GGCCGCATAC
TCGACGGCGC TGGCGACGAT GCCGACGCTG CCCGAACTGG CGCTCAACCA CACCACCCAC
GCCGTGCTGG TGGCCACGAA CTTCCTCGGC ATCAACACGA TTCCGATCGC GATGACAGAG
GCCGACTACG TCCGCATGTG GATCCAGGCG GCCACCACCA TGGCGACCTA CCAGGCGGTC
TCGGGCGCTG CACTGGCCGC GACGCCGACC GCGACACCCG CCCCGTTCGT GCTGGCGCCC
GGTGTGGGGG AAGCCGGCAG GGCAGCAGCT GACGTCACCG CCTTCGCCGC GCAGGCGCAG
GCCGCGGAAG CCGGTTCAGC CCTGGATCTT TCCAACATCA TCGCCGACCT GATCCGCGCC
TACGGTGAAC TGCTCAGGTT CCTGTTCGAA CCGATCTTCG ACTTCCTGCG TGACCCGCTC
GGAAACACCA TCAAGCTCAT CACCGACTTC CTGACCAACC CGGCGCAGGC GCTGATCACT
TGGGGCCCGT TCCTGGCCGC CGTTGCCTAC CAAGCAGTTT CGTGGGTGGG CGCCTCGATC
CTGTACCCGT CACTTCTGCT CCTGCCGCTG GTGGCGACCA CGCTCGCGAT CGTGCTCGGG
GTGGGTGCCT ACCTCTTGGA GAACCTGCCG GCGCCCGCCG AAGACGCACC TGCCGAGGAA
CCCGCCGCAT CGTCGCCGGC GCCCACCCGC GCCGACCAGC CGAGTCCCGC AATCGCGGTC
TCGGCGCCCC CACCACCGAG CAGCGCGGCG GCGACGGTGG GCACGGTGGC GACGGGGACG
GCGCCGGCAC CGGGCGCTCC TGCCGCCGCC ACTGCGTCGT TCGTGCCTTA CGCGGTGGCC
GGCCGCGACC CCGGAACGGG CTTCTCGCCG ACCGTGCGTG ATTCGACCAG CGCCAAGGCG
CCGGCCTCCG GCATCCCGGC GGCGGCGTCG GGCGTCGCGG CCTCGGCGGC GGAGCGGCGT
AAGCGCAGGC GCCGCCAGAA GGACGAGATC GCCGGTCGCG CCTACGCCGA CGCGTACGCC
GACTACGAGC CAGAGCCCGA CGACGAACCA CCGGTGCGGC AGGAGCCGCG GATCGCCGCC
ACCGAGCGCG GTGCCGGCCC CATGGGCTTC GCCGGCACGG TGTCCAGGGA CGCCGCACAG
GCCGGCGGGT TGACCACGCT GCCGGGCGAC CCGTTCGGCG GCGGACCGAA AGCGCCGATG
CTGCCGGGGA CCTGGGACCC GGACACCGAA CCCGACGAAC GCCACAACCA CCACGATGGA
AAGGACTCTC AATGA
 
Protein sequence
MSSPVWMALP PEVHSALLSS GPGPGSMLAA AATWQSLSAE YAAAAAELSS ILADVQAGAW 
EGPSSEQYVA AHTPYLAWLA QQSAAGAAAA AQHEAAAAAY STALATMPTL PELALNHTTH
AVLVATNFLG INTIPIAMTE ADYVRMWIQA ATTMATYQAV SGAALAATPT ATPAPFVLAP
GVGEAGRAAA DVTAFAAQAQ AAEAGSALDL SNIIADLIRA YGELLRFLFE PIFDFLRDPL
GNTIKLITDF LTNPAQALIT WGPFLAAVAY QAVSWVGASI LYPSLLLLPL VATTLAIVLG
VGAYLLENLP APAEDAPAEE PAASSPAPTR ADQPSPAIAV SAPPPPSSAA ATVGTVATGT
APAPGAPAAA TASFVPYAVA GRDPGTGFSP TVRDSTSAKA PASGIPAAAS GVAASAAERR
KRRRRQKDEI AGRAYADAYA DYEPEPDDEP PVRQEPRIAA TERGAGPMGF AGTVSRDAAQ
AGGLTTLPGD PFGGGPKAPM LPGTWDPDTE PDERHNHHDG KDSQ