Gene Mvan_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1067 
Symbol 
ID4648452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1122005 
End bp1124332 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content68% 
IMG OID639804568 
Productpeptidase S15 
Protein accessionYP_951911 
Protein GI120402082 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTCC GGAGGGTCGC TAGGCACGGT AAGTACATAG CCGTGACTGC GGGGACATAT 
GTGGGTCGAG TCGGTGGGTT GGCGGTAGCT CTCGGTATCG GAGCGGCGAT TCTGGCAGGC
ACAGGTGTCG CCTTGGCGGA CGACGGCGCT GCGTCGGACA CGGCGACCAG CACGTCGCAG
ACGTCGGAGG CGGCACCGGC GAAGGACATG CCCGACGACA GCACCTCGGA GGCCGACACT
GCCGACGGGG CCGAACCGAA CGGTGACGTC GAGGACGAGT CGGTCGCAGA GGCGGACCCC
GACTCGGCCG AACCGGAGGA AGACGCCGAG GACGAGGTCG TCGAACCCGA GGCCGCCGAC
GAGTCGGCGC CTGTCGTCGA ACCCGAGCTC GCCGAAGATC CGACGCCCGT CGTCGAAGGG
CCCGTCGACG CCGAGGCCGC CGAGGAAGAC GACGAACCCG AGTTCGTCTC GACTCCGATC
TCGAACGTCG GCCACACGGA CTTCGAGGCC GATAAGGACG AAGAGCCCGC CGCCCCCGCC
GAGTCCGCGC TGGCACTGTC CGTGCTGGCC ACCGCCCGCG AAAAGGCCGA TGAGCCAACG
ATCGACACCG TCGGCGCCCA GGCGGCCCTC AGCCTCGTCG ACGACGAGTA CCCGATCCCG
ACCGACGTCG AGGTCACCGA GCTCAAGCCG GCGTTCGAGT GGCTGCAGCA GATCCCGGTA
CTCGGGAGGT TCGTGGTGAC GCCGATCGTG CACCTGATCC ACGCGATCCC GTTCGTCTCC
GAGATCCTGC ATCCGCTGAT CGGCTTCCCG ATCGACCACG ACGCCCCGCC CGGGACGCCG
CGGGCACGCA CCGTGCGGCT GAAGTCGTTC GACGGCACCG AGATCTATGT GAACTTCATG
CCGGCCAAGG GATTGCAGGC CGGTGAGTCG GCGCCGACGG TGCTGTCCGG TCCGGGTGTC
GGACTGCCGG GGTCGACGAC CCTGGGCCTG GACATCGACA GCTTCCTGCC GCACGACGTG
GTCGGCATCG GCATGCTGCG CAAGGCGGGC TACAACGTCG TCACCTGGGA TCCGCGCGGG
GAGTGGCATT CGGGTGGCCG GATGCAGCTG CAATCACCGG ATTTCGAGGG CCGCGACATC
TCGCACATCA TCAGCTGGCT GTCCACCCTC GGCGAGGTCG ACAGCGTCGA CGGCGATCCC
AAGGTCGGGA TGGTGGGTGT GTCCTACGGC GGCGGCATCC AATTGTCCGC GGCCTCAGTC
GATCACCGGA TCGACGCGAT CGTGCCGACC ATCGCGTGGA ACAGCCTGGT GGACGCGATC
TTCCCGCGTC AGGCGGTCAG CAGCGTCTGG GGCACGTTCC TGAGCGCCCT GCTGGTGGGC
GTGGGGGCGC GTCCCAACGA GCGGGTCCTG CCTGCGGTGA TCGAGGCGGT GCTCACCGGT
GAGGCGGCGC AGTCCGACAT CGACCTGTTC AACAGCCTCA ACTTCGCCGA CCAACTCGCC
AACATCACGG CGCCGACGTT GTTGATCCAG GGCACGGTGG ACACGCTGAC CACGCTGGCA
CAGGCCGACG CCAACGCCAA GGCGTTGATC GCCGCGGGCA CCACGACGAA GGTGGTGTGG
TTCTGTGGCG GCCACGGTGC GTGCCTGAGC AGTTTCAACG ACGGCGAGGT GGTGTGGCGG
GAGACGATGG AATGGCTGGA CCGCCACGTC AAGGGTGACG AATCCATCGA TCCCGGACCG
CAATTCGAGT GGGTGGATCA GCACGGCGAC TGGTATTCCT CGGAGGTCTA CCCGGTGGCT
TCCGGCGAGT CGGTCACCGC GACGCTGGCC AACGGCGGGA AGACGCTGCC GTTCGTGCCG
TTCATCGGCG GGTCGGGACC CAACCCGGCG ATCCTGACGC GGGGTGTGAT CCGCGCGGTG
ATGGGGTTGC CGTCGGGGGC GCCGGCACTG AACGCGGTGA ACCTGCGCGT GCCGGACGCG
ACGGAGCCGA CGCATCTGCT GGGGGCGCCG CAGCTGACGC TGACTTACTC GGGCACCGGT
AACGCCAAGC ATGTGTACGC GCAGCTGGTC GACGACGAGA CCGGGCTGGT GCTGGGCAAC
CAGGTGACCC CGATTCCTGT TGTGCTGGAT GGGGAGTCGC ACACGGTGAC GTTCTCGATG
GAGCAGATCG CGCACACGCT CAAACCGGGG GAGTCGGTGA CGCTGCAGGT GCTCACGTCG
TCGTTCAGCT TCCTGAACTT CTACTCGTAC GGGGCGATCA CAGTCGAGGG CATGTCGGTG
AAGCTGCCGA CGATGGCCGC GGCGCAGGTG GTCGCGGTCG CGGCGTGA
 
Protein sequence
MLLRRVARHG KYIAVTAGTY VGRVGGLAVA LGIGAAILAG TGVALADDGA ASDTATSTSQ 
TSEAAPAKDM PDDSTSEADT ADGAEPNGDV EDESVAEADP DSAEPEEDAE DEVVEPEAAD
ESAPVVEPEL AEDPTPVVEG PVDAEAAEED DEPEFVSTPI SNVGHTDFEA DKDEEPAAPA
ESALALSVLA TAREKADEPT IDTVGAQAAL SLVDDEYPIP TDVEVTELKP AFEWLQQIPV
LGRFVVTPIV HLIHAIPFVS EILHPLIGFP IDHDAPPGTP RARTVRLKSF DGTEIYVNFM
PAKGLQAGES APTVLSGPGV GLPGSTTLGL DIDSFLPHDV VGIGMLRKAG YNVVTWDPRG
EWHSGGRMQL QSPDFEGRDI SHIISWLSTL GEVDSVDGDP KVGMVGVSYG GGIQLSAASV
DHRIDAIVPT IAWNSLVDAI FPRQAVSSVW GTFLSALLVG VGARPNERVL PAVIEAVLTG
EAAQSDIDLF NSLNFADQLA NITAPTLLIQ GTVDTLTTLA QADANAKALI AAGTTTKVVW
FCGGHGACLS SFNDGEVVWR ETMEWLDRHV KGDESIDPGP QFEWVDQHGD WYSSEVYPVA
SGESVTATLA NGGKTLPFVP FIGGSGPNPA ILTRGVIRAV MGLPSGAPAL NAVNLRVPDA
TEPTHLLGAP QLTLTYSGTG NAKHVYAQLV DDETGLVLGN QVTPIPVVLD GESHTVTFSM
EQIAHTLKPG ESVTLQVLTS SFSFLNFYSY GAITVEGMSV KLPTMAAAQV VAVAA