Gene Mvan_3437 details


Gene Information

Locus tag: Mvan_3437
Symbol:
ID: 4646251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3653256
End bp: 3654386
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 71%
IMG OID: 639806913
Product: peptidase M24
Protein accession: YP_954238
Protein GI: 120404409
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
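
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 3653256
END_BP = 3654386


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1131 376
```

Under these assumptions the result agrees with the listed Gene Length (1131 bp) and Protein Length (376 aa).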


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGG ACCGTTTCGA CACCGAGGTG TACGCCCGCC GACTGCGAGC CGCCGCCGCG 
GCGGCCGCCG ACGCCGGGCT GGCCGGTCTG GTCATCACGC CCGGCTACGA CCTTCGCTAT
CTGGTGGGCT CGCGCGCGCA GACGTTCGAG CGGCTGACCG CGCTGGTGCT GCCGGCGACG
GGAGACCCGA CCGTCGTGGT GCCCCGCCTC GAGCTGGCCT CGCTCAAGGA CTCGGCGGTC
ACCGAACTCG GTGTGGCCGT GCGAGATTGG GTGGACGGCG ACGACCCCTA CCGCCTGGTC
GTCGAGGCGT TGCCCGGAGA CGGGCCGCTC GCCGTGGCCG TGACCGACTC GATGCCCGCG
CTGCATCTGC TGCCGCTGGC CGAGGTCCTC GGCGCGGTGC CGGTGCTCGC CACCGATGTG
CTGCGCAACC TGCGGATGGT CAAGGACCCC GCCGAGATCG ACGCACTGCG CAAAGCGGGC
GCGGCGATCG ACCGTGTGCA CGAACGTGTT CCGCAGTTCC TTCGGCCCGG GCGCACCGAG
GCCGACGTCG CTGCTGACAT CGCCGAAGCC ATTGTCGCCG AGGGCCATTC GGAGGTCGCG
TTCATCATCG TCGGGTCGGG GCCGCACGGG GCCGATCCCC ACCATGAATG TTCGGACCGC
GAGCTGCGCG CAGGCGACAT CGTCGTCGTC GACATCGGCG GCCCGTATGA CCCCGGCTAC
AACTCCGACT CGACACGCAC GTACAGCATC GGCGAACCCG ATCCGGAGGT GGCGCGCCGC
TACGCGGTGC TGCAGCGCGC CCAGCGCGTG GCGGTCGACA TGGTGCGCCC CGGGGTCACG
GCCGAACAGG TCGACGCGGC CGCCCGCGAC GTGCTGGCCG CCGAAGGGCT GGCCGAGGCG
TTCGTACACC GGACCGGACA CGGGATCGGT CTCTCGGTGC ACGAGGAGCC CTACATCGTG
GCGGGCAACT CACTGCCGCT GCAGGAGGGG ATGGCGTTCT CCGTCGAGCC CGGTATCTAC
TTCCCGGGGC AGTGGGGTGC GCGCATCGAG GACATCGTGA TCGTCACCGG CGACGGCGCC
GAGCCGGTCA ACCACCGGCC GCACGAACTC GTCGTCGTAC CGGTGCCCTG A
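
The gene sequence is laid out in blocks of 10 bases, so it needs to be stripped of whitespace before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for the 10-base block layout."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, exactly as laid out in the record.
first_line = "ATGAGCCAGG ACCGTTTCGA CACCGAGGTG TACGCCCGCC GACTGCGAGC CGCCGCCGCG"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}%")
```

Passing the full 1131 bp sequence through the same two functions should give a value close to the reported GC content, depending on how the source rounds.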
 
Protein sequence
MSQDRFDTEV YARRLRAAAA AAADAGLAGL VITPGYDLRY LVGSRAQTFE RLTALVLPAT 
GDPTVVVPRL ELASLKDSAV TELGVAVRDW VDGDDPYRLV VEALPGDGPL AVAVTDSMPA
LHLLPLAEVL GAVPVLATDV LRNLRMVKDP AEIDALRKAG AAIDRVHERV PQFLRPGRTE
ADVAADIAEA IVAEGHSEVA FIIVGSGPHG ADPHHECSDR ELRAGDIVVV DIGGPYDPGY
NSDSTRTYSI GEPDPEVARR YAVLQRAQRV AVDMVRPGVT AEQVDAAARD VLAAEGLAEA
FVHRTGHGIG LSVHEEPYIV AGNSLPLQEG MAFSVEPGIY FPGQWGARIE DIVIVTGDGA
EPVNHRPHEL VVVPVP
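
The protein sequence can be reproduced from the gene sequence by codon-wise translation. This is a minimal sketch, assuming the standard codon assignments (which translation table 11 shares for internal codons) and a gene that starts with ATG, as this one does; alternative start codons permitted by table 11 are not handled. The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCCAGG ACCGTTTCGA CACCGAGGTG"))  # MSQDRFDTEV
```

The output matches the first 10 residues of the protein sequence, and the final 21 bases of the gene translate to VVVPVP followed by a TGA stop codon.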