Gene Mvan_1006 details


Gene Information

Locus tag: Mvan_1006
Symbol:
ID: 4645791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1052827
End bp: 1054143
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 70%
IMG OID: 639804507
Product: protein of unknown function DUF1100, hydrolase family protein
Protein accession: YP_951850
Protein GI: 120402021
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
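
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1052827
end_bp = 1054143

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1317 bp) and Protein Length (438 aa).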


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.342716
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGAC ACGCCGTCGC CCGCCGACTG GTCACCAGCG GTTGTGCGCT GACCCGCACC 
ACCGAGTGGG CGGCGTCGCG ATGGGGCACG GCGTACTTCC TGCCCGCGCT GTTCGCAGAC
CGTGTCACGC ACCTCGGTGG GATTGACAAG CGTTTTTTCG CAGAGCAATT GGCGCAGTGC
CGCTCGTTCC GTGACGGCTC CTGGGCGGGG CATTGGCAGG CTATCGCCGC CGACCACGCC
GGCGTCGCCG ACGCGGCCCT GGCCCGGCTC GGCGGGCCCA CCGTCGCGCA GATGCTCGCC
GGCCCGGTCG ACACGTCCGC ACTGGGTGAG CTGCTCACCC CCGCCGTGTC GATCCTGGCC
GACCGGGGGC CGGTGGCGTC ACCGGACGCC GTGACGACGT TCCGACTGCA CAGCGGCGGC
GCGGGCGATG ACGCCGCGAT CGCGGTGGAT GCGCTCATCA AGGTGGTCAC GTACAAGTTC
GCGGCGGCGT GGCCGGGCTG GACACCGCAG CGACTGAAGG CGCACGCGCA GTCACGGCGG
CTGTGCGATG TCCTCACCGA GGCATTGGCC CCGGCGATGG GTCTGAGCAT CGAGCACCTA
CGGGTCCCCG TCCCCGGCGG TGACGTCGTG GAGGGCGCCG CGGTGTTCCC GCTCGGTGTC
CGTGGTTCGC CGACCGTGTT GTGCGCCAAG GGACTTGAGG GCGTCGTGGC CGAGACCCTG
CTGCCGTGGC TGAAGTTCCG CGGGCACGGC CTGGGGATGT TCATCATGGA GATGCCGGGC
ACCTACACCT ACCGGCAACC GCTGACCGTC GCCGCGGAGA ACGTGTATCG CGCGGTCATC
GACCGGCTGG CGGCCGACCC CCGCGTCGAC GCAGACCGGA TCGGCATGCT GGGGCTCAGT
TTCGGCGCAT ACTGGGCGGC CCGGATGGCC GCCGCCGATC CGCGTCTGCG CGCCGTCGTC
GCCAACGGGG CGCCGGCGGA CCGCACGTTC CGGCCGTCGG GAGCCTTCGG CACCCCCGAG
ATCATGATGT GGACGATGGC GAACACCACG CACGCCCGCA GCACCGCCGA CCTGCTGACC
AAGCTGCGGG CGCTGTCGCT GAAGGACCTT TACCCGCGAA TGACCGCACC GCTGTTGGTG
ATCAACGGCG ATTCCGACAC GCTTGCGAGT ACCCGGGACT CGATCGACAT CGCGACGTAC
GCCCCCAACG CACTGCTCAA GCTCTACCCG GGCGACGACC ACTGCGCGAT GGGACACGCA
CGGCAGTGGT GGGATCTGGC CGTCCGGTTC TTGGCCGACC AGCTTGGCGC TGTGTGA
 
Protein sequence
MAGHAVARRL VTSGCALTRT TEWAASRWGT AYFLPALFAD RVTHLGGIDK RFFAEQLAQC 
RSFRDGSWAG HWQAIAADHA GVADAALARL GGPTVAQMLA GPVDTSALGE LLTPAVSILA
DRGPVASPDA VTTFRLHSGG AGDDAAIAVD ALIKVVTYKF AAAWPGWTPQ RLKAHAQSRR
LCDVLTEALA PAMGLSIEHL RVPVPGGDVV EGAAVFPLGV RGSPTVLCAK GLEGVVAETL
LPWLKFRGHG LGMFIMEMPG TYTYRQPLTV AAENVYRAVI DRLAADPRVD ADRIGMLGLS
FGAYWAARMA AADPRLRAVV ANGAPADRTF RPSGAFGTPE IMMWTMANTT HARSTADLLT
KLRALSLKDL YPRMTAPLLV INGDSDTLAS TRDSIDIATY APNALLKLYP GDDHCAMGHA
RQWWDLAVRF LADQLGAV
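
The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a listing could be cleaned up, its GC content computed, and the nucleotides translated for comparison with the protein sequence. The function names (`clean_sequence`, `gc_content`, `translate`) are hypothetical helpers, not part of any database tool. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in its permitted start codons).

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGGCCGGAC ACGCCGTCGC CCGCCGACTG")
print(translate(fragment))
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first block of the protein sequence, MAGHAVARRL.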