Gene Mvan_4398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4398 
Symbol 
ID4648737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4719752 
End bp4720780 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content63% 
IMG OID639807869 
Productvirulence factor Mce family protein 
Protein accessionYP_955180 
Protein GI120405351 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAGTG ACCTGCCGGG CGTGGCGGCA CGTCTCGGGC TGTTCACCAT GGCATGCGCG 
GCCGGGACTT TCGCGCTGAT CATGATCTTC GCGCAGCTGC GGTTCGCTCC TTCGAACCAC
TACGACGCAC AGTTCGTGAA CGTGAGCGGG CTGAAGGCGG GCGAGTTCGT GCGCATCGCC
GGGGTGGAAG TGGGCAAGGT GAAACGCATC GCGGTGAACG AGGACGGGAC CGCCACCGTG
ACGTTCGGCG CTGACGATTC GGTGCTGTTG ACCGAAGGTA CACGCGCGCT GATCCGCTAC
GACAACCTCA TCGGCGACCG GTATCTGGAG TTGCAGGAGG GCGCCGGTGC AGTGCAGGCA
CTGGAACCCG GTGGCACCAT CCCAGTCGCG CGCACCCAGC CAGCCCTCGA CCTCGACGCC
TTGGTGGGCG GATTCCGGCC ATTGTTCAAA GCCTTGGATC CAACCCAGGT CAATGCCTTG
AGCGCGCAAC TGGTCCAGGC TTTCCAAGGG CAGGGGACCA CCATAAACTC GTTCCTGGCC
CAAACCGCCG AGATGACCGC CGCCCTGGCC GACCGGGATG CGCTCATCGG AGAAGTGATC
ACCAACCTCA ACACCGTCCT CGGATCGCTT GCCGACGAGA GCGACAACTT CGATGAAGCG
GTCACCTCAC TGTCGCAACT GGTCGAAGGC CTGGCGGCAC GCAAGACCGA CATCGGAGAG
TCGGTATCGC ACTCGAACGC TGCGGCCGCG TCGATCACCG ATCTACTGGC CGATATCAGA
CCGCCGTTCG CTGAGACGGT GGCCCAGTCC GACCGGATGA ACAGCGTCGT TCTGTCTGAT
CATGAGTACG TCGACGACCT GCTTGCGACC CTGCCCGACG CCTACCGGGT GCTCGGAAGA
CAAGGCATCT ACGGCGACTT CTTCGCCTTC TACCTCTGCG ATCTCGTATT GAAAGTCAAC
GGCAAGGGCG GCCAGCCGGT GTATGTCAAA GTGGCTGGGC AGGATTCAGG ACGGTGCGCG
CCACGATGA
 
Protein sequence
MRSDLPGVAA RLGLFTMACA AGTFALIMIF AQLRFAPSNH YDAQFVNVSG LKAGEFVRIA 
GVEVGKVKRI AVNEDGTATV TFGADDSVLL TEGTRALIRY DNLIGDRYLE LQEGAGAVQA
LEPGGTIPVA RTQPALDLDA LVGGFRPLFK ALDPTQVNAL SAQLVQAFQG QGTTINSFLA
QTAEMTAALA DRDALIGEVI TNLNTVLGSL ADESDNFDEA VTSLSQLVEG LAARKTDIGE
SVSHSNAAAA SITDLLADIR PPFAETVAQS DRMNSVVLSD HEYVDDLLAT LPDAYRVLGR
QGIYGDFFAF YLCDLVLKVN GKGGQPVYVK VAGQDSGRCA PR