Gene Mvan_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0438 
Symbol 
ID4647813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp475685 
End bp477334 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID639803946 
Productextracellular solute-binding protein 
Protein accessionYP_951292 
Protein GI120401463 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.397858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATAT TTCGCCGAGC ACTGATCATC GCCTGCGTCG CATCGCTGGC CGCTTTCGGA 
GTGGCGGCCT GCGGCAGCGA CGACAGTTCC GGCGGCGGCG GGGGCAGCGG CGGTGACATC
ACCGTGAACG CAACGTCGTT CCCCGACTAC ATCGACCCGC AGCTGTCCTA CACCGTGGAG
GGCTGGGAGG TGTTGTGGAA CGTCTACACC CCGCTGCTGA CCTACAGGCA CGCCAGGGGC
AAGGAGGGCA CCGAGGTGGT CCCGGCCCTG GCCGAGGCGC TGCCGGACAT CTCCCCGGAC
GGGAAGACCT ACAAGCTCAA ACTGCGGCCG AACATGAAGT ACTCGGACGG CACCCCGATC
AAGGCGTCCG ACTTCACGTA CGCGATTCAG CGCCTGTTCA AGACGGATTC GGGCGGCTCG
GTCTTCTACA ACGTCATCGC CGGCGCCACG GAGTACGCCG ACGGTGCCGC CGACACGATC
ACCGGCATCA CCACCGATGA CGGGACCGGC GACATCACCA TCCAATTGAC CGAACCCAAC
GGCACTTTCG ACAATCTGCT GGGGCTGATG TTCGCCGCGC CGATCCCGCA GAGCACGCCA
CTGGACGCCG ACGCGACGAA CAACCCGCCA CCGGCGAGCG GACCGTTCAT GTTCACCACG
GTCGACGCCC CGCGCACGCT GACGATGGAA CGCAATCCGC AGTTCCAGAC CGTCAAGGAC
GCGGGCGCCG ACGAGGTCGC CGACGCCGGG GTGGACAAGA TCACCCTCAT CGAGAACAAG
AACCAGAGCG CGCAGGTGAC CGACATCATG CAGAACAAGG TCGATTTCAT GATGGACCCG
GTGCCATCGG ACCGGCTGCA GGAGGTGAAG AGCCGCTACT CCGACCGGTT CCGGATGGAG
GACTCGATCA ACACCTACTA CATGTTCATG AACACCGAGC GGGCCCCCTT CAACGACGTC
AGGGTGCGAC AGGCGATCAA CTACGCCATC GACCCCGAGG CGCTGAACCG GATCTTCGGC
GGCCGGCTGC ACCCGACTCA GCAGGTTCTG CCACCGGGCA TGCCGGGCTA CCAGGAATAC
AAGCTGTATC CGGGGCCGGA CATGGACAAG GCCAGAGCGC TGATCGCCGA GGCGAATCCG
GCCGACCGCG ACATCACGGT GTGGACCGAC GACGAGCCGG ACCGCAAGCG CATCGGTGAG
TACTACCACG ACCTGCTCAC CCAGCTCGGC TTCAACGCCA CGCTGAAAGT GATTGCGGGC
GACGTGTACT GGACGACGGT GGGCAACCAG TCCACCCCGG ACGTGGACAC CGGCTTCGCC
GACTGGTTCC AGGATTTCCC GCATCCCGAC GACTTCTTCC GTCCGCTGCT GCACGGTGAC
AGCATCCTGC CGACCAACGG GAACAACCTG TCCCGCGCCA ACATCGCGGA GAACAACGCC
AAGATGGACG AACTGGTCAC CAAGCAGATC ACCGACGAGG GTGTCGAACA GCAGTACGCC
GACTTGGACC GGGCCTACAT GGAGCAGGCG GTGTGGGCCC CGTACGGCAA CGAGCAGTTC
ACCACGTTCC TGTCGGAGCG GATGGACTTC GACAAGTCGT ATCATCATCT GCTGTTCAAG
CAGGATTTCA CCTCGTTCGC GCTGAAGTAG
 
Protein sequence
MHIFRRALII ACVASLAAFG VAACGSDDSS GGGGGSGGDI TVNATSFPDY IDPQLSYTVE 
GWEVLWNVYT PLLTYRHARG KEGTEVVPAL AEALPDISPD GKTYKLKLRP NMKYSDGTPI
KASDFTYAIQ RLFKTDSGGS VFYNVIAGAT EYADGAADTI TGITTDDGTG DITIQLTEPN
GTFDNLLGLM FAAPIPQSTP LDADATNNPP PASGPFMFTT VDAPRTLTME RNPQFQTVKD
AGADEVADAG VDKITLIENK NQSAQVTDIM QNKVDFMMDP VPSDRLQEVK SRYSDRFRME
DSINTYYMFM NTERAPFNDV RVRQAINYAI DPEALNRIFG GRLHPTQQVL PPGMPGYQEY
KLYPGPDMDK ARALIAEANP ADRDITVWTD DEPDRKRIGE YYHDLLTQLG FNATLKVIAG
DVYWTTVGNQ STPDVDTGFA DWFQDFPHPD DFFRPLLHGD SILPTNGNNL SRANIAENNA
KMDELVTKQI TDEGVEQQYA DLDRAYMEQA VWAPYGNEQF TTFLSERMDF DKSYHHLLFK
QDFTSFALK