Gene Mvan_1024 details


Gene Information

Locus tag: Mvan_1024
Symbol:
ID: 4644245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1073963
End bp: 1075537
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 67%
IMG OID: 639804525
Product: extracellular solute-binding protein
Protein accession: YP_951868
Protein GI: 120402039
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
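
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that ends
    with a stop codon (assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1073963, 1075537)
print(gene_len, "bp;", protein_len, "aa")
```

With the coordinates listed here this yields 1575 bp and 524 aa, matching the Gene Length and Protein Length fields.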


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.216522
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCCA GCACCGCGTT GAAGCCTCGA CGGTTGGTCG CCGGCCTCGC CCTCGCGACC 
GTCACCGTCG CGACCGGCGG CTGCACCGTC GCCAATTCGG GCGGCGGCGG CTACGACCCC
GACACTCTGC GCATCGTCCT GCAACAGGAG CCCCCGACTC TGGAACCGTG CGAGAGCTCG
CTGACGTCGA CAGGCATCGT GGTCCGCTCC AACATCACCG AACCCCTGCT CGAACGCGAC
GCCAATACCG GTGAGCTGCA GCCTCTGCTG TCCACCGGGT GGGAGCAGAC CTCCCCCAAC
GAATGGACCT TCACCCTTCG TGACGGCGTG ACCTTCTCCG ACGGGGCGCC GTTCACCTCC
GCGGACGCCG CGTTCTCGAT CGATCGCGCG GTCAACTCCG ACCTGCAGTG CAACGTCGAC
GGCTACGTCT TCGGCGACGA GAAGCTGGGC CTGCAGACTC CGAACCCCAG CACGGTCGTC
GTCAGCACCA CGAAGCCCGA TCCGATTCTG CCGCTGCGTA TCTCGTTCGT CGAGATGGTG
CCGCGCACGA CCAGCACCAC CGAGAAGGTC CGCGAACCCA TCGGGACAGG TCCGTACGCG
ATCGAACGCT GGGACTACGG CCAGAAGCTC GTCCTGGCCC GCAACGCCAC CTACTGGGGC
CAGGCACCGT CTTTCGCCAA GGCCGAATAC CAATGGCGCA GTGAGGGAAG CGTGCGCGCT
GCGATGATCA CCAACGACGA GGCCGATATC GCCACCGGCT TGGGCCCTGA AGACGGAGCG
GGCGATCTGG GTGTCCCGTT CCAGAACAAC GAGACGACCG CGCTGCGCAT GCAGGCCACC
GAACCGCCGC TCGACGACAT CCGAGTGCGG CAGGCGATCA ACTACGCGGT CAACCGCACC
GGCATCGTCA AAGCGCTGTT CCGTGACCTC GGCCAGCCCG CCGCCCAGTT GATCCCCTCG
GGCGTCGTCG GCTACAACGC CGAGCTGCAG CTGTGGCCGC ACGACCTCGA CAAGGCCCGA
GCCCTGATCG AGGAGGCCAG GGCCGACGGC GTCCCGGTCG ACCGCGAGAT CCGGCTGATC
GGACGTACCG CGCAGTTCCC GAAGATCACC GAGACCATCG AGGTGCTGCA GAGCGAGTTC
ACCGAGATCG GCCTCAACGT CAAGATCGAG ATGATGGACA CCGCCGCCCA GTTGGAGTAC
CAGCTGCGGC CGTTCCCGCC CGACACCGGG CCGTACCTGC TGATGATCAT GCACGGCAAC
CAGGCCGGCG ACGCCGCATT CACCCTCGAC CAGTACATGC TGTCCGACGG TCCGCAGGCG
GCCTACGGCA CACCGGAATT CGACGCCAGG ATTCGCACCG CCGAGGCGCT GACCGGCCAG
GCCCGCCAGG ACGCGTTCGC CGCCCTGTTC GCCGAGGAAC CGCAGGAGAT CGTCCAGATG
GCCTACATCG CGCACATGAA GGGGATCCTC GGCAAGTCCG AGCGCATCGA CTACACCCCC
AACCCGGCTA CCGGCGATGA AATGTTGCTG GCTGCAATGA CTCCCGCGGG TAACGACCGC
ACCGATCAGT CCTGA
 
Protein sequence
MNPSTALKPR RLVAGLALAT VTVATGGCTV ANSGGGGYDP DTLRIVLQQE PPTLEPCESS 
LTSTGIVVRS NITEPLLERD ANTGELQPLL STGWEQTSPN EWTFTLRDGV TFSDGAPFTS
ADAAFSIDRA VNSDLQCNVD GYVFGDEKLG LQTPNPSTVV VSTTKPDPIL PLRISFVEMV
PRTTSTTEKV REPIGTGPYA IERWDYGQKL VLARNATYWG QAPSFAKAEY QWRSEGSVRA
AMITNDEADI ATGLGPEDGA GDLGVPFQNN ETTALRMQAT EPPLDDIRVR QAINYAVNRT
GIVKALFRDL GQPAAQLIPS GVVGYNAELQ LWPHDLDKAR ALIEEARADG VPVDREIRLI
GRTAQFPKIT ETIEVLQSEF TEIGLNVKIE MMDTAAQLEY QLRPFPPDTG PYLLMIMHGN
QAGDAAFTLD QYMLSDGPQA AYGTPEFDAR IRTAEALTGQ ARQDAFAALF AEEPQEIVQM
AYIAHMKGIL GKSERIDYTP NPATGDEMLL AAMTPAGNDR TDQS
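
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to produce this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares; table 11 differs in its permitted start codons, so a CDS beginning with an alternative start such as GTG or TTG would need its first residue set to M separately. This gene begins with ATG, so that case does not arise here.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired
# with the amino acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence, copied from the listing above.
print(translate("ATGAACCCCA GCACCGCGTT"))  # MNPSTA
print(translate("ACCGATCAGT CCTGA"))       # TDQS
```

The two fragments translate to the first six and last four residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.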