Gene Mvan_4433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4433 
Symbol 
ID4649049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4759058 
End bp4760710 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content66% 
IMG OID639807904 
Productextracellular solute-binding protein 
Protein accessionYP_955215 
Protein GI120405386 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.766057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGTCTGC TCGCGGTGGG ATCGGTGGTG GCGCTGTTGG TCAGCGGGTG CTCCAGCGGC 
CAGAACGACG TCCCGTCCAC CGGCGGTAGC GCCGAGCTGG GCGCCACCGC CGACATCAAC
CCGCAGGACC CGGCCACGCT GCAGCAGGGC GGCAACCTAC GGCTGGCGCT GACCGGCTTT
CCGTCGAACT TCAACAACCT GCACATCGAC GGCAACCTCG GCGAGATCGG TCGCATGTAC
CGGCCCACGC TTCCCCGCGC GTTCTTCATC AAGCCCGACG GAGAGATGAC GGTCAACAGC
GACTACTTCA CCGACGTCGA GCTGACCAGC ACCGACCCCC AGGTAGTCAC CTACACCATC
AACCCGAATG CGGTGTGGAC CAACGAACGA CCAGTGACCT GGGAGGACAT CGCCGCTCAG
ATCAACGCCA CCAGCGGCAA GGACAAGCGG TTCCTGTTCG CGTCCCCGAA CGGAAGTGAG
CGGGTCGCGT CGGTCACCAG GGGTGTCGAC GACCGGCAGG CGGTGGTCAC TTTCGCCAAG
CACTACGCCG ACTGGCGAGG GATGTTCGCC GGGAACGGGA TGCTGTACCC GAAAGAGATC
ACGCAGGATC CCGAGGCGTT CAACAAGGGC TTTCTCACCG GTCCCGGCCC GTCGGCGGGG
CCTTTCATGA TCACCACGGT CGACCGCGGC GCCCAGCGAA TCACCTTGGA GCGCAACCCG
AAGTGGTGGG GCACGCCTCC CGTGCTGGAC CGCATCACCT ACACCGTGCT CGATGACGCC
GCGATGCTGC CCGCACTGGA GAACAACGCG CTGGACTCGA TCGGCCTGGG GACTCTGGAC
GATCTCGAAC GTGCCCGTCG CGCCCAGGGT GTCACGATCC GCCGTGCCCC GGCCCCGAAC
TGGTATCACC TGACCCTCAA CGGCGCCGAG GGCGCGTTGC TGTCCGATCC GGCGCTGCGG
GCGGCGATCA CCAAGGGCAT CGACCGGCAG GCCATCACTG CGGTGTCGCA GCGCGGACTG
ACCGATGATC CGGCCGCGCT GAACAACCAC ATCTACCTGG CGGGCCAGGA GGGCTACCAG
GACAACAGCA TCGGCTTCGA CCCTGAGGCC GCCAAGCGCG AACTCGACGC GCTCGGCTGG
ACACTCAACG GCCAGTTCCG GGAGAAGGAC GGGAAACCGC TGACGCTGCG CGACGTGTTC
TACGACGGCG CCAGCACCCG CGCCATCGCC CAGGTCGCGC AGAACCAGCT CGCGCAGATC
GGCGTCAACC TGGAGTTGGT GCCCGCCGCG GGCGGTTCGT TGTTCCCCGA CTACATCACG
CCGGGTAACT TCGACATCGC CCAATTCGCC TGGGGTGGAG ACGCTTTCCC ACTGGGCGGG
TTGACCCAGA TCTACGCCTC GAACGGTGAG AGCAACTACG GCAAGATCGG CAGCCCGCAG
GTCGACGCCA AGATCGAGGA GACGCTCTCC GAGCTGGATC CCGCCAAGGC ACGCACGCTG
GCCAACGAAC TCGACAAGAT GATCTGGGAG ATCGGGCACA GCCTGCCGCT GTTCCAGGCG
CCCGGCAACG TGGCCGTGCG CAGCAATCTC GCCAACTACG GTCCTGCCGG CATCGGGGAC
ATCAACTACT CGGCGATCGG CTTCATGAAG TAG
 
Protein sequence
MRLLAVGSVV ALLVSGCSSG QNDVPSTGGS AELGATADIN PQDPATLQQG GNLRLALTGF 
PSNFNNLHID GNLGEIGRMY RPTLPRAFFI KPDGEMTVNS DYFTDVELTS TDPQVVTYTI
NPNAVWTNER PVTWEDIAAQ INATSGKDKR FLFASPNGSE RVASVTRGVD DRQAVVTFAK
HYADWRGMFA GNGMLYPKEI TQDPEAFNKG FLTGPGPSAG PFMITTVDRG AQRITLERNP
KWWGTPPVLD RITYTVLDDA AMLPALENNA LDSIGLGTLD DLERARRAQG VTIRRAPAPN
WYHLTLNGAE GALLSDPALR AAITKGIDRQ AITAVSQRGL TDDPAALNNH IYLAGQEGYQ
DNSIGFDPEA AKRELDALGW TLNGQFREKD GKPLTLRDVF YDGASTRAIA QVAQNQLAQI
GVNLELVPAA GGSLFPDYIT PGNFDIAQFA WGGDAFPLGG LTQIYASNGE SNYGKIGSPQ
VDAKIEETLS ELDPAKARTL ANELDKMIWE IGHSLPLFQA PGNVAVRSNL ANYGPAGIGD
INYSAIGFMK