Gene Mvan_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1020 
Symbol 
ID4644241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1069919 
End bp1070923 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content67% 
IMG OID639804521 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_951864 
Protein GI120402035 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.113907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.741319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGA ACCTGCTCGA GGTCCGCGAT GTGCGCAAAT CGTTCCGCGT CCCGCACGCC 
GGCAAGAACA AGCTGTGCGC GCTCGACGGC ATCACCCTCG ACCTCAAGCG CGGTGAGACG
CTCGGCCTGG TCGGCGAGTC CGGCTGCGGA AAGTCCACCC TGGCACGCAC TTTGATGATG
CTGGAACGTC CCGACGAGGG CACGGTCCGG TTCGACGGCG TCGATCCGTT CAGCCTCAAA
GGTGCGGACC TGCTGAAGTT CCGCCGCCGT GTGCAGATGG TGTTCCAGGA TCCCTACGCC
TCGCTGAACT CCCGGATGTC AGCGGCCGAG ATCATCGCCG AGCCGTGGCG CAGCCACAAG
GGCGTCGTCC AGAACCGTCA TGACCGCGAC ATGAGGGTGC GCGGGCTGCT GGATCTGGTG
GGGCTGGGCG CCAAGGCTGC CGGCAAGTAC CCGCAGGAGT TCTCCGGCGG GCAACGCCAG
CGCATCGGCA TCGCCCGCGC GCTTGCGCTG AACCCTGACG TGATCGTCTG CGACGAGCCG
GTGTCCGCGC TGGACCTGTC GGTACAGGCA CAGGTGCTCA ACCTGCTCAA CGACCTGCAG
GAGCAGCTGC AGATCTCCTA CATCTTCATC TCCCACGACC TGTCGGTGGT GCGCCACGTC
GCCGACCGCG TCGCGGTGAT GTACCTGGGC CGGATCATCG AGAACGGGCC CACCGAGCGC
GTTTTCGAAC GGTCCAACCA CCCCTACACC GCCGCGCTGA TGTCGGCTGC CCCCACACTG
GACGGCGCGC AACGCGCACA GCGCATCCTG CTCAAGGGCG AGGTGCCCTC GCCGATCGAT
CCACCGTCGG GATGCCGATT CCGCACCCGG TGCTGGAAGG CCACCGACGT GTGCGCGACC
ACCACACCTT CGGCCGCCGT GGACCCGGAA TTGGCCGACC ACACCGCGCT GTGCCACCAC
CCACTGCTGC TGGAAGGGAG TGGTCGTGAA ACCGTACCGG CATGA
 
Protein sequence
MSENLLEVRD VRKSFRVPHA GKNKLCALDG ITLDLKRGET LGLVGESGCG KSTLARTLMM 
LERPDEGTVR FDGVDPFSLK GADLLKFRRR VQMVFQDPYA SLNSRMSAAE IIAEPWRSHK
GVVQNRHDRD MRVRGLLDLV GLGAKAAGKY PQEFSGGQRQ RIGIARALAL NPDVIVCDEP
VSALDLSVQA QVLNLLNDLQ EQLQISYIFI SHDLSVVRHV ADRVAVMYLG RIIENGPTER
VFERSNHPYT AALMSAAPTL DGAQRAQRIL LKGEVPSPID PPSGCRFRTR CWKATDVCAT
TTPSAAVDPE LADHTALCHH PLLLEGSGRE TVPA