Gene Mvan_4739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4739 
Symbol 
ID4647745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5076761 
End bp5079136 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content68% 
IMG OID639808208 
Producthypothetical protein 
Protein accessionYP_955519 
Protein GI120405690 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.448945 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCG TGGGCCACAT CGCTTGGGTC GGATCGCTGG CGGTGGCGCT GGGAGTAGCG 
GGCGCCGCTG CGAGCCCGCC CGCGATCAGC TGGGCGGAGT CCTCGGATTC CGCTTCCCAG
GACACCTCGG CGCCAGATCC CGGCGAGGCA TCCGGCGATG ACACCGCGGA CTCCGAACCG
GACCACGAGT CGGCACCCTC TGCCGGAGAG AGTGATTCGG CCGACGACGA CTCGGCCGAC
GACGACGGGC CGGTCGACCT TCCCACCGGC GACGCAGGTT CCATCGACAT CCCGGACACT
GCGGAATCGG ATCGCGGCAA CCACGGTGAC GACACCATGA CCGCCGCGCC ATCGGAGGAG
ACCGCCGACG ATCTGTCGCA CACCCAGTCC GACCCGGACC TGTCAGCCCT CGAATCCGTC
GCAGACCACG AAGCCCCGCC TGCAGAACCC GACACCGCAC CGGCAACGAC GAGGGCTGTG
CAGCCGATCA CGGCCCCGGC CGAGCGCGAG GTGATCGAAT CCGCCTACAC CCCAACGAGT
GCTGCGCAGA CGACCAGCAC GGCGGTGTTC GCGCCTCCGC TCACGCCGGC ACCGCCGAGC
ACCCCGGTCC CCCCTGTCGC ATCACTGGCA GCGTTCGCCT CGATGCGCCG CGACAACGAG
CCGGCGCTGC GCGAGCGGAC CACCGAGCAG AGCAACGCAG CCCACGCAAC CGTGGCGGCT
GCGGGCGGTC CGACGCCGAA TCCGGTGGCT ATGGCGAGTC CGCCCGCCGG CTCGATCATG
GCGGCGGTCG AGTCGATGCT CACCGCCGCC CGGAACTGGT TTGAGCGCAC GTTCTTCGCG
GCGACGCCCG TCTACCCGCC GCAGACGGTG ATGGTGACCG TCGGACCGGG TTCCGCCAGC
GACCCGTTCA CGTTGACCGC CGCGGACGCC GACGGCCGTG CGTTGACCTA CAGCGTGCAC
GGCTCGACAG GGGGAACCGG GACAGCTGCA GGCACCCTGA CGATTTCGGG CGACAAAGCC
ACCTACACCC CGCCAGCGGA CTGGAACGGC GAAACCGCCT ACACCGACAC GTTCTCGGTC
ACGGCGTCGG ATCAGCGGGA TGGTTTCCAC ATCCACGGCC TGTCCGGCCT GATCTACAAC
CTGACCTTCG GGTTGCTCGG CCGCGCCGGT CACTCCGCCA CCACCACCGT GACCGTCGGC
GTCCGCGCCG CGTCAACCCC ACCCGGTCCG GACCCGGAAC CGCCCGACGG GCCGGGCGTG
CCGGGATCGT TTCCGGTGTC GTTCGCCAAC AACAGCGGCT ATGCCGACGA CGAGGTGTAC
GTGATGGTCA TCGGTCAGGT CACCCCGGGA CAGTGGTCCT GGGTCGATCG TGACGGCGTG
GCCCACCACA TCGATCATGC CGCCGCCGAT GCCCCAGGCC ACCTGGAAAA AGACGGCGTC
AACTATGCCG ACATGACGTT CACCCTCGCC GAGGCGGACG ACCTGCGCAT CCCGCCGGAG
TTGCTGGGTG GTCGCATCTA CGTGTCGCTG GAACAACCGC TGTACATCGC CATCAGCGCC
GACGATTCGG GCTGGGCCTC CCCTGATGGG GCCAACCCTG CCGACCCCAA CTACGAAACG
GTCTTCGACT GGTACGAAAT GAGTTACGAC AACGGCTCAG TTCCGTTCGG CGGCAACACC
ACCCAGGTAG ACCAGTTCGG TTTCCCGTTC TCCTTCACCG TGTCACAGGA TGCCACCGGA
TTCTCGGCTA CCCGCGGTAT CGCGTTGAGT CGACGCGAGG TCTTCAGTGC GTTCGAGGAC
ACCGTCCCGG AGGCATACCA GGCGCTGATC ATCCGGGATG AGGACGGAAA CCCGATTCGG
ATCCTGGCGC CGCGCTCACA CCAGCCCGGC AGCCTGGCGA CCTGGTTCGA CGAGCCGGTC
GACGACTTCT GGCACACCTA CCGGACCACT GAATTCGTCT ATCACGGAAC GGGTTACACG
GTGACCGGGC GAGTCGGGGA CGACGACCGG TTCGCTTACG CCGTCACCGC CGCCGGCGGC
GCCTCGACGG CGCACAGCAT GACCAAGCCG AGTACCGCGG ACGTCTTCCG CGCCGACGGG
CCATTCGTCG GCACCGGCCT GCAAGGTGCA TTCCTGGCCG AGCTCGACGC CGCGTTCAAC
CGTGGAGTGG CCACCTCGCC TGACGACTGG AACGACGTGT CGGCCTACTA TCCGGCCGGC
GGACGCTGGA ATGACTGGGC CCGCTTCTTC CACGCCCACA GCCTGAACGG GTTCGCCTAC
GGATTCCCCT ACGACGACGT CAACAGCCAG AGTTCGGTGG TGATCCTGAA CAACGCCGAA
CCGCTGACCG ACCTGAGGCT CACGCTGACG TCTTAG
 
Protein sequence
MDSVGHIAWV GSLAVALGVA GAAASPPAIS WAESSDSASQ DTSAPDPGEA SGDDTADSEP 
DHESAPSAGE SDSADDDSAD DDGPVDLPTG DAGSIDIPDT AESDRGNHGD DTMTAAPSEE
TADDLSHTQS DPDLSALESV ADHEAPPAEP DTAPATTRAV QPITAPAERE VIESAYTPTS
AAQTTSTAVF APPLTPAPPS TPVPPVASLA AFASMRRDNE PALRERTTEQ SNAAHATVAA
AGGPTPNPVA MASPPAGSIM AAVESMLTAA RNWFERTFFA ATPVYPPQTV MVTVGPGSAS
DPFTLTAADA DGRALTYSVH GSTGGTGTAA GTLTISGDKA TYTPPADWNG ETAYTDTFSV
TASDQRDGFH IHGLSGLIYN LTFGLLGRAG HSATTTVTVG VRAASTPPGP DPEPPDGPGV
PGSFPVSFAN NSGYADDEVY VMVIGQVTPG QWSWVDRDGV AHHIDHAAAD APGHLEKDGV
NYADMTFTLA EADDLRIPPE LLGGRIYVSL EQPLYIAISA DDSGWASPDG ANPADPNYET
VFDWYEMSYD NGSVPFGGNT TQVDQFGFPF SFTVSQDATG FSATRGIALS RREVFSAFED
TVPEAYQALI IRDEDGNPIR ILAPRSHQPG SLATWFDEPV DDFWHTYRTT EFVYHGTGYT
VTGRVGDDDR FAYAVTAAGG ASTAHSMTKP STADVFRADG PFVGTGLQGA FLAELDAAFN
RGVATSPDDW NDVSAYYPAG GRWNDWARFF HAHSLNGFAY GFPYDDVNSQ SSVVILNNAE
PLTDLRLTLT S