Gene Mvan_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3050 
Symbol 
ID4643271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3215103 
End bp3216590 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content69% 
IMG OID639806527 
Producthypothetical protein 
Protein accessionYP_953858 
Protein GI120404029 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.184286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTGTT CGCGGGAGGA CGTGTTGGGG GCCTTCGATG CCCTCGACGC GGTCGTGGAG 
TCGATCCTGG CGTTGGACTA CGACGCACTC AGTGCTGCCG AGCGGGTGCG TCTGGAGGCC
CGCCTGGAAC GCAACCTGCG CCGTATCCCC ACTGTGGAAC ACGAGTTGCT CGCCTCGGTG
ATCGCCGAGA CCGAGCCGGC CCGGTTGGGT GAGGGGTCGT GGAAGAAGGT GTTGACCACC
GCGCTGCGGA TCTCCGGTGC CGAGGCAGGG CGGCGGCTGA AGCGGGCCAA AACCCTGGGC
CCGCGGCGCG GACTGACCGG GACGCCGTTG CCACCGTTGT GGGAGTCCAC CGCCGCCGCC
CAGGCCCAGG GCCTGCTGAG TGAGGAGCAT GTCGCGGTGA TCGCGGCGTT CCACAAGAGG
CTGCCGGCCT GGGTCGATAT CGAGACCCGG GCCGAGGCCG ACCGCCAGCT GGCTCACGCG
GGTTCCGGAC TGGATCCCGA AGGCCTCGAC GAGGCCGCCG GGGTGCTGCT GGCCATGATC
AACCCCGACG GCGCCCAACC CTGCGACAAA GAGCGGGCCC GCAAACGGGG CATCCGGATC
AGCAAGCAAC ACCCCGACGG CACCGCCACC ATCTCGGGCA CCCTCACCCC CGAAGCCCTG
GCCATCTGGC AGGCGATCTT CGCCAAAGAA GCCGCCCCCG GCGCCAACAA CCCCGAGTCT
GAACACACCG AGGACAGCAC ATCCGGCGGG GCGGCGGACG ACGCATCCGA CGCCCCGGGC
GATCATGCCG GTGCCGCTTC GAGCGCGGCA TCAGGCGCAT CGGACGCTGC TGAGCATGAT
CCACAGCCCG AACGGTGCGG CTCTGATACC CGTACCCAGG CTCAGCGCAA CCACGACGCC
TTCCTGGCCG TCGGGCGCCG CCTCCTGGAA TCCGGAGAAC TCGGCACCCA CAACGGGTTA
CCGGTGACGG TGATCGTCTC CACGACGCTG CAGGAGCTCG AAAAAGGCGC AGGGGTCGCG
GTCACCGGCG GCGGATCGCT GTTGCCGATG CCGGATCTGA TCCGGCTTGC CGCCCGGGCC
CACCACTACC TGTATGTCTA CGACCAACAC AGCGGCAAAT CCCTCTACCT GGGCCGGGCC
AAACGGTTGG CCAACGCCGC GCAGCGGATC GTGCTGCACG CCCGCGACCG CGGGTGTACG
CGGCCGGGCT GCACCGCACC CGGGTACTGG TGCCAGGCCC ACCACGCCAG CGCCGATTTC
GTCGACGGCG GACTGACCAA CATCGACGAC CTGACGCTGG CATGCCCGTG CGATCACCGC
ATGCTCGACA ACACCGGCTG GCGCACCCGC AAGAACGGCA AGAATCAGAC CGAATGGCTC
CCGCCACCGG ACCTCGACAC AGGCCAACAC CGCGTCAACG GCCACCACCA CCCCGAAAGA
TACCTACTCC CCGAAGACGA CCTCCCCGAA GACGACCAAG GACCCTGA
 
Protein sequence
MGCSREDVLG AFDALDAVVE SILALDYDAL SAAERVRLEA RLERNLRRIP TVEHELLASV 
IAETEPARLG EGSWKKVLTT ALRISGAEAG RRLKRAKTLG PRRGLTGTPL PPLWESTAAA
QAQGLLSEEH VAVIAAFHKR LPAWVDIETR AEADRQLAHA GSGLDPEGLD EAAGVLLAMI
NPDGAQPCDK ERARKRGIRI SKQHPDGTAT ISGTLTPEAL AIWQAIFAKE AAPGANNPES
EHTEDSTSGG AADDASDAPG DHAGAASSAA SGASDAAEHD PQPERCGSDT RTQAQRNHDA
FLAVGRRLLE SGELGTHNGL PVTVIVSTTL QELEKGAGVA VTGGGSLLPM PDLIRLAARA
HHYLYVYDQH SGKSLYLGRA KRLANAAQRI VLHARDRGCT RPGCTAPGYW CQAHHASADF
VDGGLTNIDD LTLACPCDHR MLDNTGWRTR KNGKNQTEWL PPPDLDTGQH RVNGHHHPER
YLLPEDDLPE DDQGP