Gene Mvan_5019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5019 
Symbol 
ID4644629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5368579 
End bp5370129 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content70% 
IMG OID639808490 
Producthypothetical protein 
Protein accessionYP_955797 
Protein GI120405968 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.579081 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAGG GACGCGTCGA CACCGTCATT GTCGGGGGCG GACACAACGG CCTGGTGGCC 
GCCGCGTACC TGGCCAAGGC CGGCCGCAGT GTCCGGGTTC TCGAGCGTCT CGGCCATGTC
GGCGGCGCGG CGGTTTCGCT GTATCCGTTC GACGGGATCG ACGCCCGACT CTCCCGCTAC
TCGTATCTGG TCAGCCTGCT GCCGCGGCGC ATCGTCGACG ACCTCGGCGC GCGGATCAGG
CTGGCGCGGC GGCGGTACTC GTCCTACACG CCCGACCCGT CCGACGGCGG CCGCAGCGGC
CTGCTGATCG GACCGGAATC GACGTTCGGG GCCGTCGGCG CCGCCGCAGA CGAGGCCGGT
TTCGGCGAGT TCTACCGGCA GTGCGCAACG GTCACCTCCG CGCTGTGGCC GACGCTGACC
GAGCCGCTGC GCACCCGGTC GGAGGCGCAC CGCGCAGTGG TCGGCGCCGG AGGCCGCGAC
GCGTGGCAGT GGATGATCGA GCGTCCGCTG GCCGAGGCGA TCACCTCGGC CGTGGGCGAC
GATCTCGTCC GCGGTGTGAT GGCGACAGAC GCGTTGATCG GAACGTTCGC CCGGCTCGAC
GAGGAATCCC TGGTTCAGAA CCGGTGCTTC CTCTACCACC TGATCGGCGG CGGCACCGGC
GACTGGGACG TGCCCATCGG CGGGATGGGG GCGGTCAGCG AGGCGATGGC GGCCGCCGCC
ACCGGCTTCG GCGCCGAGAT CGTCACCGGC GCAGACGTTT ACGCGATCGA CCCGGAGGGC
CAGGTGCGCT ACCGGTGTGC CGACGACGAA CATGTCGTGG CCGCCCGGCA CATCCTCGCC
AACGTGACGC CCGCGGTGCT GGCCCGCCTG CTCGGGGAGG CCGAACCCGA ACTCGCCCCC
GGCGCGCAGG TCAAGGTCAA CCTCATGCTC ACCCGGTTGC CGCGACTGCG CGACGAAAGT
GTCACCCCGG AACAGGCTTT CGGCGGCACC TTTCACATCA ACGAGACCTA CCGTCAACTC
GACGCCGCCT ACCGGCAGGC GGCCGCCGGT TCGGTGCCCG ATCCGCTGCC GTGCGAGATC
TACTGCCACA CCCTGGCCGA CCCGACGATC CTGTCCGATT CGTTGCGGGC CGCGGGCGCA
CACACGCTGA CGGTGTTCGG CCTGCACACC CCGCACAGCC TGGCCGACTC GGCCACCCCT
GACCGGCTGC GCGACACTCT GACCTCGGCG GTGCTGACCT CGTTGAACTC TGTTCTGGCC
GAGCCGATTC AGGACGTCAT CATGCAGGAC TCGTCGGGCC GGTTGTGCAT CGAGGCCAAG
ACGACGTCGG ACCTCGACGA TGCCCTGGGT ATGACCGACG GCAACATCTT CCACGGCGCG
CTGTCGTGGC CGTTCGTCGA GGACGACGAG CCGCTCGCGA CGCCGGCCCA GCGATGGGGT
GTCGCCACCG CGCACGATCG AATTCTGTTG TGCGGGTCCG GTTCCCGTCG GGGCGGAGCC
GTGTCGGGCA TTGGTGGCCA CAACGCGGCG ATGGCGGTGC TGGAGCGCTA A
 
Protein sequence
MSEGRVDTVI VGGGHNGLVA AAYLAKAGRS VRVLERLGHV GGAAVSLYPF DGIDARLSRY 
SYLVSLLPRR IVDDLGARIR LARRRYSSYT PDPSDGGRSG LLIGPESTFG AVGAAADEAG
FGEFYRQCAT VTSALWPTLT EPLRTRSEAH RAVVGAGGRD AWQWMIERPL AEAITSAVGD
DLVRGVMATD ALIGTFARLD EESLVQNRCF LYHLIGGGTG DWDVPIGGMG AVSEAMAAAA
TGFGAEIVTG ADVYAIDPEG QVRYRCADDE HVVAARHILA NVTPAVLARL LGEAEPELAP
GAQVKVNLML TRLPRLRDES VTPEQAFGGT FHINETYRQL DAAYRQAAAG SVPDPLPCEI
YCHTLADPTI LSDSLRAAGA HTLTVFGLHT PHSLADSATP DRLRDTLTSA VLTSLNSVLA
EPIQDVIMQD SSGRLCIEAK TTSDLDDALG MTDGNIFHGA LSWPFVEDDE PLATPAQRWG
VATAHDRILL CGSGSRRGGA VSGIGGHNAA MAVLER