Gene Mvan_0468 details


Gene Information

Locus tag: Mvan_0468
Symbol:
ID: 4645574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 510393
End bp: 511478
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 62%
IMG OID: 639803976
Product: cupin 2 domain-containing protein
Protein accession: YP_951321
Protein GI: 120401492
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
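
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon, neither of which the record states. The variable names are made up for this example.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 510393
end_bp = 511478
protein_length_aa = 361

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
# Assumption: the last codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(gene_length_bp, remainder, coding_codons)  # 1086 0 361
```

Under those assumptions the computed values agree with the reported gene length (1086 bp) and protein length (361 aa).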


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCG CCGAGAGTTC AGAGCTCCGC GAATTTGACG TCGAGCTCGA GGCTGCCAAC 
TTACGTGGGC AATGGATCTA CGACGACATG CTGGAGAGCG TCGTCGGCGG GCCCAAGCCT
GCGGGTGTTC CCTTTCTGTG GCGATGGCAC GATGTTTACG CGAAGCTTCT GAAGTCGTGC
GACGTGATGC CTGAAAGTTT GACGGCGCGA CGCAATCTCT CGTTCATCAA CCCGGATGCC
CGAGGAACCA CGCACACCAT CAACATGGGT ATGCAGATGC TCAAGCCCGG CGAGATTGCC
TATGCGCACC GCCACACCAT GGCAGCGCTG CGGTTCGCTA TTCAAGGCGG CCCCGGCCTG
GTGACTGTGG TGGATGGCGA GCCTTGTCAA ATGGACACCT ACGACCTGGT TCTGACCCCT
CGCTGGACGT GGCATGACCA TGAGAACGCC ACCTCGGAGA ACGTCGTTTG GCTCGACGTG
CTGGATATCG GCCTAGTGCT CGGGCTGAAT GTGCCCTTCT ATGAGCCCTA TGGCGAGATG
CGCCAACCTC AACGCGAGGA CCCGGGAGAG CATCTCGCTG ACCGCGGTGG GATGCTGCGC
CCGGCGTGGG AGCAGGTCAA GGCGGCGAAC TTCCCGTACC GCTATCCTTG GCGTGACGTC
GAGCGGCAGC TCCAGCGGAT GGCGGGCCTT GCGGGCAGTC CCTACGACGG CGTAGTCCTG
CGTTATGCGA ACCCCGTTAC CGGCGGATCG ACTATGCCAA CGCTGGATTG CTGGGTGCAG
TTGCTGCGGC CGGGCCAGCA GACCGAGGCC CATCGCCACA CGTCGAGTGC CGTGTATTTC
GTCGTGCGCG GTGAGGGAAC TACGGTTGTC GACGGGGTCG AACTCGACTG GGGGCCCCAC
GACAGCTTCG TGGTGCCCAA CTGGAGCACC CATCACTTCG TCAACCGGTC GGCAGAAAAT
GCGTTGCTGT TCTCGGTCAA CGACATCCCT ACATTGAAGG CTCTCGATCT CTACTACGAA
GAGCCCGAGC TGTCTTTGGG GACGCAGCCA TTTCCGCCGG TCCCCGCTAA CCTCCGAGCC
CGCTGA
 
Protein sequence
MSTAESSELR EFDVELEAAN LRGQWIYDDM LESVVGGPKP AGVPFLWRWH DVYAKLLKSC 
DVMPESLTAR RNLSFINPDA RGTTHTINMG MQMLKPGEIA YAHRHTMAAL RFAIQGGPGL
VTVVDGEPCQ MDTYDLVLTP RWTWHDHENA TSENVVWLDV LDIGLVLGLN VPFYEPYGEM
RQPQREDPGE HLADRGGMLR PAWEQVKAAN FPYRYPWRDV ERQLQRMAGL AGSPYDGVVL
RYANPVTGGS TMPTLDCWVQ LLRPGQQTEA HRHTSSAVYF VVRGEGTTVV DGVELDWGPH
DSFVVPNWST HHFVNRSAEN ALLFSVNDIP TLKALDLYYE EPELSLGTQP FPPVPANLRA
R
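
The relationship between the two sequences can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: it builds a codon table from the compact NCBI-style amino acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons), then translates the first 30 bases and the last 6 bases of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Codon table in the compact NCBI ordering (bases T, C, A, G).
# The amino acid assignments shown are shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


first_30 = "ATGTCCACCG CCGAGAGTTC AGAGCTCCGC"  # start of the gene sequence
last_6 = "CGCTGA"                              # end of the gene sequence

print(translate(first_30))  # MSTAESSELR
print(translate(last_6))    # R*
```

The first ten residues match the start of the protein sequence, and the gene ends with an arginine codon followed by a TGA stop codon, consistent with the protein's final residue R.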