Gene Mvan_4775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4775 
Symbol 
ID4644100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5111622 
End bp5112935 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID639808244 
ProductDyp-type peroxidase family protein 
Protein accessionYP_955554 
Protein GI120405725 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCATCGC ACCCCAATAA CGAGAACGCG GAACCGGCTC AGCCCGAACG GTCCGGGCTG 
AGCCGGCGCA AGCTGTTCGG CGCGGCCGGA GTGACGGCGG CGGTCGTCGG CGCGGCGGGC
GTGGGAGCGC TGGCCGGCCG CGCTTCGGCG GCGCAACCGC CGAATCATCT CGACACGGCG
GTGCCGTTCC GCGGGCAACA TCAAGCGGGC ATCGTCACCG AGGCGCAGGA CCGCATGCAC
TTCGCCACGT TCGACGTGAC CACCAAGAGC CGCGACGACG TCATCAAGAT GCTCGCCGAC
TGGACCGAGA TGGCCGAGCG GATGACGCAG GGCGAGGAGG CGTTTCCGAA CGGCTCCATC
GGCCAGAATC CGTACTCGCC ACCGTCGGAC ACCGGCGAGG CCCTCGGCCT GCCTCCGTCT
CAGCTGACGC TGACCATCGG GTTCGGGCCG TCGTTCTTCG TCAAGGACGG AGTCGACCGG
TTCGGCATCG CCGACAAGAA ACCGGCCGAG CTGGTCGACC TGCCCAAGTT CCCCAACGAG
AAGATCGACC CGGCCCGCAG CGGCGGCGAC ATCGTCGTGC AGGCCTGCGC GAACGATCCG
CAGGTGGCCG TCCACGCCAT CCGCAACCTG GCCAGGATCG GCTTCGGCAC CGTCGCGGTA
CGGTACTCGC AGCTGGGCTT CGGCCGCACC TCGTCGACCA CGCGCGACCA GACCACTCCG
CGAAACCTGT TCGGGTTCAA GGACGGAACG GCCAACATGC GTTCCGACGA AACGGACAGG
CTGAACAAGT CGGTCTGGGT GGCCGACGGT GACGGCCCGG CGTGGCTGAC CGGTGGCACG
TATCTGGTGG CGCGGCGGAT CCGGATGCTC ATCGAGCAGT GGGACCGCAC CACGCTGCTG
GAACAGGAAC GCGTCATCGG CAGGCAGAAG GGCTCCGGCG CGCCGATGGG ACTCGACGAC
GAGTTCCAGG AACTCAATTT CGATCTGACC AACGACAAGG ACGAGCCGCT GATCGACCCG
GTGGCCCACG TCCGGCTCGC GTCGTCGAAG AACCTCGGCG GCATCGAGAT CCTGCGTCGC
GGTTACAACT TCACCGACGG CTCGGACGGC TTCGGCCATC TGGATGCGGG GCTGTTCTTC
ATCGCGTTCG TGCGCAACCC GGTGACCCAG TTCGTGCCGA TGCAGAACGC GATCTCCCGC
AACGACGCGA TGAACGAATA TGTCCGACCC ACCAGCTCAG CGGTGTTCGC TTGTCCGCCA
GGAATTCCCG AGGGAGACAA GTCCACGTTC TGGGGGTCGA CGCTGTTCGA CTGA
 
Protein sequence
MSSHPNNENA EPAQPERSGL SRRKLFGAAG VTAAVVGAAG VGALAGRASA AQPPNHLDTA 
VPFRGQHQAG IVTEAQDRMH FATFDVTTKS RDDVIKMLAD WTEMAERMTQ GEEAFPNGSI
GQNPYSPPSD TGEALGLPPS QLTLTIGFGP SFFVKDGVDR FGIADKKPAE LVDLPKFPNE
KIDPARSGGD IVVQACANDP QVAVHAIRNL ARIGFGTVAV RYSQLGFGRT SSTTRDQTTP
RNLFGFKDGT ANMRSDETDR LNKSVWVADG DGPAWLTGGT YLVARRIRML IEQWDRTTLL
EQERVIGRQK GSGAPMGLDD EFQELNFDLT NDKDEPLIDP VAHVRLASSK NLGGIEILRR
GYNFTDGSDG FGHLDAGLFF IAFVRNPVTQ FVPMQNAISR NDAMNEYVRP TSSAVFACPP
GIPEGDKSTF WGSTLFD