Gene Mvan_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3803 
Symbol 
ID4645492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4050315 
End bp4051688 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content71% 
IMG OID639807268 
Producthypothetical protein 
Protein accessionYP_954590 
Protein GI120404761 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.85728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.322661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGGG CACAGACGGT GGTCGACGTC GCGCTGAGCG AGGCGCGCCG ACTGGGCAGG 
GCAGACGAGA CCATCGTGGT GGTGACCGAC CGGGTCGACA CGTCGTTGCG ATGGGCCAAC
AACACGATGA CCACCAACGG CGAATCGACC AGCCGTACGA CGACCGTCAT CTCGATCGTG
CGTGACGGTG AGGACGCCCA CGTCGGGTCG GTGCGGTCCA GCGCGGCCGC CCCGGCCGTC
ATCGCCGGTC TGGTGGCCGC CTCCCAGGCG GCGGCCGCGG CGGCGCCGGC GGCGAGCGAC
AGCGCCCCCG CGCTGCCCGG CGGCGACACC CCGGCCGACT GGGACGCCGG GATCCCCCGG
ACCGGGGTCG AGGTGTTCGG TGGTGTGGCA ACGGAATTGG CGCGCGGATT CCGCGGACGC
GACACCCTGT TCGGATTCGC GCGGCACGAG CTCGAGACGA CGTTCATCGC CACTTCCGCC
GGGTTGCGCA GACGGTTCAC CCAACCGACC GGCTCGGTGG AGATCAACGC CAAGCGCGAC
GGCGCCAGCG CCTGGAGCGG GGTCAGCACG GCCGATTTCA CGGATGTGCC GACGGATTCG
ATGCTCGAGG AGCTCGCGAC GCGGCTGTCC TGGGCGAAAC GCACCGTCGA GCTGCCCGCC
GGTCGCTACG AGACGATCAT GCCGCCGTCC ACGGTGGCCG ACATGATGAT CTACCTCATG
TGGTCGATGG GCGGCCGCGG CGCGCAGGAG GGCCGCACCG CGCTGGCGGC GCCCGGTGGT
GGCACCCGGG TGGGGGAGAA GCTGACGTCG CTGCCGTTGA CGCTGTACTC CGATCCTTCC
GCGCCGGGGC TGGAGTGCGC GCCGTTCGTG GCGGCGCCCA CGTCCTCGGA GAACTCGTCG
GTGTTCGACA ACGGCCTGCG CATCGACCGG GTGGACTGGA TCCGCGACGG CGTGATCAAC
GCGTTGTCCT ATCCGCGGGC AGCGGCCCGG GAGTTCGACG CGCCGGTGGC GCTGGCCGCC
GACAACCTGT TGATGACCGG CGGGACGGCG AGCCTGCAGG AGATGATCGC GGACACCGAG
CGGGGGCTGC TGCTCTCGAC GCTGTGGTAC ATCCGCGAGG TCGACCCGTC GGTGCTGCTG
CTGACCGGGC TCACCCGCGA CGGGGTCTAC CTGGTCGAGG ACGGCCAGGT CACGGCCGCG
GTGAACAACT TCCGGTTCAA CGAGAGCCCG CTGGATCTGC TGCGCCGCGC CACCGAAGCC
GGTGCCAGCG AGGTGACCCT GCCGCGCGAG TGGGGCGATT GGGCCACCCG CGCCAGGATG
CCGTCGCTGC GGATCCCCGA CTTTCACATG TCCTCGGTCA GCCAGGCGCA ATAA
 
Protein sequence
MIGAQTVVDV ALSEARRLGR ADETIVVVTD RVDTSLRWAN NTMTTNGEST SRTTTVISIV 
RDGEDAHVGS VRSSAAAPAV IAGLVAASQA AAAAAPAASD SAPALPGGDT PADWDAGIPR
TGVEVFGGVA TELARGFRGR DTLFGFARHE LETTFIATSA GLRRRFTQPT GSVEINAKRD
GASAWSGVST ADFTDVPTDS MLEELATRLS WAKRTVELPA GRYETIMPPS TVADMMIYLM
WSMGGRGAQE GRTALAAPGG GTRVGEKLTS LPLTLYSDPS APGLECAPFV AAPTSSENSS
VFDNGLRIDR VDWIRDGVIN ALSYPRAAAR EFDAPVALAA DNLLMTGGTA SLQEMIADTE
RGLLLSTLWY IREVDPSVLL LTGLTRDGVY LVEDGQVTAA VNNFRFNESP LDLLRRATEA
GASEVTLPRE WGDWATRARM PSLRIPDFHM SSVSQAQ