Gene Mvan_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0843 
Symbol 
ID4646176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp876978 
End bp877883 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content72% 
IMG OID639804343 
ProductHAD family hydrolase 
Protein accessionYP_951687 
Protein GI120401858 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0560] Phosphoserine phosphatase 
TIGRFAM ID[TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like
[TIGR01490] HAD-superfamily subfamily IB hydrolase, TIGR01490 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.163877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.772835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGATA CCGGGGGAAT CGAGATCGGT TCCGGCAGCC GGGCGCAGGA ACTCGCCGGT 
GAGGTCAGCG CCGAGGTCGC CGCGGAGGGG CTCGCCCAAC CGCTGGACGC CGTCGCCGCC
CCACCGCCGC CACCGCCCGA TCTGACCGCG GCCGCGTTCT TCGACGTCGA CAACACGCTG
GTGCACGGGT CGTCGCTGGT GCACTTCGCC AGGGGCCTGG CCGCCCGCGA GTACTTCACC
TATCAGGACC TGGCCCGCTT CGCCTTGGCG CAGGCCAAGT TCCAGCTGAC CGGCCGGGAG
AACAGCGGCG ACGTCGCCGC GGGCCGGCGC AAGGCGCTGG CGTTCATCGA GGGACGGTCG
ACGGCCGAGC TGGTCGCGCT CGGCGAGGAA ATCTACGACG AGATCATCGC CGACAAGATC
TGGCCCGGCA CCAGGGCGCT GGCGCAGATG CACCTCGACG CCGGCCAACA GGTGTGGCTG
GTCACCGCGA CGCCCTACGA GCTGGCCGAC ACCATCGCCC GCCGGCTGGG TCTGACCGGC
GCGCTGGGAA CCGTGGCCGA GTCGATCGAC GGGGTCTTCA CCGGCAGGCT CGTCGGCGAC
ATCCTGCACG GCACCGGCAA GGCGCACGCG GTGCGGTCGC TGGCGATCCG CGAAGGACTG
AACCTGCGCC GCTGCACCGC CTACTCGGAC AGCTTCAACG ACGTGCCGAT GCTGTCGCTG
GTCGGCACCG CGGTGGCGAT CAACCCGGAC GCCGACCTGC GCGACCTGGC CCGGGAGCGG
GGCTGGGAGA TCCGCGATTT CCGCACCGCC CGCAAGGCGG CCCGGATCGG GGTGCCTTCG
GCGCTCGCGC TGGGAGCGGT CGGCGGCGCG CTGGCGGCTG CGGTGTCCCG CCGGGAGAAG
AAGTAG
 
Protein sequence
MSDTGGIEIG SGSRAQELAG EVSAEVAAEG LAQPLDAVAA PPPPPPDLTA AAFFDVDNTL 
VHGSSLVHFA RGLAAREYFT YQDLARFALA QAKFQLTGRE NSGDVAAGRR KALAFIEGRS
TAELVALGEE IYDEIIADKI WPGTRALAQM HLDAGQQVWL VTATPYELAD TIARRLGLTG
ALGTVAESID GVFTGRLVGD ILHGTGKAHA VRSLAIREGL NLRRCTAYSD SFNDVPMLSL
VGTAVAINPD ADLRDLARER GWEIRDFRTA RKAARIGVPS ALALGAVGGA LAAAVSRREK
K