Gene Mvan_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4075 
Symbol 
ID4649286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4364557 
End bp4366134 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content66% 
IMG OID639807539 
ProductHAD family hydrolase 
Protein accessionYP_954858 
Protein GI120405029 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0204] 1-acyl-sn-glycerol-3-phosphate acyltransferase
[COG0560] Phosphoserine phosphatase 
TIGRFAM ID[TIGR00530] 1-acyl-sn-glycerol-3-phosphate acyltransferases
[TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like
[TIGR01490] HAD-superfamily subfamily IB hydrolase, TIGR01490 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.588653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGC CGGGATCGGT CGCCGAGATC CACGCCAGCC CCGAGGGCCC CGAGATCGGT 
GCGTTCTTCG ACCTCGACGG CACGCTCGTG GCCGGATTCA CCGGCGTGGT GATGACACAG
GACCGGTTGC GACGCAGGCA GATGTCGGTC GGTGAGTTCA TCGGCATGGT GCAGGCCGGG
CTCAACCATC AGCTCGGCCG CTCGGAGTTC GAGGACCTGA TCGGCAAGGG TGCGCGAATG
CTGCGCGGCA ACTCGGTGGA CGACATCGAC GAGCTCGCCG AGCGGCTCTT CGTCCAGAAG
ATCGTCGGCA GAATCTATCC GGAGATGCGC GAGATCGTGC GCGCACACAT GGCGCGCGGC
CACACCGTCG TGTTGTCCTC GTCCGCGCTG ACGGTGCAGG TGGAGCCGGT GGCCCGGTTT
CTCGGCATCA ACAACGTGCT GAGCAACAAG TTCGAGACCG ACGACGACGG GCTGATCACC
GGCGAGGTCC AGCGGCCGAT CATCTGGGGA CCTGGAAAGG CCAGGGCGGT ACAGGAATTC
GCCGCCGCCA ACGACATCGA CCTGTCGAAG AGCTACTTCT ACGCCGACGG TGACGAGGAC
GTCGCCCTGA TGTATCTGGT CGGCAATCCA CGCCCCACCA ACCCGGCGGG CAAGATGGCG
GCCGTCGCCG CCAAACGTGG CTGGCCGATC CTGCGCTTCA GCAGCCGCAG CGGAGCCAGC
CCGGCCTCGC AGGTGCGCAC CGCCGTCGGT ATCGCGACGA TGGTGCCGAT CGCCGCCGGC
GCCATCGGTG TCGGGCTGCT GACCCGCAAC AAGCGCACCG GAGTCAACTT CTTCACCTCG
ATGTTCGGCC GGACGCTGCT CAACACGGTC GGCATCAATC TCCAGGTGCT GGGCAAGGAG
AACCTGACGG CCCAGCGGCC GGCGGTGTTC ATCTTCAACC ATCGCAACCA GGCCGACCCG
CTGATCGCCG GGCGGCTGGT CAACGACAAC TTCACCTCGG TGGGCAAGAA GGAGCTGGAG
AACGACCCGA TCGTCGGCAC GATGGGCAAG ATCATGGACG CCGCCTTCAT CGACCGGGAC
GATCCGCAGA AGGCCGTCGA GGGACTGCAC AAGGTCGAAG AGCTTGCCCG CAAAGGGCTC
TCGATTCTGA TCGCGCCCGA AGGCACCCGG CTGGACACCA CCGAGGTGGG CCCTTTCAAG
AAGGGACCGT TCCGCATCGC GATGTCGGTG GGCATCCCGA TCGTGCCGAT CGTGATCCGC
AACGCCGAGG TGATCGCCGC GCGCGACTCC AGCACGTTCA ACCCGGGCAC CGTCGATGTC
GTTGTCTACC CGCCCATTCC GGTCGACGAC TGGACCCGCG AGAACCTGTC GGAGCGCATC
GACGAAGTGC GTCAGCTCTA TATCGACACG CTCAAGGACT GGCCGCACGA CGAGCTGCCC
ACGCCTGAGC TGTACAGACG TGCCAAACCG GCGAAGAAGT CGACCGCCAG GAAGACCCCG
GCCAAGAAGG CGCCGGCCAA GAAGGCAGCC GCCAAGGCGC CGACCAAGCG GGCCGGTGGC
ACGAAGGGCC GGCCGTGA
 
Protein sequence
MRLPGSVAEI HASPEGPEIG AFFDLDGTLV AGFTGVVMTQ DRLRRRQMSV GEFIGMVQAG 
LNHQLGRSEF EDLIGKGARM LRGNSVDDID ELAERLFVQK IVGRIYPEMR EIVRAHMARG
HTVVLSSSAL TVQVEPVARF LGINNVLSNK FETDDDGLIT GEVQRPIIWG PGKARAVQEF
AAANDIDLSK SYFYADGDED VALMYLVGNP RPTNPAGKMA AVAAKRGWPI LRFSSRSGAS
PASQVRTAVG IATMVPIAAG AIGVGLLTRN KRTGVNFFTS MFGRTLLNTV GINLQVLGKE
NLTAQRPAVF IFNHRNQADP LIAGRLVNDN FTSVGKKELE NDPIVGTMGK IMDAAFIDRD
DPQKAVEGLH KVEELARKGL SILIAPEGTR LDTTEVGPFK KGPFRIAMSV GIPIVPIVIR
NAEVIAARDS STFNPGTVDV VVYPPIPVDD WTRENLSERI DEVRQLYIDT LKDWPHDELP
TPELYRRAKP AKKSTARKTP AKKAPAKKAA AKAPTKRAGG TKGRP