Gene Mvan_5687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5687 
Symbol 
ID4646208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6075957 
End bp6076964 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID639809163 
Producthypothetical protein 
Protein accessionYP_956458 
Protein GI120406629 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0196791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCGG TCATCGCGCT GCGTCAGATC GCGTACTACA AGGACCGCGC ACGGGAGGAT 
CCGCGCCGGG TGATGGCCTA CCGCAACGCC GCCGACGTGG TCGAAGCGCT CACCGACGCC
CAGCGCGAGA AGCACGGCGC CGCCAACAGC TGGCAGACGC TGCCCAAGAT CGGGCCCAAG
ACCGCCAAGG TGATCGCGCA GGCCTGGGCC GGCCACGAAC CCGCCGTCCT GGTGGAGTTG
CGCGAGGCCG CGCAGGACCT CGGCGGCGGC GACATCCGGG CCGCGCTGCG CGGTGACCTG
CACGTGCACT CCAACTGGTC CGACGGTTCG GCGCCCATCG AGGAGATGAT GCTGGCCGCC
CGCGCGATCG GGCACGAGTA CTGTGCGCTG ACCGACCACT CCCCGCGGCT ACGGATCGCC
AACGGGCTCT CCCCGGAGCG CCTGCGCGAA CAACTCGACG TCATCGACGA GATCCGCGAA
AAGGTCGCTC CGCTAAGGAT TCTCACCGGG ATCGAGGTCG ACATCCTGGA AGACGGCTCG
CTGGACCAGG AACCCGAACT GCTGGAGCGG CTCGATGTCG TCGTGGCCAG CGTGCACTCC
AAGCTCGCGA TGGATGCCGC CGCGATGACC CGGCGCATGC TCAAGGCCGT CACCAACCCG
CACACCGACG TCCTCGGTCA CTGCACCGGA CGGCTCGTCA CCGGCGGCCG CGGTATCCGA
CCCGAATCGA AGTTCGATGC CGAGAAGGTG TTCACCGCCT GTCGCGACGC GGGCACCGCC
GTCGAGATCA ACTCGCGTCC GGAACGCCGC GATCCGCCGA CACGTCTGCT CACGCTGGCG
ATGGACATCG GATGCGTGTT CTCGATCGAC ACCGACTCCC ACGCGCCGGG GCAGTTGGAG
TTCCTCGGCT ACGGCGCCCA ACGCGCCCTG GACGTCGGGC TGGAAGCCGA GCGCATCGTC
AACACCTGGC CCGCCGACCA ACTGCTCGCC TGGACGCGCT CCGGTTAG
 
Protein sequence
MDPVIALRQI AYYKDRARED PRRVMAYRNA ADVVEALTDA QREKHGAANS WQTLPKIGPK 
TAKVIAQAWA GHEPAVLVEL REAAQDLGGG DIRAALRGDL HVHSNWSDGS APIEEMMLAA
RAIGHEYCAL TDHSPRLRIA NGLSPERLRE QLDVIDEIRE KVAPLRILTG IEVDILEDGS
LDQEPELLER LDVVVASVHS KLAMDAAAMT RRMLKAVTNP HTDVLGHCTG RLVTGGRGIR
PESKFDAEKV FTACRDAGTA VEINSRPERR DPPTRLLTLA MDIGCVFSID TDSHAPGQLE
FLGYGAQRAL DVGLEAERIV NTWPADQLLA WTRSG