Gene Mvan_5271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5271 
Symbol 
ID4644737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5645971 
End bp5646885 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content65% 
IMG OID639808746 
Productectoine hydroxylase 
Protein accessionYP_956048 
Protein GI120406219 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG5285] Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin 
TIGRFAM ID[TIGR02408] ectoine hydroxylase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.442083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTTCCC GGCATCTTTC GCAGAGCACA GACCGGTACC CGACCCGGCT GGACACGCAG 
ATCGACCCGA TCGCACGCGA GGAGCCGACC GTGTGGGGCT CGGCGGCCAA CGGTCCGCTG
GCCGCGGCCG ATCTCGACTC CATGGCGAGC AATGGATACA TGGTGCGGCG CGACGCGGTG
GAGGCAGCTT GGCTGCCGGC CCTGACCGCA GAACTCGACC GTGTCGCGGC GACGGCGGAC
CCGCGGGACC CGCGGATCAT CCGCGAGCCG GGCGGCAGCA TCCGGTCGGT GTTTCAGCCG
CACCTCTTCA GCGATCTGAT CAGCGAGGTC GTCACGCTGG ACACCGTGCT TCCCGTCGCC
CGTCAGTTGC TGGGCAGTGA CGTCTATCTG CACCAGGCGC GCATCAACAT GATGCCCGGC
TTCACCGGGA CCGGCTTCTA CTGGCATTCC GATTTCGAGA CCTGGCATGC CGAGGACGGC
ATGCCCGAGA TGCGTGCGGT GTCATGTTCC ATCGCCTTGA CTCAGAACTT CCCGTACAAC
GGTTCGTTGA TGGTGATGCC CGGGTCGCAT CGGGTCTTCT ATCCCTGTGT CGGGGCCACG
CCCCGCGACA ATCACGCGTC ATCGTTGGTC AAGCAGGAGA TCGGCGTCCC GAGTCATGCC
ACCTTGACCG CAGCTGCCGA TCGGCACGGT ATCGACCAGG TCACCGGTCC CGCCGGAACC
GCGCTGTGGT TCGACTGCAA CATCATGCAC GGGTCGGGCT CGAACATCAC GCCGTACCCG
AGGTCGAACA TCTTCCTGGT ATTCAACTCG GTGGAGAACC GGCTGCTCGC GCCGTATCGG
GCGGAGACGC CACGTCCGGA GTACCTGGCG GCCAGGACCA GCCAACCGTT CAGCGCCACC
GCCCTCACCA CTTGA
 
Protein sequence
MSSRHLSQST DRYPTRLDTQ IDPIAREEPT VWGSAANGPL AAADLDSMAS NGYMVRRDAV 
EAAWLPALTA ELDRVAATAD PRDPRIIREP GGSIRSVFQP HLFSDLISEV VTLDTVLPVA
RQLLGSDVYL HQARINMMPG FTGTGFYWHS DFETWHAEDG MPEMRAVSCS IALTQNFPYN
GSLMVMPGSH RVFYPCVGAT PRDNHASSLV KQEIGVPSHA TLTAAADRHG IDQVTGPAGT
ALWFDCNIMH GSGSNITPYP RSNIFLVFNS VENRLLAPYR AETPRPEYLA ARTSQPFSAT
ALTT