Gene Mvan_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4035 
Symbol 
ID4648436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4317230 
End bp4318528 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID639807497 
Productglycosyl transferase family protein 
Protein accessionYP_954818 
Protein GI120404989 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.996814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.480541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG TCCTGATCGC CGTGCTGTCA CCCGTCGGGC ACGTCGAACC GCTGTTCGCC 
GTCGCCGAGG ATCTGGTCTC CCGCGGAGAC CAGGTGACCG TGATGACCGG TGCGGCGCAT
ACCGATTCGA TCCGCGCGAT CGGGGCGCAT CCGCACCCGC TACCGCGGTA CGCGGACTTC
GACGACGCGC CGTTCGACAC CGGTCAGCGC GCCGCGACCT CCGGAATCGA CGCGCTCAGC
CGGGCCGTCA TCCGACTCTT CCTCACGCCG ATGCCCCTCC AGGCGGCCGA ACTGCGCAGG
GCGCTGGCCG ATCGGCACTA CGACGCGGTG ATCGTCGACT ATCTGTTCCT CGGCATCCTG
CCGTTGCTGC TCGACGACAC GGTCCGCACA CCGCCGGTGC TGCACTACAC GCCCACGCCC
ATGATGCTGT CCAGCCGTGA CACCGCGCCG GTTGGTCTGG GTCTGCCTCC GGCCCGCACC
GCGGCCCGCC GGGCGCTGTA CCGCGCCCTG ACGCTGCTGT CGCAGAAAGT CCTGCTGCGC
AACGCCCAAC GCACCGCCGA CCGGATGCTG ACGTCCATGG GTGTGCCACG GCTACCGGTG
CCGCTGCTCG ATGCCGGGAG GCTGGCAGAC CGGTTGATCG TGCCGACGGT GCCGAGCTTC
GAGTACCCGC GCAGTGACCT GCCAGAAGGG GTGCGGTTCG TCGGACTGGT CCGGCCCCGC
TCGGCGGACC GCTTCGTGAC CCCACCGTGG TGGGAAGTGC TCGACGCGGA CCGACCCGTC
GTGCACGTCA CGCAGGGCAC CGTCGACAAC CGCGACCTGT CACGTCTGAT CGAGCCGACG
ATCACCGCAC TGGCCGACAC CGACGTCACG GTGGTGGTCA GCACCGGCGG CCGCACCCGG
GACTCGATCC GGGTGCCCAT CCCCGCCAAC ACCCATGTGT GCGAGTACAT CCCACACGAC
CGACTGCTGC CCAAGGTCGA CGTGATGGTG ACCAACGGTG GCTACGGCGG GGTGCAGCGC
GCACTGGCCG CAGGGGTGCC GCTGGTGGTC GCAGGCAGCA CCGAGGACAA GCCGGAGGTG
GCCGCCCGGG TGGCGTGGTC GGGGGCCGGA ATCAACCTCG ACACGGGTAC ACCCTCGGCC
GACGCCATCC GGTCGGCGGT CGCACTGCTG CGCAGCGACG ACCGCTACCT TCGCAACGCC
CGCCGATTGG AGGCGGCGTT CGCCCGCCAG GACGGAATCG CCGAGATCGC CGCACTCATC
GACGAGCTCA TCGGCGTACG CCGGACAGCG GCCACATGA
 
Protein sequence
MAAVLIAVLS PVGHVEPLFA VAEDLVSRGD QVTVMTGAAH TDSIRAIGAH PHPLPRYADF 
DDAPFDTGQR AATSGIDALS RAVIRLFLTP MPLQAAELRR ALADRHYDAV IVDYLFLGIL
PLLLDDTVRT PPVLHYTPTP MMLSSRDTAP VGLGLPPART AARRALYRAL TLLSQKVLLR
NAQRTADRML TSMGVPRLPV PLLDAGRLAD RLIVPTVPSF EYPRSDLPEG VRFVGLVRPR
SADRFVTPPW WEVLDADRPV VHVTQGTVDN RDLSRLIEPT ITALADTDVT VVVSTGGRTR
DSIRVPIPAN THVCEYIPHD RLLPKVDVMV TNGGYGGVQR ALAAGVPLVV AGSTEDKPEV
AARVAWSGAG INLDTGTPSA DAIRSAVALL RSDDRYLRNA RRLEAAFARQ DGIAEIAALI
DELIGVRRTA AT