Gene Mvan_4034 details


Gene Information

Locus tag: Mvan_4034
Symbol:
ID: 4643463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4315909
End bp: 4317219
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 639807496
Product: glycosyl transferase family protein
Protein accession: YP_954817
Protein GI: 120404988
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
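
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4315909, 4317219)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

Under these assumptions the results match the reported 1311 bp and 436 aa.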


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.468428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAGA TCCTCATCGC GACATGCCCA CCCGTCGGAC ACCTCTCGCC GCTGCTGAAC 
GTCGCCCGCG GGCTCGTCGC CCGCGGCGAC CGCGTCACCG TCCTCACCAG CGCCCGCCAC
GCCGACAAGA TCAGAGCGGT CGGCGCCGAG CCCCGGCCGC TGCCCCACGC CGCCGACTAC
GACGACTCCA CGCTCGACGC CGACCTGCCG GGCCGCGCCG CGACAGCAGG CCTGGCACGG
ATCAACTTCG ACATCGAGAG CATCTTCGTA CGCCCGCTAC CGCACCAGTT CAGCGCGCTA
CAGGAACTGC TGGTGACCGA CCGCTTCGAC GCCGTCATCG CCGACGCGCT CTTTCTCGGT
ACCCTGCCGC TGTTACTCGG CGACCCAGCC AAACGTCCGC CGATCCTGTC CTACTCCACC
ACACCGCTGT TCCTCACCAG CCGGGACACC GCCCCGGCCG GGCCGGGGAT CGCACCGATG
CCCGGGATGA TCGGCAGGTT GCGCAATCGC GTGCTGGCAC GTGCCACCCA GTCGGTGTTG
CTGCGCCCGG GCCAGCGCGC GGCGGACCGG ATGCTCGAAG CGATGGGCCT GCCGGAGCTA
CCGGTGTTCA TCCTGGACTC CGCGGTCCTC GCGGACCGGG TGATCGTCCC GACGGTCCCC
GAGTTCGAGT ACCGTCGCAG CGATCTGCCC TCCCATGTCC GATTCGTCGG CCCGGTCAGT
CCGCTGCCCG GCAGCGACTT CGTCGCCCCG CCGTGGTGGG GTGAGCTCAA CGCCGGCAGG
CCCGTCGTGC ACGTCACACA GGGCACCATC GACAACGCCG ACCTCACCCG GCTGATCGAA
CCGACCATCG AGGCACTGGC GGACGAAGAC GTCACGGTCG TAGCCACCAC CGGGGGACGT
CCGATCTCTC AGATTCGAAT TCCTCTACCC GCCAACACCT TTGTCGCCAA GTACGTGCCA
CACGACGTGC TGCTGCCGAT GGTCGACGTG ATGGTCACCA ACGGTGGTTA CGGCGCCGTC
CAGCGCGCGC TGTCCGACGG GGTGCCGGTG GTGGTCGCCG GGCATACCGA GGACAAGCCC
GAGGTGGCCG CGCGGGTCCG GCATTTCGGC GTGGGAATCG ATCTGCGCAC CGGCACACCG
ACTCCGACCC AGGTGCGCCG CGCTGTGCGG AAGGTGCTGC ACGAACCGGG ATTCCGCACC
AAGGCCGGGT GGCTGCGCAG CGCCTACGCC GCCTATGACA GCGTCGCCGA GATCGCGAAG
CTCGTCGACG AAGCAGTCGC GCAGCGGACG CAACCCCTCC AGTCGGCCTG A
 
Protein sequence
MPEILIATCP PVGHLSPLLN VARGLVARGD RVTVLTSARH ADKIRAVGAE PRPLPHAADY 
DDSTLDADLP GRAATAGLAR INFDIESIFV RPLPHQFSAL QELLVTDRFD AVIADALFLG
TLPLLLGDPA KRPPILSYST TPLFLTSRDT APAGPGIAPM PGMIGRLRNR VLARATQSVL
LRPGQRAADR MLEAMGLPEL PVFILDSAVL ADRVIVPTVP EFEYRRSDLP SHVRFVGPVS
PLPGSDFVAP PWWGELNAGR PVVHVTQGTI DNADLTRLIE PTIEALADED VTVVATTGGR
PISQIRIPLP ANTFVAKYVP HDVLLPMVDV MVTNGGYGAV QRALSDGVPV VVAGHTEDKP
EVAARVRHFG VGIDLRTGTP TPTQVRRAVR KVLHEPGFRT KAGWLRSAYA AYDSVAEIAK
LVDEAVAQRT QPLQSA