Gene Mvan_5141 details


Gene Information

Locus tag: Mvan_5141
Symbol:
ID: 4647501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5502146
End bp: 5504062
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 68%
IMG OID: 639808615
Product: glycosyl transferase family protein
Protein accession: YP_955918
Protein GI: 120406089
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
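
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5502146
end_bp = 5504062

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1917 bp) and Protein Length (638 aa).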


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATC CCGGCCCGCG GGGGGTCGAG GAGCACTACC GCTCGATCGA CAGTGCGCCC 
CCGGAATACT CCGTCAACCG CCCGCCGAGC GCGGTCAACG GCGCGCTGAT GAACTTCTTC
TCACTCAGCG TGGTGGCACT GCTCGCCGCA ACCGAAATCG TGCGCCGCAT TGACCCCTAC
GCCCCCCGCG ACTTGGTTTT CATCCGTGCC GAGGGCCAGA GCGCCGCGAT CCCGGTGCGG
GTGTTCCTTC TGTTGATCGC CTCCGCGGTG GCGCTCTCCC TGGCCACCAA TTGGTGGCGG
CGGCTGGCGG TGGGTGGGGA AATGGTCGGC AAGGGCCTGC TGATCTGCCT GGTGGTGGAC
CTGTCGGCAT ACATCGGTCA CTCCCTGGGG CTGTTCGAAG CGGAGGTCGT CGGTCAGCAG
CTGGCGTCCA GTCTGGCGTC GCTGGTCGTG TTCCCGTTCG TGATCCTGCG GCATGCCCGG
CTGCCCAAAC CGGTCGATCT GCCGCCGGTC GGGCGCATCC GCTGGCATGC CTGGGTGCGC
CTGATCGTGC CGTTGGTGGT GGCGTTCGTG CTGGCTGCGT GGATCGAGCA GCGCACCCCG
CTCACGACCG AGTTCATGCG GGAATGGGCG CTGCTGGGCG GGGTCGGTCC TGGCATCTTC
CTGGTGCAGC AGCTGTTCGT GATCATCACC GCGAGCATCG GGTTACTGAT GGTCCGCTGG
TCACGCCGGG CGCGCTTCGC GCCCCCGCTG GCCGTGATGG TCCCCGCCCA CAACGAGGCC
CACGACATCG CCGCCGCCAT CGAGGCCGTC GACCGGGCGG CTGCCCGGTA CGCCGCTCCT
GTGCACCTCT ACGTGATCGA CAATGCCTCC ACCGACGCCA CCACGGCCGC CGCCGAGGCC
GCTCTGGCCG CCTGCGCGCA CTGCACCGGC GAGGTGCGGC CATGCGCGCA GCCCGGGAAG
GCGGTCGCGC TGAACTACGC GATATCGATC ATCCGTGAGG AGTTCGTGGT CCGCATCGAC
GCCGACACGG TGATCGGTGA GAACTGCCTC GATGTCACGC TGCGCCACTT CGCCAACCCC
AAGGTCGGCG CCGTCGGTGG CATGCCGCGG CCCCCGCGGG TCCGGACATT CTTCGACCGG
GCCCGGATGG TGGAGGTCCT GCTCAAGCAC GGCTTCTTCC AGGTCTCCAT GATGGGGTTC
GACGGGATCC TCGGCGAACC GGGCATGTTC GTGGTGTACC GGCGCCGCGC AGTCCTCGAG
GCCGGGGGGA TCGTCGAGGG CATGAACGGC GAGGACACCG ACATCTGCCT GAGGATGAGC
AGTCAGGGCT ATCTGAACAT GGCGGAACCG ACCGCCGTGT ATCTCAGCGA GGTGCCGCAG
ACCTGGGCGC ACCTGCGCGA ACAGCGAATC CGGTGGTTCC GCAGCATCTA TCACGTCACC
GCCCACAACC GGCGGGCATT GCTCGACCGC AGCTCGATGG CCGGTGTGGT GGTACTGCCC
TTCCAGCTGG CCAACGCGGC ACGGCGGGCG ATGATGCTCC CGCTGCTGCT GTTCGGCCTG
CTGATCTTCG GCCTGTTCCA GAAGACCTAT CCGGGGTTGA GCGAGCCGAG GCTGTGGGCG
GTGTTCCTCG GTCTGCCGAT GCTCGTCGCA ATCGCGGTAT GCCTGTCGCG GCAGCCACGC
GCGGTGCTCT ACATCCCGGA GTATCTGGTG TTCCGGGTGG TGCGCAGCTA CTTCACCCTG
GCCGCGGTGC TGAGCCTGAA CTTCCCGCCG CTGCATCCGC GCCTGCGGCG GCGCGCCTCC
GGCGAGAAAC CGGTTGCAAC GCCACGCCAG GCGCCGGCGA AGCGTTCGTC GCACCTGACT
CGCCAGCACA GCCGGGTGGC ACCGGCCGGC CGGCGTTCCA GCTCAGTCGA GTCGTGA
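
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(seq_text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAGCGATC CCGGCCCGCG GGGGGTCGAG GAGCACTACC GCTCGATCGA CAGTGCGCCC"
seq = clean_sequence(first_line)

print(len(seq), "bases")
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full sequence text through the same two functions should give a value comparable to the GC content field, assuming that field is computed over the whole gene.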
 
Protein sequence
MSDPGPRGVE EHYRSIDSAP PEYSVNRPPS AVNGALMNFF SLSVVALLAA TEIVRRIDPY 
APRDLVFIRA EGQSAAIPVR VFLLLIASAV ALSLATNWWR RLAVGGEMVG KGLLICLVVD
LSAYIGHSLG LFEAEVVGQQ LASSLASLVV FPFVILRHAR LPKPVDLPPV GRIRWHAWVR
LIVPLVVAFV LAAWIEQRTP LTTEFMREWA LLGGVGPGIF LVQQLFVIIT ASIGLLMVRW
SRRARFAPPL AVMVPAHNEA HDIAAAIEAV DRAAARYAAP VHLYVIDNAS TDATTAAAEA
ALAACAHCTG EVRPCAQPGK AVALNYAISI IREEFVVRID ADTVIGENCL DVTLRHFANP
KVGAVGGMPR PPRVRTFFDR ARMVEVLLKH GFFQVSMMGF DGILGEPGMF VVYRRRAVLE
AGGIVEGMNG EDTDICLRMS SQGYLNMAEP TAVYLSEVPQ TWAHLREQRI RWFRSIYHVT
AHNRRALLDR SSMAGVVVLP FQLANAARRA MMLPLLLFGL LIFGLFQKTY PGLSEPRLWA
VFLGLPMLVA IAVCLSRQPR AVLYIPEYLV FRVVRSYFTL AAVLSLNFPP LHPRLRRRAS
GEKPVATPRQ APAKRSSHLT RQHSRVAPAG RRSSSVES
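
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons, not handled here). The function name `translate` is hypothetical, and only the first 30 bases and the last line of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each of the 64 codons to its one-letter amino acid.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq_text: str) -> str:
    """Translate an in-frame DNA sequence; whitespace is ignored."""
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the final line of the gene sequence.
head = translate("ATGAGCGATC CCGGCCCGCG GGGGGTCGAG")
tail = translate(
    "CGCCAGCACA GCCGGGTGGC ACCGGCCGGC CGGCGTTCCA GCTCAGTCGA GTCGTGA"
)

print(head)  # MSDPGPRGVE
print(tail)  # RQHSRVAPAGRRSSSVES*
```

The two printed fragments match the start and end of the protein sequence above, with the trailing `*` marking the TGA stop codon.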