Gene Mvan_5249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5249 
Symbol 
ID4645264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5623536 
End bp5625551 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content67% 
IMG OID639808724 
ProductO-antigen polymerase 
Protein accessionYP_956026 
Protein GI120406197 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.566036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.80633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCT CCACCCGCGT CAGCCAGGGT GAGCTCGCGG GACTCGTTGT CGCGGTGGGC 
TTCGTCAGCA TCGGGTCGCT TCTCGTTTTC GGATCGCGCA GCACGCTGCT TCTGATCGCC
GCGCCGATCG GGGCGATCGC GTTGTTTGTC GTCGCCCGAC GCCCGGTGCT GGCGCTGAGC
ATCATGGTGG TCATCGAGTT CGCCAATCTG TCCGGCCTTC TCGCGCCGAA GACCGGCCTC
CCCATCTTTC CGGCGTCGCT GTTGCTGGGG CTGATCGCCG TCGGATTCGC GCTGCGCGAT
CCACAGTGTC GGGGCCGGAT CAACGGCTGG ACAGCGGTCT GTGCGGGATT GCTCGCCGTC
TATCTGGCCA CCCAGGCGGT CGCGGGGATC GGAAGCGTCG ATACCGCAGA ATCCGTTGCG
ATCCTGCAGC GCCACGTCAT CGATTGCGTT TTCGTCATGG TGGTGCTGGT GCTGGTCCAG
GTCACGGCAC GACCCTGGAC GGTGGCCGCC GTCATCGTCG TGACGTTGGC AGCGTTGTGC
ACGCTCACCG TGATCAGCCA GGTTGTCTAC GGTGGTGCCG CGACGTTCGG CGGGTTGTCC
ACGGTGACAA CGGCATCTGG AGAGATGGTC ACCACGTTGC GCTACGGCGG TCCGCTTCCG
GACTCGAACT TCTGGGGTCG GCACCTCGTC ATGGGGTTGC CGATGGCCGC TGCGCTGATG
ACGCGCGCCC TCCGGTCCGC CCGCCGGGCG ACGGCAGCGC CCTGGGCGAT TGCACTTGCC
CTGCTGCTCT GCGGCATCTA CCTGACACAA TCTCGTGGCA CGTTCCTCGC CGCGGGTGTC
GCGATCGTGG TGTGGTTCGT GGCCGTCGAC CGCGCAGTCC GGCGGTGGGC GCTGATCCTA
ATACCGCTCG GCGTCGCCGT CTTCGCTGTC CCGGGCGTGG GCAACAGGAT GGTGGCCGCA
TTCGAGGACT TCACCCACGC ACAGGTGCAG ACCGACATCG ACCCTTCGGT CGTGCAGCGG
ATCTCTGCCC AGCAGCAGGC GTGGCTGATG TTCAACGAAC GCCCGACTTT CGGATTCGGC
CCTGCGACCT TCCCCGGCCA GGTGATCAAC TTCGCCGACC GCACCGACAT TGCTCCTCGC
GATCCGACCA ACGCGCCCCA CAATCTCTAC GCCGAACTCG CCGCGGAGTC GGGATGGGTC
GGCCTGCTCG GCTGGATGGT GGTGATCCTC GGGTTCCTGA CGATCACAGT CTTGGGCAAC
CTCGCGAATC CCTTTCGGCG CGACCGTGTC CTGGCCGCAG CGGTATGCGC GGCGATCGTC
GCGTGGTCGG TGGCCAGCAT CGGGCTGCAC ATGGCCTACT TCCGGACGTT CGGGGTGGTG
CTCGCCCTCG TCGGTGCACT TGCGCCGATG TGGCCGGTGC CGGCCGAACC CGTGCGCAGG
CTGGTCCACG GTGTGATCAC TTGGGGCACC GCGGGTCTGC TCGGGTTCGC CGCATTTTGG
ATCTTCCTGT CAGCGAACAG CTCTGCGGCC GTGACCGCCC GTCAGCCGAT GACGCTGGTC
CCCGCGGGGC CCGTCGACGG GTGGTACGCC TACGCCCTTG ACATCCGCTC CCGCGCCGAA
CTACTGCCCA CTGTCGCACG TCTGTTGGAA GACCCTCGGT CGCCGGTCGA CATCATTGCC
GACCCGGCGC GCGGCGTGCT GATATTCACG GCCACCGCGG ACAACGTCGA CCGGGCGCGC
ACCGATATCC AACGCGCAGC CGCGCATGCG GAAGCGGCCC TGCACAGCGC AATCGGCTAT
CAGCAGTACT CGCTGCAGAC TGTCGGCAGC ATGCGCGCCC AGACCACGCG GCAACCCTCA
CCAGGCACGC TGATCGTCGC CATCGGCGTG GGGGTGGGCA CCATGCTCAT CGTCCGCGCG
GTGTGGCTGC GGGCGGCAAC CCGGCGGCGC ACCGGCGCCG TTGACGATCG ACCCACGACC
CGGGAGGCGA CATCGGTACC CGCGGCAAGC CCATAG
 
Protein sequence
MTGSTRVSQG ELAGLVVAVG FVSIGSLLVF GSRSTLLLIA APIGAIALFV VARRPVLALS 
IMVVIEFANL SGLLAPKTGL PIFPASLLLG LIAVGFALRD PQCRGRINGW TAVCAGLLAV
YLATQAVAGI GSVDTAESVA ILQRHVIDCV FVMVVLVLVQ VTARPWTVAA VIVVTLAALC
TLTVISQVVY GGAATFGGLS TVTTASGEMV TTLRYGGPLP DSNFWGRHLV MGLPMAAALM
TRALRSARRA TAAPWAIALA LLLCGIYLTQ SRGTFLAAGV AIVVWFVAVD RAVRRWALIL
IPLGVAVFAV PGVGNRMVAA FEDFTHAQVQ TDIDPSVVQR ISAQQQAWLM FNERPTFGFG
PATFPGQVIN FADRTDIAPR DPTNAPHNLY AELAAESGWV GLLGWMVVIL GFLTITVLGN
LANPFRRDRV LAAAVCAAIV AWSVASIGLH MAYFRTFGVV LALVGALAPM WPVPAEPVRR
LVHGVITWGT AGLLGFAAFW IFLSANSSAA VTARQPMTLV PAGPVDGWYA YALDIRSRAE
LLPTVARLLE DPRSPVDIIA DPARGVLIFT ATADNVDRAR TDIQRAAAHA EAALHSAIGY
QQYSLQTVGS MRAQTTRQPS PGTLIVAIGV GVGTMLIVRA VWLRAATRRR TGAVDDRPTT
REATSVPAAS P