Gene Information |
Locus tag | Mvan_0211 |
Symbol | |
ID | 4647724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 222656 |
End bp | 224527 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639803721 |
Product | glycosyl transferase family protein |
Protein accession | YP_951067 |
Protein GI | 120401238 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis; [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
| |
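The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions about this record's conventions, not something the page states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 222656
END_BP = 224527


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1872, matching "Gene Length"
print(protein_length(length_bp))  # 623, matching "Protein Length"
```

Under these assumptions the 1872 bp span gives 624 codons, one of which is the stop codon, which agrees with the 623 aa reported for the protein.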
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGA CAGACGCCCC GGGAGCCATC GCCGCCGGTC CGGCCGTCAT GCAGACACCG AAGGTGTCGA TATGTATCCC CGCCCACCAG GCCGCTGCGT ACCTGCAACC CCTGCTCGAC AGCGTGCTGT CCCAGGCCTA CGACGACTTC GAGGTGGTCG TCATCGACAA CCACAGCACC GACGGCACTT CCGACATCCT GGCGCGCGTC GACGATCCGC GTGTTCGGGT CATGCGGAAT CCGGCCACCC TGCCGTTCGT CGAGAACTGG AACCTCCTGG TGTCACAGTC CCGGGGCGAG TTCGTCAAAC TTGTCTGCGC CGACGATCTG CTCAAGCCCG GCTGCCTTGC GGTGCAGGCC TCGGTCCTCG ACAACAATCC CGATGTCGCC CTGGTGTCGG TGAAATGCGA CTTCATCGAC GACAACGAGC GCTTGATCGT GCCCGCCCGG GGACTCGACG GCATCGAGGG ACAGGTCACC GCCGAAGGCG TGGTCAGGCG GATCGTGCGC AATGGAGGCA ATCCGATCGG AGCACCGGTG GCGGGCATGT TCCGGCGCGC CGACTTCGAC CGGGTCGGTG GGTTCACCGC CGACTTCCCC TTCCTGAGTG ACATACATCT GTGGGTGCGG CTGTTGGGCT GTGGCGACTT CTACGGCATA CCGGCGACAC ACGCCTCGTT CCGGATCCGC GGTGGCTCCA TGAGCGGCCT GACCTCGGCG CGGACCCAAC TTGCCCAGTC GCTCGACTTC GAGAAGTCGC TTGCCCGCGA TCCACGCTGG GACCTGTCCC AAATCGACCT TTTCCGCGGC TGGATGCGTT GCCACGAACA GACTTTGCGC CGGATGGCAC TGTTCGGTCT GACCAAGTGG CGTGTTGCGC GACGTGACCG CGGGCCGGTC CGGGCCGGCC CGCGAGCCGG CACCGATCTG CCGTCGACCG TCGTGGCCGA CACCCTGACC GTGGTGATCT GCGCCTACAC CACGCAACGG TGGGATGAGC TCTGCCCTGC AGTGGAATCA GTTCTGAATC AGGACTTCCC GGTACTCGGC GTCGTCGTGG TAATCGATCA CTGCCCGGAG CTGTACCGGC TCGCCCGGGA CCGATTCGGT GCCCGAGGAC GAGTCACGGT GCTCGAAAGT GACGGGGAGC GTGGACTTTC GGGTGCCAGG AACACGGGGG TGGGCGCGGC GCGCGGCGAC GTCGTCGCGT TCCTCGACGA CGACGCAGTC GCCGAGCCCG GTTGGGCGCA TGCCCTGATG CGCCACTATC GCGATCCGCG GGTCGCTGCC GTCGGCGGCT ATGCCGCCCC GGTGTGGCCC ACGGGCGCCC GCCCGCACTG GATGCCTGCC GAGTTCGACT GGGTGGTCGG GTGCAGTTAC ACCGGGCAGC CGACCGAGCT GGCGGAGGTG CGGAACCCCC TTGGCTGCAA TATGTCGATC CGCCGCTCGG TGTTCGACGA CATCGGCGGG TTCAGGTCCG AGGTGGGCCG GGTCGGCAAC CACCCGGTCG GCGGAGAGGA GACGGAGCTG TGCCTCCGCA TCCGTGGCCG CCAACCGGAT GCACGGGTGC TGTACGACCC GGACGCCGTT GTCCGTCATC ATGTCTCGTG CGATCGGACG ACGATCCGCT ATTTCCGGCG GCGGTGCTAC CACGAGGGGA TTTCGAAGGC TGTCGTCACC GAGATCGCAG GCGTCGGCAA CCCGCTGCTC GCGGAGCGGG CCTACACGAC GCGGACCCTC CCGCGCGGCG TCCTGCGGGA GCTCACAGCG CCGAGGCAGG GCGGGTTCCG GCGCGCGGGA 
GTCATGGCTT TCGGGCTCGC AGCGACGACG GCCGGCTATC TGCGCGCCAA GACCCAGTAC CGGCTCTCAT GA
|
Protein sequence | MTTTDAPGAI AAGPAVMQTP KVSICIPAHQ AAAYLQPLLD SVLSQAYDDF EVVVIDNHST DGTSDILARV DDPRVRVMRN PATLPFVENW NLLVSQSRGE FVKLVCADDL LKPGCLAVQA SVLDNNPDVA LVSVKCDFID DNERLIVPAR GLDGIEGQVT AEGVVRRIVR NGGNPIGAPV AGMFRRADFD RVGGFTADFP FLSDIHLWVR LLGCGDFYGI PATHASFRIR GGSMSGLTSA RTQLAQSLDF EKSLARDPRW DLSQIDLFRG WMRCHEQTLR RMALFGLTKW RVARRDRGPV RAGPRAGTDL PSTVVADTLT VVICAYTTQR WDELCPAVES VLNQDFPVLG VVVVIDHCPE LYRLARDRFG ARGRVTVLES DGERGLSGAR NTGVGAARGD VVAFLDDDAV AEPGWAHALM RHYRDPRVAA VGGYAAPVWP TGARPHWMPA EFDWVVGCSY TGQPTELAEV RNPLGCNMSI RRSVFDDIGG FRSEVGRVGN HPVGGEETEL CLRIRGRQPD ARVLYDPDAV VRHHVSCDRT TIRYFRRRCY HEGISKAVVT EIAGVGNPLL AERAYTTRTL PRGVLRELTA PRQGGFRRAG VMAFGLAATT AGYLRAKTQY RLS
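The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a blocked sequence could be cleaned up, checked for GC content, and translated. It assumes the gene sequence is shown in coding orientation (it starts with ATG even though the strand is "-"), and it uses the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional TCAG order. Translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGACCACGA CAGACGCCCC GGGAGCCATC"
print(translate(first_blocks))  # MTTTDAPGAI
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.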