Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3868 |
Symbol | |
ID | 4649185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 4136643 |
End bp | 4138226 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639807334 |
Product | glycosyltransferase family 28 protein |
Protein accession | YP_954655 |
Protein GI | 120404826 |
COG category | [R] General function prediction only |
COG ID | [COG4671] Predicted glycosyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.479264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.241184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATCC CGGAACCGCA CTCCACCAAG CACATCGAGG ACATGCTGGC CTGGTCGGCC GACGTCGATG CCGCCACGCT CGTGGACACC ACGGATGCCA GGCTGGACCC GGAATTCCTC GCGGCGGAGC CATTCGAGGA TCTCGCGAAG GCCGTCGGCT GCCCGGTCCA TGTGGTGCAC GGGACAGCGG ACCGGATCAG CAGCCCCGCG GTCGGCGAGC AGCTCGCCGA GCTCACCGGC GGCTCGCTGA CGCTGATCGA GGGTGCCGGC CACGCGCCGC TGGCCCGAGA TCCGGTGCTG ATCAACACGA TGATCCACGA TTTCGTCGCG ACGGTCGCGC CGTCGCCGCG CCTCAAGCAG CGCGTCCGCG CGCCCCGCCG ACGGCGGAAG GCTCTGTACC TGTCCTCGCC GATCGGGCTC GGCCACGCCC GCCGCGATGT CGCGATCGCC ACCGAGCTAC GTTCTGCAAC AGAAGATCTG GAGATCGAGT GGCTGGCGCA GGACCCCGTC ACGCGGGTGC TCGCGTCAGC CGGTGAGCGG ATTCATCCCG CATCGGCACA GCTGCTCAAC GAATCGACGC ACGTCGAACA CGAATCCGGT GAGCACGACC TGCACGCGTT CGAGGCGCTG CGCCGGATGG ACGAGATCCT GGTCGCCAAC TTCATGGTGT TCGCCGACCT GATCGCCGAG GAACCCTTCG ATCTGGTGAT CGCCGACGAA GCGTGGGAGG TGGACTACTT CCTGCACGAG AATCCGGAAC TCAAGCGGTT CTCGTTCGCC TGGCTGACCG ACTTCGTCGG CTGGTTGCCC ATGCCGGACG GTGGACCGCG GGAGGCAGCG CTGACCGCCG ACTACAACGC GGAGATGATC GAGCAGCGCG CGAGGTTTCC GCGACTTCGG GACCGGTCGA TATTCGTCGG CAATCCCGAA GACGTTGTGC GACAAGACTT CGGCCCTGGG CTGCCCGACA TCAGGGAGTG GACCGGCCAG AACTTCGACT TCTCCGGATA TGTCACAGGC TCGGTGCCGC CGGCGGGTCC GGAGCGGGCG GCACTGCGTC GGAAACTCGG GTTGCAGCCG GATCAGCGAC TGTGCGTCGT CACCGTGGGC GGCACCTCGG TGGGGGAGTC GCTGCTGCAA CGCATTCTGC ATGCGGTGCC CATCGTTCGC CGGGCAATGC CGGAGCTTCA CTTCCTGGTC GTGACGGGTC CTCGCATCGA CCCCGCGACG CTGCCTCATC CGCGAGGCGT CCGGGTCCGT GGCTTCGTCC CCGACCTCGC CGACTACCTC GCCGCCTGTG ACATCGCGCT GGTGCAGGGT GGACTGACGA CGTGCATGGA GCTGACGGCG GCGGGAACGC CGTTCGTCTA TGTGCCACTG GAGAATCACT TCGAACAGAA CTTCCATGTG CGTCACCGGT TGGAGCGCTA CGGCGGCGGC CGTCCGATGC GCTACGCGGA GGCTGCCGAT CCGGACCTGC TGGCCAAGAT CATCTTCGAT GAACTGTCCG CGACGCGACG GGTCCTTCCC GTCGAGACCG ACGGAGCCAG GCGTGCCGCG GCGATGCTCG CCGATCTGCT GTAG
|
Protein sequence | MMIPEPHSTK HIEDMLAWSA DVDAATLVDT TDARLDPEFL AAEPFEDLAK AVGCPVHVVH GTADRISSPA VGEQLAELTG GSLTLIEGAG HAPLARDPVL INTMIHDFVA TVAPSPRLKQ RVRAPRRRRK ALYLSSPIGL GHARRDVAIA TELRSATEDL EIEWLAQDPV TRVLASAGER IHPASAQLLN ESTHVEHESG EHDLHAFEAL RRMDEILVAN FMVFADLIAE EPFDLVIADE AWEVDYFLHE NPELKRFSFA WLTDFVGWLP MPDGGPREAA LTADYNAEMI EQRARFPRLR DRSIFVGNPE DVVRQDFGPG LPDIREWTGQ NFDFSGYVTG SVPPAGPERA ALRRKLGLQP DQRLCVVTVG GTSVGESLLQ RILHAVPIVR RAMPELHFLV VTGPRIDPAT LPHPRGVRVR GFVPDLADYL AACDIALVQG GLTTCMELTA AGTPFVYVPL ENHFEQNFHV RHRLERYGGG RPMRYAEAAD PDLLAKIIFD ELSATRRVLP VETDGARRAA AMLADLL
|
| |