Gene Information |
Locus tag | Mvan_1968 |
Symbol | |
ID | 4645815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 2101084 |
End bp | 2103264 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639805455 |
Product | hypothetical protein |
Protein accession | YP_952793 |
Protein GI | 120402964 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 |
TIGRFAM ID | |
|
|
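The length fields above are consistent with the coordinates if Start bp and End bp are read as an inclusive, 1-based interval. The record does not state that convention, so treat it as an assumption. For an unspliced CDS, the protein length is then the number of codons minus the stop codon. A minimal sketch of that arithmetic (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based interval (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Mvan_1968.
length = gene_length_bp(2101084, 2103264)
print(length, protein_length_aa(length))  # prints "2181 726"
```

Both results agree with the Gene Length and Protein Length fields listed in the table.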
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACATTTG AGCACTCTGG TGCGCAGGCA GACGACTCCA ACGGACGTGT CGAGGACCAG AGTGCCGCAG GCGCGCTCGT CCTCGACCCG GCCGAGCCGG CCGATCAACA GGTCCTGGAC TCTCTGCGTG GCGATCCGGC CATCGTGTTC ATCGACCGCG CCGCGCAACA GGCCGAGACA CTGGCTCGGC TGCTGCCGGC TCCGACAGGA GAACTGACCG ACGAGCCGCG GCGCTGGGCC TACTATCCGT GGCGTCGTAC CGTCGTCAGC ATCCTCGGCC CGCGCGCGTT CGCGCGGGTG CGTGCGGACC GCAACCGCAA CCTGATCACC TCGCAGGAGC AGGAGGCGCT CGGCTCGCTC CGGGTCGGAG TGGTCGGTCT CAGCGTCGGC CACGCCGTGG CACACACGCT GGCGATGCAG GGGCTCTGCG GCGAGTTACG CCTTGCCGAC TTCGACGATC TCGAACTGTC GAATCTGAAC CGGGTTCCGG CAACGGTTTT CGACATCGGT GTGAACAAGG CGGTCACGGC GGCACGCAGG ATCGCCGAGG TCGATCCGTA CCTCGCGGTC CGCGTGATCG ATTCCGGCTT GACGGACCAG ACCGTCGATG CGTTCCTCGA CGGGCTCGAC ATCGTGGTCG AGGAATGTGA CTCGCTTGAT GTCAAAGCGC TTGTCCGGGA AGCGGCGCGC AGCCGGCGGC AGCCGGTGCT GATGGCGACC AGCGACCGCG GCCTCATCGA CGTCGAACGG TTCGACCTCG AGCCGGAACG GCCCATCTTC CACGGCCTGC TCGGCGAGGT CGACGCGGCG AGTCTGGCCG GCCTGGACAG CCGGGAGAAA ATCCCCCACG TGCTACGGAT CATCGACGGC GGCAACCTCT CCGCGCGGGG GGCCGCATCG CTCGTCGAGG TAGGGCAGAC GCTGTCGACG TGGCCGCAGT TGGTGGGTGA CGTCCTGGTC GGGGCCGCCG CCGTCGCCGA GGCAGTACGC AGAATCGGCC TGGGGGAACC ACTTTCGTCG GGCCGGGCAC GCCTCGACAC GGCGACGGCC CTGACGGGGC TGACGGACCC GATGCGGCGG GCCCCGCAAC CCGGTTGGGA TCCGGTGACG GTCGAGGATC CGGCGGACGT TCGCGATGCC GTGCACGCGG TGGCGCTCGC CGCCAATCGC GCGCCGTCGG GCGGCAATGT CCAGCCGTGG CACATCGAGT CTGGACGAGA CGTCGTGACG ATCCGTGTCG CCCCGGAGTT CCAGTCCACG ATCGACGTGG GGTTCCGCGG TAGCGCGGTG GCGGTGGGCG CGGCCACGTT CAATGCGCGC GTCGCCGCCG CCGCCCATCA GTTGTCGGGC GAGGTGACGT TCGTCGAGGG TGACGACCGG TCACCGCTGA CCGCCGTGGT CACCCTCGCC GAAGGCCGCG ACGAGCACCT TGCTTCGCTG TATCCGGCAC TTGCGGCGCG AGAGACCAAC CGCCGCCACG GAACCCCCTC GGAGCTGCCG CCGGCCACCG TGGCCGCGCT TGACGCCGCG GGCCGGCGGG AAGGGGCCCG CGTGCAGCTG CTGGAGGATC GGGAGGTCAT CGACGCGCTC GGCACCCTCC TGGCCGAGAC CGACCGGATT CGCTATCTCA CCCCGCATCT GCACACCGAC ATGGCCTCGG AGCTGCGTTG GCCCGGTGAC GGCTTGCCAG ATTCGGGGAT CGACGTGCGC AGCCTCGAGT TGGGCGCCGC GGAGCTCGTC ACACTCGACA TCCTGCGCAG ACCCGAGGTC ATGTCACTGC TGACCGCGTG GAATGCCGGC 
GCCGTGCTGG GCGCCGACAC CAGGGCGCGC ATCGTCAACA GCGGTGCGCT GGCGTTGATC CTGACCGATG ACATTTCCCT CACCGGCTAT GCGCGGGGAG GCTCGGCAGC CGAGGCCGTG TGGATCGCGG CGCAGCAGCA CGGTCTTTCG GTACAGCCGA TATCCCCGGT CTTCCTGTAC GCCCACACCG CTGATGAACT CCACGAGCTT TCACCGCGAT TCGCCGACCG TCTGGCTGAT CTGCAGAGAC GGTTCCGTCA ACTGACAGCT GTCGGTCCCG GCGAGGCGAC GGCGCTGATC CTGCGACTCA CCTCCGCACC GCCCCCATCG GTGCGGAGCA GACGGCGCCC CCTCCACAGC GCTTCAACGC CCGTATCCTG A
|
Protein sequence | MTFEHSGAQA DDSNGRVEDQ SAAGALVLDP AEPADQQVLD SLRGDPAIVF IDRAAQQAET LARLLPAPTG ELTDEPRRWA YYPWRRTVVS ILGPRAFARV RADRNRNLIT SQEQEALGSL RVGVVGLSVG HAVAHTLAMQ GLCGELRLAD FDDLELSNLN RVPATVFDIG VNKAVTAARR IAEVDPYLAV RVIDSGLTDQ TVDAFLDGLD IVVEECDSLD VKALVREAAR SRRQPVLMAT SDRGLIDVER FDLEPERPIF HGLLGEVDAA SLAGLDSREK IPHVLRIIDG GNLSARGAAS LVEVGQTLST WPQLVGDVLV GAAAVAEAVR RIGLGEPLSS GRARLDTATA LTGLTDPMRR APQPGWDPVT VEDPADVRDA VHAVALAANR APSGGNVQPW HIESGRDVVT IRVAPEFQST IDVGFRGSAV AVGAATFNAR VAAAAHQLSG EVTFVEGDDR SPLTAVVTLA EGRDEHLASL YPALAARETN RRHGTPSELP PATVAALDAA GRREGARVQL LEDREVIDAL GTLLAETDRI RYLTPHLHTD MASELRWPGD GLPDSGIDVR SLELGAAELV TLDILRRPEV MSLLTAWNAG AVLGADTRAR IVNSGALALI LTDDISLTGY ARGGSAAEAV WIAAQQHGLS VQPISPVFLY AHTADELHEL SPRFADRLAD LQRRFRQLTA VGPGEATALI LRLTSAPPPS VRSRRRPLHS ASTPVS
|
| |