Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1749 |
Symbol | |
ID | 4110583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 1885473 |
End bp | 1887602 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638030869 |
Product | hypothetical protein |
Protein accession | YP_638914 |
Protein GI | 108798717 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.13421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGGT CCACCACTGA TCAGACTGTC TACACCGCCT CCGTTCTCAC CGACGACGCA GACCGCGACG CACTCGCGAC GCTGCGGTCC GATCCCGGGA TCGAGTTCGT CGACCGCGCC GCGGAACTGC GCTCGGCCCT GGCCGAACTG CGTCCGCCCG CGGACGGTGA GCTGGCGGAT GAGCCGATCC GGTGGGCGTA CTTCCCGTGG CGGCGCACGG TCGTCAGCGT CCTCGGCCCT CGTGCCCACA GCCGGCTCCG ACTCGACCGC AACCGCAATC TGATCACCGC GTCGGAACAA CAGCGCCTCG GTCGGCTGCG GGTCGGGGTG ATCGGTCTCA GCGTCGGCCA CGCCGTGGCC CACACGCTTG CCGCACAGGG GGTCTGCGGC GAACTGCGCC TCGCCGACTT CGACGGCCTC GAGGTGTCGA ACCTCAACCG CGTCCCCGCC AGCCTGGTCG ACCTCGGGGT GAACAAGGCG GTGGTCGCGG CCCGCCGGAT CGCCGAACTC GATCCTCACC TCCCGGTGCA GGTCTTGACC GACGGTGTCA CTCCGGCCAC GGTCGAGGAG TTCCTCGACG GGCTCGACGT GGTGGTGGAG GAGTGCGATT CGCTGGACGT GAAGGTCCTG GTGCGTGAGC ACGCCCGCGC CCGCCGCATC CCGGTCCTGA TGGCGACCAG CGATCGCGGC CTACTCGACG TCGAACGCTT CGACGTCGAC CCGTCGCGGC CGCTGCTGCA CGGGCTGCTC GGCGACATCG ACTCGGCGGC ACTGTCCGGG CTGACGAGCA AGGACAAGGT GCCCTACGTG CTGCGCATCC TCGACGCGAG CGCGCTGTCG TCGCGGATGG CCGCGTCGCT GGTGGAAGTC GGGACGAGCC TGACGACATG GCCGCAGCTC GCCGGTGAAG TCGCGCTGGG TGCCACCGCC GTCACCGAGG CGGTGCGCCG CATCGGCCTC GGCGAACCGC TGCCCTCGGG CCGGGTGCGG ATGGATGCCG CTGCGCTCCT CGACCGCGTC GAGGACCCGC TCGCTTCCCC GGCCGTCGAG GACACGGCCG ACGATCGGGT GGCCGATTCC GAAACGGACT CCGTCCCCGA GAAGGTCGCC GCCGCCGCCG TGCGGGCCCC CTCGGGAGGC AACTCGCAGC CGTGGCACAT CGAACTGCAG TCCGACGCCG TACACCTGTG GGTGGCCCCC GAGGCCACGA GCGCGATGGA CGTCGCGTAC CGGGGTAGCG CGGTCGCACT CGGTGCGGCG GTCTTCAACG CGCGGATCGC CGCGGCGGCC CACCACCGCG CCGCCGACGT GCACATCGAG CGCGGCGACG AGCGCTCGCC CCTTCGGGCC GTCGTCCGCC TCGGGCGGGC CGAGGACGAG GAGCTGGCGC GTCTCTACCC CGCGATGCTG CGGCGGGAGA CCAACCGGCA TCACGGGTCC GGCACACCGA TCGGCGCTCC CGAGATCGAG GCGATGGCCG CCGCCGCGCG GGCCGAAGGT GGCCGCCTGG CGATGCTCAC CGACACCCGC GACATGGCCG AGGCCGCCGA GATCCTCGCG GCCACCGACC GGATCCGGTA CCTCACACCA CGTCTGCACT CGGAGATGTT CTCGGAGCTG CGCTGGCCCG GGGATCCGGC GCCCGACTCC GGGATCGACG TGCGGACACT GGAGTTGGGC CCGACGGACC TGGTGAAGCT CGACATCCTG CGCCGCGGCG AGGTGATGGC CGATCTGGCC CGATGGGAGG CGGGGTCGGC GCTGGGCGAG GACACCTACG ACCGGCTGAC GTCCAGCGCA GCACTGGGTG TCGTCACCGT CACCGGACAC ACACTGCCGG ACTACCTGCG CGGGGGCGCG GCGACCGAGG CGGTCTGGAT CGCCGCCGAA CAGCACGGCG TCGCAACCCA TCCGGTGTCC CCGGTGTTCC TCTACGCGCA CGGCGACCGC GAACTCGCCG AACTCTCCCC GGCGTTCGCC GGTGAACTCG GCGAGCTGCA GCGCCGTTTC CGCGCGCTGA CCGGCATCGG CGCCGAGGAG TCCGAGGCGT TGGTCCTACG CTTCTCCATC GCGCCCCGCC CGTCGATGCG CAGCAGACGC CGACGCTTGA GCACCCCGGC ACCGGCATGA
|
Protein sequence | MTRSTTDQTV YTASVLTDDA DRDALATLRS DPGIEFVDRA AELRSALAEL RPPADGELAD EPIRWAYFPW RRTVVSVLGP RAHSRLRLDR NRNLITASEQ QRLGRLRVGV IGLSVGHAVA HTLAAQGVCG ELRLADFDGL EVSNLNRVPA SLVDLGVNKA VVAARRIAEL DPHLPVQVLT DGVTPATVEE FLDGLDVVVE ECDSLDVKVL VREHARARRI PVLMATSDRG LLDVERFDVD PSRPLLHGLL GDIDSAALSG LTSKDKVPYV LRILDASALS SRMAASLVEV GTSLTTWPQL AGEVALGATA VTEAVRRIGL GEPLPSGRVR MDAAALLDRV EDPLASPAVE DTADDRVADS ETDSVPEKVA AAAVRAPSGG NSQPWHIELQ SDAVHLWVAP EATSAMDVAY RGSAVALGAA VFNARIAAAA HHRAADVHIE RGDERSPLRA VVRLGRAEDE ELARLYPAML RRETNRHHGS GTPIGAPEIE AMAAAARAEG GRLAMLTDTR DMAEAAEILA ATDRIRYLTP RLHSEMFSEL RWPGDPAPDS GIDVRTLELG PTDLVKLDIL RRGEVMADLA RWEAGSALGE DTYDRLTSSA ALGVVTVTGH TLPDYLRGGA ATEAVWIAAE QHGVATHPVS PVFLYAHGDR ELAELSPAFA GELGELQRRF RALTGIGAEE SEALVLRFSI APRPSMRSRR RRLSTPAPA
|
| |