Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_4958 |
Symbol | |
ID | 4612635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 5197189 |
End bp | 5198745 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639794650 |
Product | permease for cytosine/purines, uracil, thiamine, allantoin |
Protein accession | YP_940937 |
Protein GI | 119870985 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACA CCCGTGACCT CCCGCCGAGC GCCGTGGTCG GCGCGGGCGA CATCGTCGAA GCCGCAGGGC ATCCCGTCGG CAGCGGGGTG ATCAAGGACA GCTACGACCC GCGGCTGACC AACGAGGACC TCGCACCCCT GGGCAAGCAG ACGTGGTCGT CGTACAACAT CTTCGCGTTC TGGATGTCGG ACGTGCACAG CGTCGGCGGA TATGTCACCG CGGGCAGCCT GTTCGCCCTG GGCCTGGCGA GCTGGCAGGT GCTGATCGCC CTGCTCGTCG GCATCGTGAT CGTCAACCTG CTGTGCAACC TGGTCGCCAA GCCCAGCCAG CAGGCCGGCG TGCCGTACCC CGTCGTATGC CGCAGTTCCT TCGGGGTCCT CGGCGCGAAC ATCCCGGCCA TCATCCGCGG CCTGATCGCG GTGGCCTGGT ACGGCATCCA GACCTACCTG GCGTCGGCGG CGCTCGACGT CGTACTGCTC AAACTGTTCC CCGGCCTGGC GCCCTACGCC GACGCCGACC AGTACGGCTT CACCGGCCTG TCCCTGCTGG GCTGGTGCAG CTTCATGCTG CTGTGGGTTC TGCAGGCGTG CGTGTTCTGG CGCGGTATGG AGTCGATCCG CAAGTTCATC GACTTCTGCG GTCCCGCGGT GTACGTGGTG ATGTTCATCC TCTGCGGCTA CCTGCTGTGG AAGTCGGGCT GGCACGTCAG CCTGTCGCTG GGCGGCGAGA AGCAGGGCAA CACGCTGGTG GTCATGCTCG GCGCGATCGC GCTCGTCGTG TCGTACTTCT CCGGGCCGAT GCTGAACTTC GGCGACTTCG CCCGCTACGG CAAGAGCTTC GAGGCGGTCA AGAAGGGCAA CTTCCTCGGC CTGCCGATCA ACTTCCTGAT GTTCTCGATC CTGGTCGTCG TCACCGCCGC AGCCACGGTG CCGGTGTTCG GCGAGCTCCT CACCGACCCG GTCGAGACCG TCGCCCGCAT CGACAGCGTC ACCGCGATCG TCCTCGGAGC GCTGACGTTC TCGATCGCCA CGATCGGCAT CAACATCGTG GCCAACTTCA TCAGCCCCGC CTTCGACTTC TCCAACGTCA GCCCGCAGCG GATCAGCTGG CGCATGGGCG GCATGATCGC CGCGGTCGGG TCGGTGCTGC TCACGCCGTG GAACCTCTAC AGCAACCCCG AGGTCATCCA CTACACGCTG GAGACCCTCG GTGCGTTCAT CGGCCCGCTG TTCGGTGTGC TGATCGCCGA CTTCTACCTG GTGCGCAAGC AGAAGATCGT GGTCGACGAT CTGTTCACGA TGTCGGAGAC CGCCAACTAC TGGTACCGGA GGGGCTACAA CCCCGCCGCG GTGACCGCCA CTCTGGTCGG CGCCGTCCTG GCCATGGCAC CGGTACTGCT CGGCGGCGTC GTACTCGGCA TGGCCGGCGC CGCGCAGTAC AGCTGGTTCA TCGGCTGCGG TGTGGCGTTC GCCCTCTACT ACGTGCTGGC CACCCGCGGC CCGTGGCGCA TGACCGCGCT GCGCGTGGCC GAGGGCGCGA CGCTGGTCTC GAACTAG
|
Protein sequence | MTDTRDLPPS AVVGAGDIVE AAGHPVGSGV IKDSYDPRLT NEDLAPLGKQ TWSSYNIFAF WMSDVHSVGG YVTAGSLFAL GLASWQVLIA LLVGIVIVNL LCNLVAKPSQ QAGVPYPVVC RSSFGVLGAN IPAIIRGLIA VAWYGIQTYL ASAALDVVLL KLFPGLAPYA DADQYGFTGL SLLGWCSFML LWVLQACVFW RGMESIRKFI DFCGPAVYVV MFILCGYLLW KSGWHVSLSL GGEKQGNTLV VMLGAIALVV SYFSGPMLNF GDFARYGKSF EAVKKGNFLG LPINFLMFSI LVVVTAAATV PVFGELLTDP VETVARIDSV TAIVLGALTF SIATIGINIV ANFISPAFDF SNVSPQRISW RMGGMIAAVG SVLLTPWNLY SNPEVIHYTL ETLGAFIGPL FGVLIADFYL VRKQKIVVDD LFTMSETANY WYRRGYNPAA VTATLVGAVL AMAPVLLGGV VLGMAGAAQY SWFIGCGVAF ALYYVLATRG PWRMTALRVA EGATLVSN
|
| |