Gene Mkms_4958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4958 
Symbol 
ID4612635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5197189 
End bp5198745 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content67% 
IMG OID639794650 
Productpermease for cytosine/purines, uracil, thiamine, allantoin 
Protein accessionYP_940937 
Protein GI119870985 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACA CCCGTGACCT CCCGCCGAGC GCCGTGGTCG GCGCGGGCGA CATCGTCGAA 
GCCGCAGGGC ATCCCGTCGG CAGCGGGGTG ATCAAGGACA GCTACGACCC GCGGCTGACC
AACGAGGACC TCGCACCCCT GGGCAAGCAG ACGTGGTCGT CGTACAACAT CTTCGCGTTC
TGGATGTCGG ACGTGCACAG CGTCGGCGGA TATGTCACCG CGGGCAGCCT GTTCGCCCTG
GGCCTGGCGA GCTGGCAGGT GCTGATCGCC CTGCTCGTCG GCATCGTGAT CGTCAACCTG
CTGTGCAACC TGGTCGCCAA GCCCAGCCAG CAGGCCGGCG TGCCGTACCC CGTCGTATGC
CGCAGTTCCT TCGGGGTCCT CGGCGCGAAC ATCCCGGCCA TCATCCGCGG CCTGATCGCG
GTGGCCTGGT ACGGCATCCA GACCTACCTG GCGTCGGCGG CGCTCGACGT CGTACTGCTC
AAACTGTTCC CCGGCCTGGC GCCCTACGCC GACGCCGACC AGTACGGCTT CACCGGCCTG
TCCCTGCTGG GCTGGTGCAG CTTCATGCTG CTGTGGGTTC TGCAGGCGTG CGTGTTCTGG
CGCGGTATGG AGTCGATCCG CAAGTTCATC GACTTCTGCG GTCCCGCGGT GTACGTGGTG
ATGTTCATCC TCTGCGGCTA CCTGCTGTGG AAGTCGGGCT GGCACGTCAG CCTGTCGCTG
GGCGGCGAGA AGCAGGGCAA CACGCTGGTG GTCATGCTCG GCGCGATCGC GCTCGTCGTG
TCGTACTTCT CCGGGCCGAT GCTGAACTTC GGCGACTTCG CCCGCTACGG CAAGAGCTTC
GAGGCGGTCA AGAAGGGCAA CTTCCTCGGC CTGCCGATCA ACTTCCTGAT GTTCTCGATC
CTGGTCGTCG TCACCGCCGC AGCCACGGTG CCGGTGTTCG GCGAGCTCCT CACCGACCCG
GTCGAGACCG TCGCCCGCAT CGACAGCGTC ACCGCGATCG TCCTCGGAGC GCTGACGTTC
TCGATCGCCA CGATCGGCAT CAACATCGTG GCCAACTTCA TCAGCCCCGC CTTCGACTTC
TCCAACGTCA GCCCGCAGCG GATCAGCTGG CGCATGGGCG GCATGATCGC CGCGGTCGGG
TCGGTGCTGC TCACGCCGTG GAACCTCTAC AGCAACCCCG AGGTCATCCA CTACACGCTG
GAGACCCTCG GTGCGTTCAT CGGCCCGCTG TTCGGTGTGC TGATCGCCGA CTTCTACCTG
GTGCGCAAGC AGAAGATCGT GGTCGACGAT CTGTTCACGA TGTCGGAGAC CGCCAACTAC
TGGTACCGGA GGGGCTACAA CCCCGCCGCG GTGACCGCCA CTCTGGTCGG CGCCGTCCTG
GCCATGGCAC CGGTACTGCT CGGCGGCGTC GTACTCGGCA TGGCCGGCGC CGCGCAGTAC
AGCTGGTTCA TCGGCTGCGG TGTGGCGTTC GCCCTCTACT ACGTGCTGGC CACCCGCGGC
CCGTGGCGCA TGACCGCGCT GCGCGTGGCC GAGGGCGCGA CGCTGGTCTC GAACTAG
 
Protein sequence
MTDTRDLPPS AVVGAGDIVE AAGHPVGSGV IKDSYDPRLT NEDLAPLGKQ TWSSYNIFAF 
WMSDVHSVGG YVTAGSLFAL GLASWQVLIA LLVGIVIVNL LCNLVAKPSQ QAGVPYPVVC
RSSFGVLGAN IPAIIRGLIA VAWYGIQTYL ASAALDVVLL KLFPGLAPYA DADQYGFTGL
SLLGWCSFML LWVLQACVFW RGMESIRKFI DFCGPAVYVV MFILCGYLLW KSGWHVSLSL
GGEKQGNTLV VMLGAIALVV SYFSGPMLNF GDFARYGKSF EAVKKGNFLG LPINFLMFSI
LVVVTAAATV PVFGELLTDP VETVARIDSV TAIVLGALTF SIATIGINIV ANFISPAFDF
SNVSPQRISW RMGGMIAAVG SVLLTPWNLY SNPEVIHYTL ETLGAFIGPL FGVLIADFYL
VRKQKIVVDD LFTMSETANY WYRRGYNPAA VTATLVGAVL AMAPVLLGGV VLGMAGAAQY
SWFIGCGVAF ALYYVLATRG PWRMTALRVA EGATLVSN