Gene Mjls_5237 details

Gene Information

Locus tag: Mjls_5237
Symbol:
ID: 4880935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5487163
End bp: 5488794
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 66%
IMG OID: 640142549
Product: permease for cytosine/purines, uracil, thiamine, allantoin
Protein accession: YP_001073492
Protein GI: 126437801
COG category: [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1953] Cytosine/uracil/thiamine/allantoin permeases
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
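
The start, end, gene length and protein length values in this record are consistent with one another. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5487163, 5488794)
print(length, protein_length_aa(length))  # 1632 543
```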


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.111402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.975867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGGCGCA ACGACGGTGA AACACACCGC AGCTACGTTC CGTTCATCGC CACGCCGATG 
GACAGGAACC GCATCATGAC AGACACCCGT GACCTCCCGC CGAGCGCCGT GGTCGGCGCG
GGCGACATCG TCGAAGCCGC AGGGCATCCC GTCGGCAGCG GGGTGATCAA GGACAGCTAC
GACCCGCGGC TGACCAACGA GGACCTCGCA CCGCTGGGCA AGCAGACGTG GTCGTCGTAC
AACATCTTCG CGTTCTGGAT GTCGGACGTG CACAGCGTCG GCGGATATGT CACTGCGGGC
AGCCTGTTCG CCCTGGGCCT GGCGAGCTGG CAGGTGCTGA TCGCCCTGCT CGTCGGCATC
GTGATCGTCA ACGTGCTGTG CAACCTGGTC GCCAAGCCCA GCCAGCAGGC CGGCGTGCCG
TACCCCGTCG TATGCCGCAG TTCCTTCGGT GTCCTCGGCG CGAACATTCC GGCCATCATC
CGCGGCCTGA TCGCGGTGGC CTGGTACGGC ATCCAGACCT ACCTGGCGTC GGCCGCGCTC
GACGTCGTGC TGCTCAAACT GTTCCCCGGC CTGGCGCCCT ACGCCGACGC CGACCAGTAC
GGCTTCACCG GCCTGTCCCT GCTGGGCTGG TGCAGCTTCA TGCTGCTGTG GGTGCTGCAG
GCGTGCGTGT TCTGGCGCGG TATGGAGTCG ATCCGCAAGT TCATCGACTT CTGCGGTCCC
GCGGTGTACG TGGTGATGTT CATCCTCTGC GGCTACCTGC TGTGGAAATC GGGCTGGCAC
GTCAGCCTGT CGCTGGGCGG CGAGAAGCAG GGCAACACGC TGGTGGTCAT GCTCGGCGCG
ATCGCACTCG TCGTGTCGTA CTTCTCCGGG CCGATGCTGA ACTTCGGCGA CTTCGCCCGC
TACGGCAAGA GCTTCGAGGC GGTCAAGAAG GGCAACTTCC TTGGCCTGCC GATCAACTTC
CTGATGTTCT CGATCCTGGT CGTCGTCACC GCCGCGGCCA CGGTGCCGGT GTTCGGCGAG
CTCCTCACCG ACCCGGTCGA GACCGTCGCC CGCATCGACA GCGTCACCGC GATCGTCCTC
GGAGCGCTGA CGTTCTCGAT CGCCACGATC GGCATCAACA TCGTGGCCAA CTTCATCAGC
CCCGCCTTCG ACTTCTCCAA CGTCAGCCCG CAGCGGATCA GCTGGCGCAT GGGCGGCATG
ATCGCCGCGG TCGGGTCGGT GCTGCTCACG CCGTGGAACC TCTATAGCAA CCCCGAGGTC
ATCCACTACA CGCTGGAGAC CCTCGGCGCG TTCATCGGCC CGCTGTTCGG CGTGCTGATC
GCCGACTTCT ACCTGGTGCG CAAGCAGAAG ATCGTGGTCG ACGATCTGTT CACGATGTCG
GAGACCGCCA ACTACTGGTA CCGGAAGGGC TACAACCCCG CCGCGGTGAC CGCCACCCTC
GTCGGCGCCG TCCTGGCCAT GGCACCGGTA CTGCTCGGCG GCGTCGTGTT CGGCATGGCC
GGCGCCGCGC AGTACAGCTG GTTCATCGGC TGCGGTGTGG CGTTCGCCCT CTACTACGTG
CTGGCCACCC GTGGCCCGTG GCGCATGACC GCGCTGCGTG TCGCCGAGGG CGCGACGCTG
GTCTCGAACT AG
 
Protein sequence
MRRNDGETHR SYVPFIATPM DRNRIMTDTR DLPPSAVVGA GDIVEAAGHP VGSGVIKDSY 
DPRLTNEDLA PLGKQTWSSY NIFAFWMSDV HSVGGYVTAG SLFALGLASW QVLIALLVGI
VIVNVLCNLV AKPSQQAGVP YPVVCRSSFG VLGANIPAII RGLIAVAWYG IQTYLASAAL
DVVLLKLFPG LAPYADADQY GFTGLSLLGW CSFMLLWVLQ ACVFWRGMES IRKFIDFCGP
AVYVVMFILC GYLLWKSGWH VSLSLGGEKQ GNTLVVMLGA IALVVSYFSG PMLNFGDFAR
YGKSFEAVKK GNFLGLPINF LMFSILVVVT AAATVPVFGE LLTDPVETVA RIDSVTAIVL
GALTFSIATI GINIVANFIS PAFDFSNVSP QRISWRMGGM IAAVGSVLLT PWNLYSNPEV
IHYTLETLGA FIGPLFGVLI ADFYLVRKQK IVVDDLFTMS ETANYWYRKG YNPAAVTATL
VGAVLAMAPV LLGGVVFGMA GAAQYSWFIG CGVAFALYYV LATRGPWRMT ALRVAEGATL
VSN
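
The gene and protein sequences can be related with a short script. The sketch below builds the codon assignments of translation table 11 (the bacterial, archaeal and plant plastid code), reads an alternative start codon such as the TTG that opens this gene as methionine, and computes GC content. The helper names `clean`, `gc_content` and `translate` are hypothetical, and the initiator codon set is taken from the NCBI definition of table 11 as an assumption rather than from this record.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiator codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
print(translate("TTGCGGCGCA ACGACGGTGA A"))  # MRRNDGE
```

Passing the full gene sequence to `translate` is expected to give the 543-residue protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field in the record.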