Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0076 |
Symbol | |
ID | 4027255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 95608 |
End bp | 96957 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637965227 |
Product | permease for cytosine/purines, uracil, thiamine, allantoin |
Protein accession | YP_572139 |
Protein GI | 92112211 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00450378 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGCG ACTCCGCCAC GTCGGAAGGG CCGCGCCCTG CCGGCAAGGC ACCGGGACAA GAATCCCTGG ACCCACAGCG TACCCGCATC ATGGGACGCC TCTCCTATCT ACTGGCCTGG TTCGGTGGCT GCGTGTCGAT CGGCACCTTC GCCATGGGCT CGAGCATCGT CGGTACGCTC AACCTGTTGC AGGCCTCGCT GGCGATCGCC ATCGGCTGCT TCGTGATCGG CGTCGCGCTG ACGATCAACG GCGCCGCCGG CTACAAGTAC GGCATTCCTT TCATGGTCCA GGCACGCAGC GCCTTCGGCT TTACCGGCAC ACGCTTGCCG GGGCTCATCC GCGCGGTGCC GGCCATCGTC TGGTACGGAT TTCAGAGCTG GATCGGCGCC GGCGCCCTGA ACGCCGTCTC GTCGACCCTG CTGGGCTTCG ACAATCTGGT GTTCTATTTC ATCGCCTTCC AGCTCTTGCA GATCGCCCTG TCGGTGTTCG GCTTCCAGGG CATCAAGTGG CTGGAAAACA TCGGCAGTGG CTTCATCCTG GCCTCGCTGG TCTACATGTT CTTCAGCGTG CTCGACAAGT ACGGCGATGT GATCGGCGAA CAGCTGATCG ACATCGACGG CACCTGGGGA CTGCCCTTCT GGGGCGCGAC GATGCTGTTC CTGGGCATCT ACAGCACCAT GATCCTCAAT GCCAGCGACT ACTCCCGCGA GCTCAAGCAC GGCAGCGGGC CGGGGCTTCT GACCACGCTC TACGCCATGT CGATTTTGCC CTGCACGCTG TTCATGGGCT TGATCGGGCT AATGGTCTCC GGTGCCACCG GCGTCGCCGA TCCCATCGAT GTCTTCGCCA ATGCCGTCGA CAATCCACCG CTTCTGATCA CCACCCTGTT GTTCATTGCC TTCGCCCAGG TCACCACCAA CGTGCTCAAC AATGTGGTGC CCCCCACCTA CGTGCTGATG GACGTCTTCA AGCTGAAATT CCGCAGCGCC ACCGTGATCG TGGGGTTGCT GGCCTTCGCG ACCTTTCCCT GGGAACTGGT CAAGGACGAT TCCGCGGCCG GACTGCAGCT GTTCGTGCAG ACCTATTCGG CGTTTCTGGG GCCGATCTTC GCGATCATGG CGGTGGATTA TTACCTCATT CGCCAGCGCA CGCTGGATCT CGACAAGCTC TATGACGCGC ATGGCCCTTA TCGCGGCATC AATTACGCCG CCTGCATCGC CACGCTGATC GGCGCCCTGG TGGCGCTCAA CGTCTCGGCG GTCTCGTGGT ACGCGAGCCT GCTGCCGGCC GGCCTCGCCT ATTACCTGCT GATGCGCCAC TGGCCGGCCT GCCGGCGTTT CAACCAATGA
|
Protein sequence | MTSDSATSEG PRPAGKAPGQ ESLDPQRTRI MGRLSYLLAW FGGCVSIGTF AMGSSIVGTL NLLQASLAIA IGCFVIGVAL TINGAAGYKY GIPFMVQARS AFGFTGTRLP GLIRAVPAIV WYGFQSWIGA GALNAVSSTL LGFDNLVFYF IAFQLLQIAL SVFGFQGIKW LENIGSGFIL ASLVYMFFSV LDKYGDVIGE QLIDIDGTWG LPFWGATMLF LGIYSTMILN ASDYSRELKH GSGPGLLTTL YAMSILPCTL FMGLIGLMVS GATGVADPID VFANAVDNPP LLITTLLFIA FAQVTTNVLN NVVPPTYVLM DVFKLKFRSA TVIVGLLAFA TFPWELVKDD SAAGLQLFVQ TYSAFLGPIF AIMAVDYYLI RQRTLDLDKL YDAHGPYRGI NYAACIATLI GALVALNVSA VSWYASLLPA GLAYYLLMRH WPACRRFNQ
|
| |