Gene Information |
Locus tag | SeD_A3693 |
Symbol | codB |
ID | 6873653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3542649 |
End bp | 3543926 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642786669 |
Product | cytosine permease |
Protein accession | YP_002217303 |
Protein GI | 198242466 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1457] Purine-cytosine permease and related proteins |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCAAAA TTCATGGAGG CGTTGTGTCG CAGGACAACA ATTATAGCCA GGGCCCCGTC CCTCTGGCGG CGCGGAAGGG CGTGATTCCA CTGACGTTTG TCATGTTGGG TTTAACGTTT TTTTCCGCCA GTATGTGGAC CGGAGGGACA CTCGGCACCG GTCTTACCTA TCACGATTTC TTCCTCGCAG TTCTCTTCGG TAATCTCCTC CTCGGTATCT ACACTGCATT TCTTGGTTAC ATCGGCGCAA AAACCGGACT CTCCACCCAC CTCCTTGCAC GTTACTCCTT TGGCGTTAAA GGATCATGGC TTCCCTCGCT GCTGCTAGGC GGCACACAGG TAGGCTGGTT TGGCGTTGGC GTAGCGATGT TCGCTATTCC GGTCAGTAAA GCGACGGGCA TTGATGCCAA TATTCTGATT GCCATTTCGG GTCTACTGAT GACCCTGACC ATTTTTTTCG GCATCTCGGC GTTGACCATT TTGTCTATCA TTGCCGTACC CGCGATCGTT ATACTGGGCA GCTACTCCGT CTGGCTGGCG GTCAGCGGCG TGGGTGGGCT GGAGCATTTA AAAACGATAG TGCCGCAGAC GCCGCTGGAT TTTTCCAGCG CGCTGGCGCT GGTGGTGGGC TCGTTTGTCA GCGCCGGTAC ATTGACCGCC GACTTCGTCC GCTTCGGGCG TCATGCCAAA AGCGCCGTAC TGATTGCGAT GGTCGCTTTT TTCCTCGGCA ACTCGCTGAT GTTTATCTTT GGCGCGGCAG GCGCTGCCGC CGTCGGTCAG GCGGATATCT CTGACGTGAT GATAGCGCAG GGGCTGCTGC TGCCCGCGAT TGTGGTGCTT GGCCTGAATA TCTGGACCAC CAACGATAAC GCGCTGTACG CATCGGGTCT GGGCTTCGCC AATATTACCG GTCTTTCCAG CCGTACGCTG TCGGTGGTGA ACGGGATTAT CGGTACCGTG TGCGCGCTGT GGCTTTACAA TAATTTTGTC GGCTGGCTGA CGTTCCTGTC ATCTGCCATC CCACCGATTG GCGGAGTGAT TATTGCCGAC TATCTGTTGA ACCGCCGCCG CTATGCCGAC TTCAACACCG TGCGCTTTAT TCCCGTTAAC TGGATTGCTA TTCTTTCCGT CGCGCTGGGC ATCGCCGCCG GACATTATGT TCCGGGTATT GTGCCCGTCA ACGCCGTACT CGGCGGCGTC TTCAGCTATA TCCTGCTGAA TCCACTTTTC AACCGCAGCC TTGCTAAATC ACCAGAGGTC AGCCATGCAG AACAATAA
|
Protein sequence | MGKIHGGVVS QDNNYSQGPV PLAARKGVIP LTFVMLGLTF FSASMWTGGT LGTGLTYHDF FLAVLFGNLL LGIYTAFLGY IGAKTGLSTH LLARYSFGVK GSWLPSLLLG GTQVGWFGVG VAMFAIPVSK ATGIDANILI AISGLLMTLT IFFGISALTI LSIIAVPAIV ILGSYSVWLA VSGVGGLEHL KTIVPQTPLD FSSALALVVG SFVSAGTLTA DFVRFGRHAK SAVLIAMVAF FLGNSLMFIF GAAGAAAVGQ ADISDVMIAQ GLLLPAIVVL GLNIWTTNDN ALYASGLGFA NITGLSSRTL SVVNGIIGTV CALWLYNNFV GWLTFLSSAI PPIGGVIIAD YLLNRRRYAD FNTVRFIPVN WIAILSVALG IAAGHYVPGI VPVNAVLGGV FSYILLNPLF NRSLAKSPEV SHAEQ
|
| |
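The fields in this record are related arithmetically: the gene length follows from the start and end coordinates, and the protein length follows from the gene length once the stop codon is subtracted. The sketch below is one way to cross-check those values and to translate the gene sequence under translation table 11. It is a minimal illustration and not the database's own pipeline; the helper names (`clean`, `gene_length`, `gc_percent`, `translate`) are hypothetical. It assumes 1-based inclusive coordinates and that table 11 uses the standard amino acid assignments, with an alternative start codon such as TTG read as methionine when it is the first codon.

```python
from itertools import product

# Standard amino acid assignments (shared by NCBI tables 1 and 11),
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11; any of these is read
# as methionine when it initiates translation.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first codon, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An alternative start codon in first position is read as M.
        protein.append("M" if i == 0 and codon in START_CODONS else amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(3542649, 3543926)
    print(length)            # 1278
    print(length // 3 - 1)   # 425 (codons minus the stop codon)
    # First three blocks of the gene sequence from this record.
    print(translate("TTGGGCAAAA TTCATGGAGG CGTTGTGTCG"))  # MGKIHGGVVS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the protein sequence and GC content fields to be compared against the values listed in the record.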