Gene SeD_A3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3693 
SymbolcodB 
ID6873653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3542649 
End bp3543926 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content55% 
IMG OID642786669 
Productcytosine permease 
Protein accessionYP_002217303 
Protein GI198242466 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCAAAA TTCATGGAGG CGTTGTGTCG CAGGACAACA ATTATAGCCA GGGCCCCGTC 
CCTCTGGCGG CGCGGAAGGG CGTGATTCCA CTGACGTTTG TCATGTTGGG TTTAACGTTT
TTTTCCGCCA GTATGTGGAC CGGAGGGACA CTCGGCACCG GTCTTACCTA TCACGATTTC
TTCCTCGCAG TTCTCTTCGG TAATCTCCTC CTCGGTATCT ACACTGCATT TCTTGGTTAC
ATCGGCGCAA AAACCGGACT CTCCACCCAC CTCCTTGCAC GTTACTCCTT TGGCGTTAAA
GGATCATGGC TTCCCTCGCT GCTGCTAGGC GGCACACAGG TAGGCTGGTT TGGCGTTGGC
GTAGCGATGT TCGCTATTCC GGTCAGTAAA GCGACGGGCA TTGATGCCAA TATTCTGATT
GCCATTTCGG GTCTACTGAT GACCCTGACC ATTTTTTTCG GCATCTCGGC GTTGACCATT
TTGTCTATCA TTGCCGTACC CGCGATCGTT ATACTGGGCA GCTACTCCGT CTGGCTGGCG
GTCAGCGGCG TGGGTGGGCT GGAGCATTTA AAAACGATAG TGCCGCAGAC GCCGCTGGAT
TTTTCCAGCG CGCTGGCGCT GGTGGTGGGC TCGTTTGTCA GCGCCGGTAC ATTGACCGCC
GACTTCGTCC GCTTCGGGCG TCATGCCAAA AGCGCCGTAC TGATTGCGAT GGTCGCTTTT
TTCCTCGGCA ACTCGCTGAT GTTTATCTTT GGCGCGGCAG GCGCTGCCGC CGTCGGTCAG
GCGGATATCT CTGACGTGAT GATAGCGCAG GGGCTGCTGC TGCCCGCGAT TGTGGTGCTT
GGCCTGAATA TCTGGACCAC CAACGATAAC GCGCTGTACG CATCGGGTCT GGGCTTCGCC
AATATTACCG GTCTTTCCAG CCGTACGCTG TCGGTGGTGA ACGGGATTAT CGGTACCGTG
TGCGCGCTGT GGCTTTACAA TAATTTTGTC GGCTGGCTGA CGTTCCTGTC ATCTGCCATC
CCACCGATTG GCGGAGTGAT TATTGCCGAC TATCTGTTGA ACCGCCGCCG CTATGCCGAC
TTCAACACCG TGCGCTTTAT TCCCGTTAAC TGGATTGCTA TTCTTTCCGT CGCGCTGGGC
ATCGCCGCCG GACATTATGT TCCGGGTATT GTGCCCGTCA ACGCCGTACT CGGCGGCGTC
TTCAGCTATA TCCTGCTGAA TCCACTTTTC AACCGCAGCC TTGCTAAATC ACCAGAGGTC
AGCCATGCAG AACAATAA
 
Protein sequence
MGKIHGGVVS QDNNYSQGPV PLAARKGVIP LTFVMLGLTF FSASMWTGGT LGTGLTYHDF 
FLAVLFGNLL LGIYTAFLGY IGAKTGLSTH LLARYSFGVK GSWLPSLLLG GTQVGWFGVG
VAMFAIPVSK ATGIDANILI AISGLLMTLT IFFGISALTI LSIIAVPAIV ILGSYSVWLA
VSGVGGLEHL KTIVPQTPLD FSSALALVVG SFVSAGTLTA DFVRFGRHAK SAVLIAMVAF
FLGNSLMFIF GAAGAAAVGQ ADISDVMIAQ GLLLPAIVVL GLNIWTTNDN ALYASGLGFA
NITGLSSRTL SVVNGIIGTV CALWLYNNFV GWLTFLSSAI PPIGGVIIAD YLLNRRRYAD
FNTVRFIPVN WIAILSVALG IAAGHYVPGI VPVNAVLGGV FSYILLNPLF NRSLAKSPEV
SHAEQ