Gene SeHA_C3631 details

Gene Information

Locus tag: SeHA_C3631
Symbol: codB
ID: 6489255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3517128
End bp: 3518405
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 55%
IMG OID: 642743749
Product: cytosine permease
Protein accession: YP_002047361
Protein GI: 194447479
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
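
The length fields in this record are consistent with its coordinates. A minimal sketch of that check follows, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions, not stated on this page, and the function names are illustrative only.

```python
# Coordinates copied from the record above. The 1-based inclusive
# convention is an assumption, not something the page states.
START_BP = 3517128
END_BP = 3518405


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming exactly one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1278
print(protein_length(length_bp))  # 425
```

Both results agree with the Gene Length and Protein Length fields.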


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value: 0.822935
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGCAAAA TTCATGGAGG CGTTGTGTCG CAGGACAACA ATTATAGCCA GGGCCCCGTC 
CCTCTGGCGG CGCGGAAGGG CGTGATTCCA CTGACGTTTG TCATGTTGGG TTTAACGTTT
TTTTCCGCCA GTATGTGGAC CGGAGGGACA CTCGGCACCG GTCTTTCTTA TAATGATTTC
TTCCTCGCAG TTCTCTTCGG TAATCTCCTC CTCGGTATCT ACACTGCATT TCTTGGTTAC
ATCGGCGCAA AAACCGGACT CTCCACCCAC CTCCTTGCAC GTTACTCCTT TGGCGTTAAA
GGCTCATGGC TTCCCTCGCT ACTGCTAGGC GGTACTCAAG TGGGCTGGTT TGGCGTTGGC
GTAGCGATGT TCGCTATTCC GGTCAGTAAA GCGACGGGCA TTGATGCCAA TATTCTGATT
GCCGTTTCGG GTCTACTGAT GACCCTGACC ATTTTTTTCG GCATCTCGGC GTTGACCATT
TTGTCTATCA TTGCCGTACC CGCGATCGTG ATTCTGGGCA GCTACTCCGT CTGGCTGGCG
GTCAGCGGCG TGGGTGGGCT GGAGCATTTA AAAACGATAG TGCCGCAGAC GCCGCTGGAT
TTTTCCAGCG CGCTGGCGCT GGTGGTGGGC TCGTTTGTCA GCGCCGGTAC ATTGACCGCC
GACTTCGTCC GCTTCGGGCG TCATGCCAAA AGCGCCGTAC TGATTGCGAT GGTCGCTTTT
TTCCTCGGCA ACTCGCTGAT GTTTATCTTT GGCGCGGCAG GCGCTGCCGC CGTCGGTCAG
GCGGATATCT CTGACGTGAT GATAGCGCAG GGGCTGCTGC TGCCCGCGAT TGTGGTGCTT
GGCCTGAATA TCTGGACCAC CAACGATAAC GCGCTGTACG CATCGGGTCT GGGCTTCGCC
AATATTACCG GTCTTTCCAG CCGTACGCTG TCGGTGGTGA ACGGGATTAT CGGTACCGTG
TGCGCGCTGT GGCTTTACAA TAATTTTGTC GGCTGGCTGA CGTTCCTGTC ATCTGCCATC
CCACCGATTG GCGGAGTGAT TATTGCCGAC TATCTGTTGA ACCGTCGCCG CTATGCCGAC
TTCAACACCG TGCGCTTTAT TCCCGTTAAC TGGATTGCTA TTCTTTCCGT CGCGCTGGGC
ATCGCCGCCG GACATTATGT TCCTGGTATT GTGCCCGTCA ACGCCGTACT CGGCGGCGTC
TTCAGCTATA TCCTGCTGAA TCCACTTTTC AACCGCAGCC TTGCTAAATC ACCAGAGGTC
AGCCATGCAG AACAATAA
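
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before computing anything such as length or GC content. A minimal sketch, with hypothetical helper names (`clean_sequence`, `gc_percent`) and only the first line of the sequence used as a demonstration:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "TTGGGCAAAA TTCATGGAGG CGTTGTGTCG CAGGACAACA ATTATAGCCA GGGCCCCGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions on the full sequence should give values comparable to the Gene Length and GC content fields, assuming the page rounds GC content to a whole percent.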
 
Protein sequence
MGKIHGGVVS QDNNYSQGPV PLAARKGVIP LTFVMLGLTF FSASMWTGGT LGTGLSYNDF 
FLAVLFGNLL LGIYTAFLGY IGAKTGLSTH LLARYSFGVK GSWLPSLLLG GTQVGWFGVG
VAMFAIPVSK ATGIDANILI AVSGLLMTLT IFFGISALTI LSIIAVPAIV ILGSYSVWLA
VSGVGGLEHL KTIVPQTPLD FSSALALVVG SFVSAGTLTA DFVRFGRHAK SAVLIAMVAF
FLGNSLMFIF GAAGAAAVGQ ADISDVMIAQ GLLLPAIVVL GLNIWTTNDN ALYASGLGFA
NITGLSSRTL SVVNGIIGTV CALWLYNNFV GWLTFLSSAI PPIGGVIIAD YLLNRRRYAD
FNTVRFIPVN WIAILSVALG IAAGHYVPGI VPVNAVLGGV FSYILLNPLF NRSLAKSPEV
SHAEQ
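
The gene sequence starts with TTG while the protein starts with M, which fits translation table 11, where several alternative start codons are read as methionine at the first position. A minimal sketch of such a translation follows. The codon table is the standard one, the start-codon set is assumed from NCBI table 11, and `translate` is an illustrative helper, not the tool that produced this page.

```python
from itertools import product

# Standard codon table in NCBI's TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGGGCAAAA TTCATGGAGG CGTTGTGTCG"))  # MGKIHGGVVS
# Last 18 bases of the gene sequence above, including the TAA stop codon.
print(translate("AGCCATGCAG AACAATAA"))  # SHAEQ
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.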