Gene EcSMS35_0367 details

Gene Information

Locus tag: EcSMS35_0367
Symbol: codB
ID: 6144024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 379358
End bp: 380617
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 54%
IMG OID: 641615263
Product: cytosine permease
Protein accession: YP_001742470
Protein GI: 170680347
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
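
The start, end, and length values in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 379358
end_bp = 380617

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under those assumptions the result agrees with the listed Gene Length (1260 bp) and Protein Length (419 aa).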


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.506759
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCAGTCCCGC AGTCGGCGCG GAAAGGGGTA 
TTGGCATTGA CGTTCGTCAT GCTGGGATTA ACCTTCTTTT CCGCCAGTAT GTGGACCGGC
GGCACTCTCG GAACCGGTCT TAGCTATCAT GATTTCTTCC TCGCAGTCCT CATCGGTAAT
CTTCTCCTCG GTATTTACAC TTCATTTCTT GGTTACATTG GCGCAAAAAC CGGCCTGACC
ACTCATCTTC TTGCTCGCTT CTCTTTTGGT GTTAAAGGCT CATGGCTGCC TTCACTGCTA
CTGGGCGGAA CTCAGGTTGG CTGGTTTGGC GTTGGCGTGG CGATGTTTGC TATTCCGGTG
GGCAAGGCAA CTGGGCTGGA TATTAATTTG CTGATTGCCG TTTCCGGTTT ACTGATGACC
GTAACCGTCT TCTTTGGCAT TTCGGCGCTG ACGGTTCTTT CGGTGATTGC AGTTCCGGCT
ATCGCCTGTC TTGGCGGTTA TTCCGTGTGG TTGGCCGTTA ACGGCATGGG CGGCCTGGAC
GCATTAAAAG CGGTCGTTCC CGCACAACCG TTAGATTTCA ATGTCGCGCT GGCGCTGGTT
GTGGGGTCAT TTATCAGTGC GGGCACGCTC ACCGCTGACT TTGTCCGGTT TGGTCGTAAT
GCCAAACTGG CGGTGCTGGT GGCGATGGTG GCCTTTTTCC TCGGCAACTC GTTGATGTTT
ATTTTCGGTG CAGCGGGCGC GGCGGCACTG GGGATGGCGG ATATCTCTGA TGTGATGATT
GCTCAGGGTC TGCTGCTGCC TGCGATTGTG GTGCTGGGGC TGAATATCTG GACCACCAAC
GATAACGCGC TCTATGCGTC GGGCTTAGGT TTCGCCAACA TTACCGGGAT GTCGAGCAAA
ACCCTTTCGG TAATCAACGG TATTATCGGT ACGGTCTGCG CATTATGGCT GTATAACAAT
TTTGTCGGCT GGCTGACCTT CCTTTCGGCA GCTATTCCTC CGGTAGGGGG CGTGATCATC
GCTGACTATC TGATGAACCG TCGCCGCTAT GAGCACTTTG CGACCACGCG TATGATGAGT
GTCAATTGGG TGGCGATTCT GGCGGTCGCC TTGGGGATTG CCGCAGGCCA CTGGTTACCG
GGAATTGTTC CGGTCAACGC GGTATTAGGT GGCGCGCTGA GCTATCTGAT CCTTAACCCG
ATTTTGAATC GTAAAACGAC AGCAGCAATG ACGCATGTGG AGGCTAACAG TGTCGAATAA
 
Protein sequence
MSQDNNFSQG PVPQSARKGV LALTFVMLGL TFFSASMWTG GTLGTGLSYH DFFLAVLIGN 
LLLGIYTSFL GYIGAKTGLT THLLARFSFG VKGSWLPSLL LGGTQVGWFG VGVAMFAIPV
GKATGLDINL LIAVSGLLMT VTVFFGISAL TVLSVIAVPA IACLGGYSVW LAVNGMGGLD
ALKAVVPAQP LDFNVALALV VGSFISAGTL TADFVRFGRN AKLAVLVAMV AFFLGNSLMF
IFGAAGAAAL GMADISDVMI AQGLLLPAIV VLGLNIWTTN DNALYASGLG FANITGMSSK
TLSVINGIIG TVCALWLYNN FVGWLTFLSA AIPPVGGVII ADYLMNRRRY EHFATTRMMS
VNWVAILAVA LGIAAGHWLP GIVPVNAVLG GALSYLILNP ILNRKTTAAM THVEANSVE
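
The gene sequence begins with GTG, yet the protein sequence begins with M. That is consistent with translation table 11, in which GTG is one of several permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of such a translation, not the pipeline that produced this record. The `translate` function and its `is_start` parameter are hypothetical names introduced for illustration, and only the first and last 30 bases of the gene sequence are used.

```python
from itertools import product

# Amino acids of NCBI translation table 11 in the conventional TCAG
# codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))

# Initiation codons allowed by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Hypothetical helper: `is_start` says whether the first codon is the
    initiation codon of the CDS (True) or an internal codon (False).
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last 30 bases of the gene sequence in this record.
first_30 = "GTGTCGCAAG ATAACAACTT TAGCCAGGGG"
last_30 = "ACGCATGTGG AGGCTAACAG TGTCGAATAA"

print(translate(first_30))                 # start of the protein
print(translate(last_30, is_start=False))  # end of the protein
```

The two fragments translate to MSQDNNFSQG and THVEANSVE, which match the first and last blocks of the protein sequence listed above.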