Gene ECH74115_0409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0409 
SymbolcodB 
ID6968513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp415500 
End bp416759 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content54% 
IMG OID643384461 
Productcytosine permease 
Protein accessionYP_002268975 
Protein GI209396382 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCGGTCCCGC AGTCGGCGCG GAAAGGGGTA 
TTGGCATTGA CGTTCGTCAT GCTGGGATTA ACCTTCTTTT CCGCCAGTAT GTGGACCGGC
GGCACTCTCG GAACCGGTCT TAGCTATCAT GATTTCTTCC TCGCAGTTCT CATCGGTAAT
CTTCTCCTCG GTATTTACAC TTCATTTCTC GGTTACATTG GCGCAAAAAC CGGCCTGACC
ACTCATCTTC TTGCTCGCTT CTCGTTTGGT GTTAAAGGCT CATGGCTGCC TTCACTGCTG
CTGGGCGGCA CCCAGGTTGG CTGGTTTGGC GTCGGTGTGG CGATGTTTGC TATTCCGGTG
GGCAAGGCAA CCGGGCTGGA TATTAATTTG CTGATTGCCG TTTCCGGTTT ACTGATGACC
GTCACCGTCT TCTTTGGCAT TTCGGCGCTG ACGGTTCTTT CGTTGATTGC GGTTCCGGCT
ATCGCCTGTC TTGGCGGTTA TTCCGTGTGG CTGGCCGTTA ACGGCATGGG CGGCCTGGAC
GCCTTAAAAG CGGTCGTTCC CGCACAACCG TTAGATTTCA ATGTCGCGCT GGCGCTGGTT
GTGGGGTCAT TTATCAGTGC GGGCACGCTC ACCGCTGACT TTGTCCGCTT TGGTCGCAAT
GCCAAACTGG CGGTGCTGGT GGCGATGGTG GCCTTTTTCC TCGGCAACTC GTTGATGTTT
ATTTTCGGTG CAGCGGGCGC TGCGGCACTG GGGATGGCGG ATATCTCTGA TGTGATGATT
GCTCAGGGCC TGCTGTTGCC TGCGATTGTG GTGCTGGGGC TAAATATCTG GACCACCAAC
GATAACGCGC TCTATGCGTC GGGTTTAGGT TTCGCCAACA TTACCGGGAT GTCGAGCAAA
ACCCTTTCGG TAATCAACGG TATTATCGGT ACGGTCTGTG CATTATGGCT GTATAACAAT
TTTGTCGGCT GGTTGACCTT CCTTTCGGCA GCTATTCCTC CAGTGGGTGG CGTGATCATT
GCCGACTATC TGATGAACCG TCGCCGCTAT GAGCACTTTG CGACCACGCG TATGATGAGT
GTCAATTGGG TGGCGATTCT GGCGGTCGCC CTGGGGATTG CTGCAGGCCA CTGGTTACCG
GGAATTGTTC CGGTCAACGC GGTATTAGGT GGCGCGCTGA GCTATCTGAT CCTTAACCCG
ATTTTGAATC GTAAAACGAC AGCAGCAATG ACGCATGTGG AGGCTAACAG TGTCGAATAA
 
Protein sequence
MSQDNNFSQG PVPQSARKGV LALTFVMLGL TFFSASMWTG GTLGTGLSYH DFFLAVLIGN 
LLLGIYTSFL GYIGAKTGLT THLLARFSFG VKGSWLPSLL LGGTQVGWFG VGVAMFAIPV
GKATGLDINL LIAVSGLLMT VTVFFGISAL TVLSLIAVPA IACLGGYSVW LAVNGMGGLD
ALKAVVPAQP LDFNVALALV VGSFISAGTL TADFVRFGRN AKLAVLVAMV AFFLGNSLMF
IFGAAGAAAL GMADISDVMI AQGLLLPAIV VLGLNIWTTN DNALYASGLG FANITGMSSK
TLSVINGIIG TVCALWLYNN FVGWLTFLSA AIPPVGGVII ADYLMNRRRY EHFATTRMMS
VNWVAILAVA LGIAAGHWLP GIVPVNAVLG GALSYLILNP ILNRKTTAAM THVEANSVE