Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0409 |
Symbol | codB |
ID | 6968513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 415500 |
End bp | 416759 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384461 |
Product | cytosine permease |
Protein accession | YP_002268975 |
Protein GI | 209396382 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1457] Purine-cytosine permease and related proteins |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCGGTCCCGC AGTCGGCGCG GAAAGGGGTA TTGGCATTGA CGTTCGTCAT GCTGGGATTA ACCTTCTTTT CCGCCAGTAT GTGGACCGGC GGCACTCTCG GAACCGGTCT TAGCTATCAT GATTTCTTCC TCGCAGTTCT CATCGGTAAT CTTCTCCTCG GTATTTACAC TTCATTTCTC GGTTACATTG GCGCAAAAAC CGGCCTGACC ACTCATCTTC TTGCTCGCTT CTCGTTTGGT GTTAAAGGCT CATGGCTGCC TTCACTGCTG CTGGGCGGCA CCCAGGTTGG CTGGTTTGGC GTCGGTGTGG CGATGTTTGC TATTCCGGTG GGCAAGGCAA CCGGGCTGGA TATTAATTTG CTGATTGCCG TTTCCGGTTT ACTGATGACC GTCACCGTCT TCTTTGGCAT TTCGGCGCTG ACGGTTCTTT CGTTGATTGC GGTTCCGGCT ATCGCCTGTC TTGGCGGTTA TTCCGTGTGG CTGGCCGTTA ACGGCATGGG CGGCCTGGAC GCCTTAAAAG CGGTCGTTCC CGCACAACCG TTAGATTTCA ATGTCGCGCT GGCGCTGGTT GTGGGGTCAT TTATCAGTGC GGGCACGCTC ACCGCTGACT TTGTCCGCTT TGGTCGCAAT GCCAAACTGG CGGTGCTGGT GGCGATGGTG GCCTTTTTCC TCGGCAACTC GTTGATGTTT ATTTTCGGTG CAGCGGGCGC TGCGGCACTG GGGATGGCGG ATATCTCTGA TGTGATGATT GCTCAGGGCC TGCTGTTGCC TGCGATTGTG GTGCTGGGGC TAAATATCTG GACCACCAAC GATAACGCGC TCTATGCGTC GGGTTTAGGT TTCGCCAACA TTACCGGGAT GTCGAGCAAA ACCCTTTCGG TAATCAACGG TATTATCGGT ACGGTCTGTG CATTATGGCT GTATAACAAT TTTGTCGGCT GGTTGACCTT CCTTTCGGCA GCTATTCCTC CAGTGGGTGG CGTGATCATT GCCGACTATC TGATGAACCG TCGCCGCTAT GAGCACTTTG CGACCACGCG TATGATGAGT GTCAATTGGG TGGCGATTCT GGCGGTCGCC CTGGGGATTG CTGCAGGCCA CTGGTTACCG GGAATTGTTC CGGTCAACGC GGTATTAGGT GGCGCGCTGA GCTATCTGAT CCTTAACCCG ATTTTGAATC GTAAAACGAC AGCAGCAATG ACGCATGTGG AGGCTAACAG TGTCGAATAA
|
Protein sequence | MSQDNNFSQG PVPQSARKGV LALTFVMLGL TFFSASMWTG GTLGTGLSYH DFFLAVLIGN LLLGIYTSFL GYIGAKTGLT THLLARFSFG VKGSWLPSLL LGGTQVGWFG VGVAMFAIPV GKATGLDINL LIAVSGLLMT VTVFFGISAL TVLSLIAVPA IACLGGYSVW LAVNGMGGLD ALKAVVPAQP LDFNVALALV VGSFISAGTL TADFVRFGRN AKLAVLVAMV AFFLGNSLMF IFGAAGAAAL GMADISDVMI AQGLLLPAIV VLGLNIWTTN DNALYASGLG FANITGMSSK TLSVINGIIG TVCALWLYNN FVGWLTFLSA AIPPVGGVII ADYLMNRRRY EHFATTRMMS VNWVAILAVA LGIAAGHWLP GIVPVNAVLG GALSYLILNP ILNRKTTAAM THVEANSVE
|
| |