Gene EcolC_3289 details

Gene Information

Locus tag: EcolC_3289
Symbol: codB
ID: 6066992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3600859
End bp: 3602118
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 54%
IMG OID: 641602704
Product: cytosine permease
Protein accession: YP_001726238
Protein GI: 170021284
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
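
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3600859
end_bp = 3602118
gene_length_bp = 1260
protein_length_aa = 419


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 3602118 - 3600859 + 1 = 1260 bp, and 1260 / 3 - 1 = 419 aa, which agrees with the listed values.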


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCAGTCCCGC AGTCGGCGCG GAAAGGGGTA 
TTGGCATTGA CGTTCGTCAT GCTGGGATTA ACCTTCTTTT CCGCCAGTAT GTGGACCGGC
GGCACTCTCG GAACCGGTCT TAGCTATCAT GATTTCTTCC TCGCAGTTCT CATCGGTAAT
CTTCTCCTCG GTATTTACAC TTCATTTCTC GGTTACATTG GCGCAAAAAC CGGCCTGACC
ACTCATCTTC TTGCTCGCTT CTCGTTTGGT GTTAAAGGCT CATGGCTGCC TTCACTGCTA
CTGGGCGGAA CTCAGGTTGG CTGGTTTGGC GTCGGTGTGG CGATGTTTGC CATTCCGGTG
GGTAAGGCAA CCGGGCTGGA TATTAATTTG CTGATTGCCG TTTCCGGTTT ACTGATGACC
GTCACCGTCT TTTTTGGCAT TTCGGCGCTG ACGGTTCTTT CGGTGATTGC GGTTCCGGCT
ATCGCCTGCC TGGGCGGTTA TTCCGTGTGG CTGGCTGTTA ACGGCATGGG CGGCCTGGAC
GCATTAAAAG CGGTCGTTCC CGCACAACCG TTAGATTTCA ATGTCGCGCT GGCGCTGGTT
GTGGGGTCAT TTATCAGTGC GGGTACGCTC ACCGCTGACT TTGTCCGGTT TGGTCGCAAT
GCCAAACTGG CGGTGCTGGT GGCGATGGTG GCCTTTTTCC TCGGCAACTC GTTGATGTTT
ATTTTCGGTG CAGCGGGCGC TGCGGCACTG GGGATGGCGG ATATCTCTGA TGTGATGATT
GCTCAGGGAC TGCTGCTGCC TGCGATTGTG GTGCTCGGGC TGAATATCTG GACCACCAAC
GATAACGCAC TCTATGCGTC GGGTTTAGGT TTCGCCAACA TTACCGGTAT GTCGAGCAAA
ACGCTTTCGG TGATCAACGG TATTATCGGT ACGGTCTGCG CATTATGGCT GTATAACAAT
TTTGTCGGCT GGCTGACCTT CCTTTCGGCA GCTATTCCTC CGGTGGGTGG CGTGATCATC
GCTGATTATC TGATGAACCG TCGCCGCTAT GAGCACTTTG CGACCACGCG TATGATGAGT
GTCAATTGGG TGGCGATTCT GGCGGTCGCG CTGGGGATTG CCGCAGGCCA CTGGTTACCG
GGAATTGTTC CGGTCAACGC GGTATTAGGT GGCGCGCTGA GCTATCTGAT CCTTAACCCG
ATTTTGAATC GTAAAACGAC AGCAGCAATG ACGCATGTGG AGGCTAACAG TGTCGAATAA
 
Protein sequence
MSQDNNFSQG PVPQSARKGV LALTFVMLGL TFFSASMWTG GTLGTGLSYH DFFLAVLIGN 
LLLGIYTSFL GYIGAKTGLT THLLARFSFG VKGSWLPSLL LGGTQVGWFG VGVAMFAIPV
GKATGLDINL LIAVSGLLMT VTVFFGISAL TVLSVIAVPA IACLGGYSVW LAVNGMGGLD
ALKAVVPAQP LDFNVALALV VGSFISAGTL TADFVRFGRN AKLAVLVAMV AFFLGNSLMF
IFGAAGAAAL GMADISDVMI AQGLLLPAIV VLGLNIWTTN DNALYASGLG FANITGMSSK
TLSVINGIIG TVCALWLYNN FVGWLTFLSA AIPPVGGVII ADYLMNRRRY EHFATTRMMS
VNWVAILAVA LGIAAGHWLP GIVPVNAVLG GALSYLILNP ILNRKTTAAM THVEANSVE
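
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustration under stated assumptions, not the method used to produce the record: it uses the codon assignments of translation table 11, which match the standard genetic code, and it treats a leading GTG, ATG or TTG as an initiation codon translated as methionine (only a subset of the initiation codons table 11 allows). The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard assignments,
# shared by translation table 11). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiation codons accepted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon at position 0 is read as methionine.
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCAGTCCCGC AGTCGGCGCG GAAAGGGGTA"
print(translate_cds(first_line))  # MSQDNNFSQGPVPQSARKGV
```

The output matches the first 20 residues of the protein sequence above. The gene begins with GTG rather than ATG, which is why the first residue is M even though GTG encodes valine at internal positions.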