Gene EcolC_3288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3288 
Symbol 
ID6066989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3599586 
End bp3600869 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content54% 
IMG OID641602703 
Productcytosine deaminase 
Protein accessionYP_001726237 
Protein GI170021283 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGAATA ACGCTTTACA TACAATTATT AACGCCCGGT TACCCGGCAA AGAGGGGCTG 
TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAAGC CGGCGTGATG
CCCGTAACTG AAAACAGCCT TGATGCCGAA CAAGGTTTAG TTATACCGCC ATTTGTGGAG
CCGCATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC
ACGCTGTTCG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT
GTGAAACAAC GCGCATGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG
CGTACCCATG TCGATGTTTC GGATGCAACG CTGACTGCGC TGAAAGCAAT GCTGGAAGTG
AAGCAGGAAG TCGCGCCGTG GATTGATCTG CAAATCGTCG CCTTCCCTCA GGAAGGGATT
TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA
GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA
ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT
GACGAGCAGT CGCGCTTTGT CGAAACTGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC
GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCCT ATAACGGGGC GTATACCTCA
CGCCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT
ATTCATCTGC AAGGACGTTT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA
GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG
TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG
TTGATGGGCT ACGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGTGCCAGA
ACGTTGAATT TGCAGGATTA CGGCATTGCC GCCGGAAACA GCGCAAACCT GATTATCCTG
CCGGCTGAAA ATGGATTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGCTA TTCGGTACGT
GGCGGCAAGG TGATTGCCAG CACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCA
GAAGCCATCG ATTACAAACG TTGA
 
Protein sequence
MSNNALHTII NARLPGKEGL WQIHLQDGKI SAIDAQAGVM PVTENSLDAE QGLVIPPFVE 
PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV
RTHVDVSDAT LTALKAMLEV KQEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV
VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG
ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK
EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR
TLNLQDYGIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP
EAIDYKR