Gene EcE24377A_0361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0361 
SymbolcodA 
ID5588598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp391762 
End bp393045 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content53% 
IMG OID640924086 
Productcytosine deaminase 
Protein accessionYP_001461513 
Protein GI157156161 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAAATA ACGCTTTACA AACAATTATT AACGCCCGGT TACCAGGCGA AGAGGGGCTG 
TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAATC CGGCGTGATG
CCCATAACTG AAAACAGCCT GGATGCCGAA CAAGGTTTAG TTATACCGCC GTTTGTGGAG
CCACATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC
ACGCTGTTTG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT
GTGAAACAAC GCGCCTGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG
CGTACCCATG TCGATGTTTC GGATGCAACG CTAACTGCGC TGAAAGCAAT GCTGGAAGTG
AAGCAGGAAG TCGCGCCGTG GATTGATCTG CAAATTGTCG CCTTCCCTCA GGAAGGGATT
TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA
GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA
ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT
GACGAGCAGT CGCGCTTTGT CGAAACCGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC
GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCCT ATAACGGGGC GTATACCTCA
CGCCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT
ATTCATCTGC AAGGACGTTT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA
GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG
TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG
TTGATGGGCT ACGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGTGCCAGA
ACATTGAATT TGCAGGATTA CGGCATTGCC GCCGGAAACA GCGCAAACCT GATTATCCTG
CCGGCTGAAA ATGGATTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGTTA TTCGGTACGT
GGCGGCAAGG TGATTGCCAG TACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCA
GAAGCCATCG ATTACAAACG GTAA
 
Protein sequence
MSNNALQTII NARLPGEEGL WQIHLQDGKI SAIDAQSGVM PITENSLDAE QGLVIPPFVE 
PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV
RTHVDVSDAT LTALKAMLEV KQEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV
VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG
ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK
EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR
TLNLQDYGIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP
EAIDYKR