Gene Ent638_3792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3792 
Symbol 
ID5110836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4087175 
End bp4088476 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content55% 
IMG OID640494001 
Productcytosine deaminase 
Protein accessionYP_001178498 
Protein GI146313424 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0183324 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAT CACCGCTTTG GCTGGTTCAG AACGTTCGGT TACCGCAGCA AGATGGGTTA 
TGGCAAATCG CCATTGAGAA TGGCCGTTTT GGCGAAATTA CCCCGATGGG TGATGCGCCC
GATGAGAGCC ACGAAGTCCT CAACGCCCGG GGCGGCCTGG CGATTCCGCC GTTTATTGAA
CCGCATATTC ATCTCGATAC CACTCAGACG GCCGGTGAGC CGAACTGGAA CCAGTCTGGA
ACCCTTTTTG AGGGCATCGA ACGCTGGGCG GAACGTAAAG CGTTGCTCAG CCATGACGAT
GTCAAAGCGC GTGCGTGGAA GACGTTGAAA TGGCAAATGG CCAACGGCAT TCAGTTTGTT
CGCACTCACG TTGACGTTTC TGACCCTACG CTGACGGCGC TAAAAGCGAT GCTGGAAGTG
AAGCAGGAGG TGGCACCGTG GATAACGCTG CAAATCGTTG CTTTCCCGCA GGAGGGGATT
CTTTCCTATC CCAACGGTGC GGCGCTGCTT GAAGAAGCGT TACAGCTCGG CGCAGACGTG
GTCGGCGCCA TCCCGCATTT TGAATTCACC CGCGAATACG GCGTGCAGTC GCTGCACATC
GCGTTTGAAC TGGCGAAAAA ATATGATCGC CCGCTGGATA TTCACTGCGA CGAAATTGAC
GATGAGCAGT CACGATTTGT CGAAACGGTT GCCACGCTGG CCTACGAGGC AGGGATTGGA
TCGCGCGTCA CCGCCAGCCA CACCACCGCG ATGCACTCCT ACAATGGGGC ATACACCTCG
CGGCTGTTCC GCTTGCTGAA AATGTCCGGC ATCAACTTTG TCGCTAACCC ACTGGTGAAC
ATTCATTTGC AGGGGCGTTT TGACGATTAC CCGAAACGCC GTGGGATTAC GCGCGTGAAA
GAGTTACAGG AAGCGGGTAT CAACGTCTGC TTCGGTCATG ACGATGTTTT CGACCCGTGG
TATCCGCTGG GGACGGGTAA CATGCTGCAA GTGCTGCACA TGGGGCTACA CGTCTGTCAG
ATGATGGGTT ATCAGCAGAT CGACAGCGGA CTGAATTTGA TTACCCATAA CAGCGCGCGC
ACGTTTGGGC TGACGGATTA CGGTATCAAA ACCGGTAACC CGGCGAATCT GATCATATTG
CCTGCGGAAA GTGGTTTTGA CGCGGTGCGC TGCCAGGTGC CGGTGCGCTG GTCGATTCGT
CAGGGAAGAG TGATTGCGAC GACGCAGCTG GCGCAAACCT GGATTCAGAC GGATAGCGGG
GGAGAAGAGG TGAGTTTTAG TCAAAAACAG CCCCTTCGCT GA
 
Protein sequence
MSTSPLWLVQ NVRLPQQDGL WQIAIENGRF GEITPMGDAP DESHEVLNAR GGLAIPPFIE 
PHIHLDTTQT AGEPNWNQSG TLFEGIERWA ERKALLSHDD VKARAWKTLK WQMANGIQFV
RTHVDVSDPT LTALKAMLEV KQEVAPWITL QIVAFPQEGI LSYPNGAALL EEALQLGADV
VGAIPHFEFT REYGVQSLHI AFELAKKYDR PLDIHCDEID DEQSRFVETV ATLAYEAGIG
SRVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDDY PKRRGITRVK
ELQEAGINVC FGHDDVFDPW YPLGTGNMLQ VLHMGLHVCQ MMGYQQIDSG LNLITHNSAR
TFGLTDYGIK TGNPANLIIL PAESGFDAVR CQVPVRWSIR QGRVIATTQL AQTWIQTDSG
GEEVSFSQKQ PLR