Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3792 |
Symbol | |
ID | 5110836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4087175 |
End bp | 4088476 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640494001 |
Product | cytosine deaminase |
Protein accession | YP_001178498 |
Protein GI | 146313424 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0183324 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAT CACCGCTTTG GCTGGTTCAG AACGTTCGGT TACCGCAGCA AGATGGGTTA TGGCAAATCG CCATTGAGAA TGGCCGTTTT GGCGAAATTA CCCCGATGGG TGATGCGCCC GATGAGAGCC ACGAAGTCCT CAACGCCCGG GGCGGCCTGG CGATTCCGCC GTTTATTGAA CCGCATATTC ATCTCGATAC CACTCAGACG GCCGGTGAGC CGAACTGGAA CCAGTCTGGA ACCCTTTTTG AGGGCATCGA ACGCTGGGCG GAACGTAAAG CGTTGCTCAG CCATGACGAT GTCAAAGCGC GTGCGTGGAA GACGTTGAAA TGGCAAATGG CCAACGGCAT TCAGTTTGTT CGCACTCACG TTGACGTTTC TGACCCTACG CTGACGGCGC TAAAAGCGAT GCTGGAAGTG AAGCAGGAGG TGGCACCGTG GATAACGCTG CAAATCGTTG CTTTCCCGCA GGAGGGGATT CTTTCCTATC CCAACGGTGC GGCGCTGCTT GAAGAAGCGT TACAGCTCGG CGCAGACGTG GTCGGCGCCA TCCCGCATTT TGAATTCACC CGCGAATACG GCGTGCAGTC GCTGCACATC GCGTTTGAAC TGGCGAAAAA ATATGATCGC CCGCTGGATA TTCACTGCGA CGAAATTGAC GATGAGCAGT CACGATTTGT CGAAACGGTT GCCACGCTGG CCTACGAGGC AGGGATTGGA TCGCGCGTCA CCGCCAGCCA CACCACCGCG ATGCACTCCT ACAATGGGGC ATACACCTCG CGGCTGTTCC GCTTGCTGAA AATGTCCGGC ATCAACTTTG TCGCTAACCC ACTGGTGAAC ATTCATTTGC AGGGGCGTTT TGACGATTAC CCGAAACGCC GTGGGATTAC GCGCGTGAAA GAGTTACAGG AAGCGGGTAT CAACGTCTGC TTCGGTCATG ACGATGTTTT CGACCCGTGG TATCCGCTGG GGACGGGTAA CATGCTGCAA GTGCTGCACA TGGGGCTACA CGTCTGTCAG ATGATGGGTT ATCAGCAGAT CGACAGCGGA CTGAATTTGA TTACCCATAA CAGCGCGCGC ACGTTTGGGC TGACGGATTA CGGTATCAAA ACCGGTAACC CGGCGAATCT GATCATATTG CCTGCGGAAA GTGGTTTTGA CGCGGTGCGC TGCCAGGTGC CGGTGCGCTG GTCGATTCGT CAGGGAAGAG TGATTGCGAC GACGCAGCTG GCGCAAACCT GGATTCAGAC GGATAGCGGG GGAGAAGAGG TGAGTTTTAG TCAAAAACAG CCCCTTCGCT GA
|
Protein sequence | MSTSPLWLVQ NVRLPQQDGL WQIAIENGRF GEITPMGDAP DESHEVLNAR GGLAIPPFIE PHIHLDTTQT AGEPNWNQSG TLFEGIERWA ERKALLSHDD VKARAWKTLK WQMANGIQFV RTHVDVSDPT LTALKAMLEV KQEVAPWITL QIVAFPQEGI LSYPNGAALL EEALQLGADV VGAIPHFEFT REYGVQSLHI AFELAKKYDR PLDIHCDEID DEQSRFVETV ATLAYEAGIG SRVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDDY PKRRGITRVK ELQEAGINVC FGHDDVFDPW YPLGTGNMLQ VLHMGLHVCQ MMGYQQIDSG LNLITHNSAR TFGLTDYGIK TGNPANLIIL PAESGFDAVR CQVPVRWSIR QGRVIATTQL AQTWIQTDSG GEEVSFSQKQ PLR
|
| |