Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0384 |
Symbol | |
ID | 5594991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 402147 |
End bp | 403391 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640919569 |
Product | putative deaminase |
Protein accession | YP_001457155 |
Protein GI | 157159837 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA CTGACCCGCA TTACTATCTC GATAACGTGC TGCTGGAAAC CGGTTTTGAC TACGAAAACG GTGTGGCGGT GCAGACCCGC ACGGCGCGCC AGACCGTGGA GATTCAGGAC GGCAAAATTG TCGCCCTGCG CGAGAACAAG CAGCATCCGG ACGCCACGCT GCCGCACTAT GACGCTGGCG GTAAGTTGAT GCTACCCACC ACTCGCGACA TGCATATTCA TCTCGACAAA ACCTTCTACG GCGGGCCGTG GCGCTCGCTC AATCGTCCGG CAGGCACCAC TATCCAGGAC ATGATCAAAC TCGAGCAGAA AATGCTGCCG GAGTTACAAC CGTATACGCA GGAGCGGGCG GAAAAACTGA TCGATCTATT GCAGTCGAAA GGCACCACCA TTGCCCGCAG CCACTGCAAT ATCGAACCGG TTTCCGGCCT GAAAAATCTG CAGAATTTGC AGGCGGTGCT GGCGCGACGT CAGGCGGGCT TCGAGTGTGA AATCGTCGCC TTCCCGCAGC ACGGTTTGCT GCTGTCGAAA TCGGAACCCT TAATGCGCGA AGCGATGCAG GCGGGTGCGC ATTACGTCGG CGGGCTGGAC CCGACCAGTG TTGATGGCGC GATGGAAAAA TCCCTCGACA CCATGTTTCA GATTGCGCTG GACTACGACA AAGGTGTCGA TATTCACCTG CACGAAACCA CTCCGTCGGG CGTGGCAGCC ATCAATTATA TGGTTGAAAC GGTAGAGAAA ACGCCGCAAC TGAAAGGTAA GCTGACCATC AGCCACGCCT TTGCGCTGGC TACGCTCAAC GAACAACAGG TCGATGCGCT GGCGAATCGG ATGGCGGCGC AGCAAATTTC TATCGCCTCG ACGGTGCCGA TTGGCACGCT GCATATGCCG CTCAAACAGT TGCACGACAA AGGCGTAAAA GTGATGACTG GCACTGACAG CGTTATCGAC CACTGGTCGC CTTATGGTCT GGGCGACATG CTGGAAAAAG CCAATCTCTA CGCGCAGCTC TATATTCGTC CTAACGAACA GAACCTCTCC CGTTCGCTGT TTCTAGCCAC TGGCGATGTA TTGCCGCTGA ATGAAAAAGG CGAGCGTGTA TGGCCAAAAG CGCAGGATGA TGCCAGCTTT GTGCTGGTGG ACGCCTCCTG TTCCGCCGAG GCGGTGGCGC GTATCTCGCC GAGAACCGCA ACGTTCCATA AAGGGCAACT GGTGTGGGGG AGTGTGGCAG GTTGA
|
Protein sequence | MKITDPHYYL DNVLLETGFD YENGVAVQTR TARQTVEIQD GKIVALRENK QHPDATLPHY DAGGKLMLPT TRDMHIHLDK TFYGGPWRSL NRPAGTTIQD MIKLEQKMLP ELQPYTQERA EKLIDLLQSK GTTIARSHCN IEPVSGLKNL QNLQAVLARR QAGFECEIVA FPQHGLLLSK SEPLMREAMQ AGAHYVGGLD PTSVDGAMEK SLDTMFQIAL DYDKGVDIHL HETTPSGVAA INYMVETVEK TPQLKGKLTI SHAFALATLN EQQVDALANR MAAQQISIAS TVPIGTLHMP LKQLHDKGVK VMTGTDSVID HWSPYGLGDM LEKANLYAQL YIRPNEQNLS RSLFLATGDV LPLNEKGERV WPKAQDDASF VLVDASCSAE AVARISPRTA TFHKGQLVWG SVAG
|
| |