Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | COXBURSA331_A0634 |
Symbol | |
ID | 5794065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Coxiella burnetii RSA 331 |
Kingdom | Bacteria |
Replicon accession | NC_010117 |
Strand | - |
Start bp | 542282 |
End bp | 543637 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641330136 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_001596452 |
Protein GI | 161829728 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATG TTGATCTACT CATTAATGCG CGCTGGCTTC TTCCCATCGC TCCTGCTAAT CAAATTTTAG AAAATTTCGC ATTAGCCGTG CGCGATGAAT ACATTGTTGA TCTTCTTCCG CAGGCTGAAG CTAACAAAAA ATACACGGCC GATCAGCACC TCGAACTTAA CGATCATGTT GTCCTACCGG GGTTGGTTAA TGCTCATACC CATACTCCGA TGAACCTCTT TCGGGGGTTG GCTGATGATT TGCAATTACT GGATTGGTTG CAAAACCACA TCTGGCCAGC CGAAAAAGCC CTCATTAATG CTGAATCCGT TCGGGCTGGC ACGCGGCTTG CTATTGCCGA AATGTTACGC GGCGGTACGA CTTGTTTCAA CGATCATTAT TTTTTCCACG ACACAATCGC CAAAGCCGCC AGTGAAGCTG GTATGCGGGC GCTTATCGGA GTCGTAATAA TGAGCGTTCC CACGGAATGG GCTAGTGATG AAAAAGCTTA TTTAGCGCGC GCCCAAGAAA CATTGGAAAA AGCAGAAAAT CATTCGCTGA TCACCTGGGC GCTTGCCCCG CATGCCCCTT ATACCGTTAG TGACACCGCG TTTAAGGAAA TTAAAAAATT AGCTGAATAC TACGACCTAC CCATTCATAT ACACCTTCAT GAAACGAAGG TAGAGATTGA ACAAGGCTTA AAAAGCTATG GAAAAAGACC GCTCGCCCAT TTACATGACT TAGGGTTGCT GTCACAACGG CTTATAGCTG TCCATATGAC GCAGTTAACT TCGGAAGAAA TTAAATTAGT TGCGGATACT CAAACGAATA TCGTTCACTG CCCCGAATCT AATTTAAAAT TGAGCAGCGG CATTGCCCCT ATTGCAAAAT TGGTAGATGC CGGCGTTAAT GTAGCGATTG GCACTGACGG TGCGGCGAGC AATAACGACC TCGATTTATT CGGTGAAATG CGAACGGCTT CTTTCACGGC AAAAGTTTCC GGCCTCGACC CCACGCACTT ACCCGCTCCT GAAATTTTGA AAATGGCGAC GCTCAATGGC GCCAAAGCGC TGGGGCTAGA AGATAAAATC GGCTCACTCG AGCCGGGAAA ATTTGCCGAT GTCATTGCGG TGGATTTAAG TTCTTTTCTC ACCCAACCTG TTTTTAATCC GGTTTCTCAT TTGGTATACG CCATTAACCG TCTGCAAGTG AGCGACGTGT GGGTCGCGGG CAAACAATTG CTCAAAGGGG GGGAATTTAC CCAACTTGAT ACTGAACAAA TTGTCAAAGA CAGTTTAAAA TGGGCAAAAA AAGCGTTGCC TTTCAAAGCA GAAAACAGGC TTGCAGAAAC GAATGCCATT ACCTAA
|
Protein sequence | MENVDLLINA RWLLPIAPAN QILENFALAV RDEYIVDLLP QAEANKKYTA DQHLELNDHV VLPGLVNAHT HTPMNLFRGL ADDLQLLDWL QNHIWPAEKA LINAESVRAG TRLAIAEMLR GGTTCFNDHY FFHDTIAKAA SEAGMRALIG VVIMSVPTEW ASDEKAYLAR AQETLEKAEN HSLITWALAP HAPYTVSDTA FKEIKKLAEY YDLPIHIHLH ETKVEIEQGL KSYGKRPLAH LHDLGLLSQR LIAVHMTQLT SEEIKLVADT QTNIVHCPES NLKLSSGIAP IAKLVDAGVN VAIGTDGAAS NNDLDLFGEM RTASFTAKVS GLDPTHLPAP EILKMATLNG AKALGLEDKI GSLEPGKFAD VIAVDLSSFL TQPVFNPVSH LVYAINRLQV SDVWVAGKQL LKGGEFTQLD TEQIVKDSLK WAKKALPFKA ENRLAETNAI T
|
| |