Gene COXBURSA331_A0634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCOXBURSA331_A0634 
Symbol 
ID5794065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCoxiella burnetii RSA 331 
KingdomBacteria 
Replicon accessionNC_010117 
Strand
Start bp542282 
End bp543637 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content46% 
IMG OID641330136 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_001596452 
Protein GI161829728 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATG TTGATCTACT CATTAATGCG CGCTGGCTTC TTCCCATCGC TCCTGCTAAT 
CAAATTTTAG AAAATTTCGC ATTAGCCGTG CGCGATGAAT ACATTGTTGA TCTTCTTCCG
CAGGCTGAAG CTAACAAAAA ATACACGGCC GATCAGCACC TCGAACTTAA CGATCATGTT
GTCCTACCGG GGTTGGTTAA TGCTCATACC CATACTCCGA TGAACCTCTT TCGGGGGTTG
GCTGATGATT TGCAATTACT GGATTGGTTG CAAAACCACA TCTGGCCAGC CGAAAAAGCC
CTCATTAATG CTGAATCCGT TCGGGCTGGC ACGCGGCTTG CTATTGCCGA AATGTTACGC
GGCGGTACGA CTTGTTTCAA CGATCATTAT TTTTTCCACG ACACAATCGC CAAAGCCGCC
AGTGAAGCTG GTATGCGGGC GCTTATCGGA GTCGTAATAA TGAGCGTTCC CACGGAATGG
GCTAGTGATG AAAAAGCTTA TTTAGCGCGC GCCCAAGAAA CATTGGAAAA AGCAGAAAAT
CATTCGCTGA TCACCTGGGC GCTTGCCCCG CATGCCCCTT ATACCGTTAG TGACACCGCG
TTTAAGGAAA TTAAAAAATT AGCTGAATAC TACGACCTAC CCATTCATAT ACACCTTCAT
GAAACGAAGG TAGAGATTGA ACAAGGCTTA AAAAGCTATG GAAAAAGACC GCTCGCCCAT
TTACATGACT TAGGGTTGCT GTCACAACGG CTTATAGCTG TCCATATGAC GCAGTTAACT
TCGGAAGAAA TTAAATTAGT TGCGGATACT CAAACGAATA TCGTTCACTG CCCCGAATCT
AATTTAAAAT TGAGCAGCGG CATTGCCCCT ATTGCAAAAT TGGTAGATGC CGGCGTTAAT
GTAGCGATTG GCACTGACGG TGCGGCGAGC AATAACGACC TCGATTTATT CGGTGAAATG
CGAACGGCTT CTTTCACGGC AAAAGTTTCC GGCCTCGACC CCACGCACTT ACCCGCTCCT
GAAATTTTGA AAATGGCGAC GCTCAATGGC GCCAAAGCGC TGGGGCTAGA AGATAAAATC
GGCTCACTCG AGCCGGGAAA ATTTGCCGAT GTCATTGCGG TGGATTTAAG TTCTTTTCTC
ACCCAACCTG TTTTTAATCC GGTTTCTCAT TTGGTATACG CCATTAACCG TCTGCAAGTG
AGCGACGTGT GGGTCGCGGG CAAACAATTG CTCAAAGGGG GGGAATTTAC CCAACTTGAT
ACTGAACAAA TTGTCAAAGA CAGTTTAAAA TGGGCAAAAA AAGCGTTGCC TTTCAAAGCA
GAAAACAGGC TTGCAGAAAC GAATGCCATT ACCTAA
 
Protein sequence
MENVDLLINA RWLLPIAPAN QILENFALAV RDEYIVDLLP QAEANKKYTA DQHLELNDHV 
VLPGLVNAHT HTPMNLFRGL ADDLQLLDWL QNHIWPAEKA LINAESVRAG TRLAIAEMLR
GGTTCFNDHY FFHDTIAKAA SEAGMRALIG VVIMSVPTEW ASDEKAYLAR AQETLEKAEN
HSLITWALAP HAPYTVSDTA FKEIKKLAEY YDLPIHIHLH ETKVEIEQGL KSYGKRPLAH
LHDLGLLSQR LIAVHMTQLT SEEIKLVADT QTNIVHCPES NLKLSSGIAP IAKLVDAGVN
VAIGTDGAAS NNDLDLFGEM RTASFTAKVS GLDPTHLPAP EILKMATLNG AKALGLEDKI
GSLEPGKFAD VIAVDLSSFL TQPVFNPVSH LVYAINRLQV SDVWVAGKQL LKGGEFTQLD
TEQIVKDSLK WAKKALPFKA ENRLAETNAI T