Gene COXBURSA331_A0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCOXBURSA331_A0438 
SymbolthiO 
ID5794079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCoxiella burnetii RSA 331 
KingdomBacteria 
Replicon accessionNC_010117 
Strand
Start bp374752 
End bp375774 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content41% 
IMG OID641329960 
Productglycine oxidase ThiO 
Protein accessionYP_001596279 
Protein GI161831570 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.017801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGA AAGTAGGCAT TGCGGGGGCA GGATTACTGG GGCGTTTATT AGCTTGGCAA 
TTAAGTAAAG TGGGCTTTGG GGTTACGCTA TTTGATAAAG ATGATAAAAG TGGTCAAAAG
AGCACGGCCT ATGCAGCGGC TGGGATGCTG TCACCCGTGG CTGAGTGTGA AATAGCAGAG
CAGATAATTT TTAATTTAGG AAGTTATTCA TTAAGGAAAT GGCCACTGTG GTTATCATCA
TTGAACCAAC CTGTTTATTT TAAACAAAAT GGAAGCATTG TAATTTCGCA TTCACACGAT
GAGGTAGAAA AAGAGCGCTG GTTGAAACAA ATAAGTCGTA AAATAAAAGA TTTCTCTCTT
GAAAAATTAT CATCTTCTGC ACTCCAACGA TTGGAGCCCG AATTAAATTT TGATGAAGGG
TATTATTTGC CGCAGGAAGC ACACCTAGAT TCACGTGCAC TTATGCAGAC CTTAGAAAAA
GAATTAAACG TGGAATGGCA TTCGAAAACC TTTGTGGAGA GCGTGGTTCC TTATCGTATC
TTAACGAAAG GAAAATCATA CCAATTTGAT TGCATATTCG ATTGTCGTGG CACAGGCGCA
GGAGAAATGT TTTCCGATTT GCGTTCGGTA CGTGGCGAGT TAATTTATTT GCATGCACCC
GATGTGCGTT TAAATCGTCC CATTCGATTA CTTCATCCGC GTTATCGACT TTATATTGTT
CCTCGCGCGC ATCATATTTA TCTTATTGGT GCGAGTGAAA TTGAGTCCAA TGATATTTCA
CCAATTTCTG TGCGTACGTG TTTGGAATTA TTATCGGCAG TTTATAGTGT ACACCCTGCA
TTTGCAGAAG CGCGGATCAT TGAAACGGTT ACCGCCCTAC GACCGGCGTT ATCGGATAAC
TTACCTCGTA TTCACTACCA GCCTGGATTA ATTGCTATTA ACGGATTATA CCGTCACGGT
TTTTTAGTAG CGCCAGCGTT AATTGATGAA GTTATTCACA ATCTTTCAAG AGGCATTAAA
TGA
 
Protein sequence
MKMKVGIAGA GLLGRLLAWQ LSKVGFGVTL FDKDDKSGQK STAYAAAGML SPVAECEIAE 
QIIFNLGSYS LRKWPLWLSS LNQPVYFKQN GSIVISHSHD EVEKERWLKQ ISRKIKDFSL
EKLSSSALQR LEPELNFDEG YYLPQEAHLD SRALMQTLEK ELNVEWHSKT FVESVVPYRI
LTKGKSYQFD CIFDCRGTGA GEMFSDLRSV RGELIYLHAP DVRLNRPIRL LHPRYRLYIV
PRAHHIYLIG ASEIESNDIS PISVRTCLEL LSAVYSVHPA FAEARIIETV TALRPALSDN
LPRIHYQPGL IAINGLYRHG FLVAPALIDE VIHNLSRGIK