Gene COXBURSA331_A1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCOXBURSA331_A1229 
Symbol 
ID5794110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCoxiella burnetii RSA 331 
KingdomBacteria 
Replicon accessionNC_010117 
Strand
Start bp1111522 
End bp1112988 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content50% 
IMG OID641330665 
Productputative carbohydrate kinase 
Protein accessionYP_001596964 
Protein GI161831578 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGTAT TGTACCAAAA CCGCCAAATT CGCGAATTAG AGCGCTTGGC AGTTGAATCG 
GGCATTAGTG AATATGAATT AATGTGTCGC GCCGGAGAGG CCGCTTTTAA AGCGCTTCTA
GCGCGATGGC CCGAGGCTCA AGAAATTACC GTATGTTGTG GGAAGGGGAA TAACGGGGGC
GACGGGCTAG TGCTTGCGCG CTTGGCTTAC GAAAACGGAT TGAAAGTAAC CGTCTATTTG
GCCGGGCAAC GCCACCAACT GAAAGGTGCG GCTGCTCAAG CCGCTAATGC CTGTGAAGCT
TCTAATCTTC CCATTTTACC TTTCCCGGAG CCGCTTCTTT TCAAAGGAGA AGTGATCGTG
GATGCGCTTT TAGGCAGCGG ACTTTCTGGA GAAGTGAAAG CACCTTATGA TCATCTCATT
GCCGCCATTA ACCAGGCAGG ACAATACGTA TTAGCCCTTG ATGTGCCTTC GGGAATTAAC
GTTGACTCTG GAGAGGTCCA GGGGACGGCT GTAAAAGCCA ATTTAACGGT CACCTTCATC
GCTCCCAAAA GAGGTTTGTA TACCGATAAA GCGCCTGCCT ATTGTGGCGA GTTGATCGTG
GATCGCTTAG GGCTTTCGGA GTCCTTTTTT CGGGCTGTCT TTACCGATAC CCGTTTATTG
GAATGGAAAG GGGTGTTTCC CTTGTTACCT AAACGAGCGC GTGATGCCCA TAAAGGCAGC
TATGGGCACG TTTTGGTGAT CGGTGGTGAT TATGGTATGG GCGGAGCCGT ACGGATGGCT
GCGGAAGCCG CCGCACGTGT CGGCGCTGGA CTAGTGACGG TCGCCACGCG CCCAGAACAT
GTCCCTATCG TCAGCGGTCC GCGGCCCGAA TTAATGTGCC ACCAAGTGGC TGCTGCGGAT
GATTTAAAGC CGCTACTTAC TGCGGCGACT GTCGTGGTGA TTGGACCCGG TCTAGGGAAG
TCTGATTGGG CAAAATCTTT ATTAAACAAA GTATTAGAAA CAGATCTTCC TAAAGTACTT
GATGCTGATA GTTTAAACTT ACTCGCAGAG TCGCCCTCTC AACGAGAGGA TTGGATATTA
ACACCTCATC CCGGCGAAGC TTCCCGATTG TTGGGAATTT CTTGCAATGA AGTTCAACGC
GATCGTTTCC AAGCTATCAA CGACTTGCAA GAAAAATACC AGGGTGTCCT TGTGCTTAAA
GGGGTTGGGA CACTTATTAA AGATGAAAGC CAAGCCTATT ATGTGTGTCC AGCCGGCAAT
CCTGGTATGG CGACAGGAGG GATGGGTGAT ATTTTAAGTG GCATCATCGG TGGGTTGGTC
GCTCAAAGAT TGAGCTTAGC ATCAGCCGCT CAAGCCGGTG TTTTTATTCA CTCCATGGCC
GCTGATCGCG CAGCAGAGGA GGGAGGCGAG CGGGGATTAT TAGCCACTGA TTTATTTCCT
CATTTACGGG TTTTAGTGAA TCCTTAA
 
Protein sequence
MTVLYQNRQI RELERLAVES GISEYELMCR AGEAAFKALL ARWPEAQEIT VCCGKGNNGG 
DGLVLARLAY ENGLKVTVYL AGQRHQLKGA AAQAANACEA SNLPILPFPE PLLFKGEVIV
DALLGSGLSG EVKAPYDHLI AAINQAGQYV LALDVPSGIN VDSGEVQGTA VKANLTVTFI
APKRGLYTDK APAYCGELIV DRLGLSESFF RAVFTDTRLL EWKGVFPLLP KRARDAHKGS
YGHVLVIGGD YGMGGAVRMA AEAAARVGAG LVTVATRPEH VPIVSGPRPE LMCHQVAAAD
DLKPLLTAAT VVVIGPGLGK SDWAKSLLNK VLETDLPKVL DADSLNLLAE SPSQREDWIL
TPHPGEASRL LGISCNEVQR DRFQAINDLQ EKYQGVLVLK GVGTLIKDES QAYYVCPAGN
PGMATGGMGD ILSGIIGGLV AQRLSLASAA QAGVFIHSMA ADRAAEEGGE RGLLATDLFP
HLRVLVNP