Gene CBUD_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCBUD_0449 
Symbol 
ID5459199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCoxiella burnetii Dugway 5J108-111 
KingdomBacteria 
Replicon accessionNC_009727 
Strand
Start bp430207 
End bp431577 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content46% 
IMG OID 
Productcarboxy-terminal processing protease precursor 
Protein accessionYP_001423867 
Protein GI154706287 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTAA AAAGAAAGAT TATAGCTATC GTAGTTGCTG CCTTCATCAG TCCGGGACTT 
ACCACCGTAT TCGCTTTTTC TCCTCCTTTT TTACCGAAAA CGCTGATCTC CACTCCAGCA
GAAGATAAAA ACGATGAGCT CTCCAGAAAA GACGTCGAAC GTTTCGTGAC GGCCATTGCA
TTGGTCCATC AGTATTACAT TAAAAATGTG AGTAATAAAA AATTACTCGA CAGCGCAATT
AGCGGCATGA TGGCCAACCT CGACCCACAT TCTAGTTATC TCGACAACAA CGACTTGAAA
GAATTGAAAA CCACCGTCTC TGGAGAGTTT GTGGGCGTTG GCATCGAGCT CACGGTCTCC
AAAGACGGTC TTTTAAAAGT CATCAGCCCG CTGGAAGATT CCCCCGCCGC GCGCGCGGGC
ATCCAACCCA ACGATTATAT TGTTAAAATT GACGACCAAT TAGTCCAAAA CATGAGTCTT
CCGGAAGCGG TGAGCCGAAT TAAAGGCAAA AAAGAGACGA CCGTCAAGTT AACGGTTTTA
CGCAAAAGTG CAAATAAGCC TTTAATTTTT TCGATTCAAC GTGAACCCAT TCATTTGGTT
AGTGTAAAAA GCAAAACTTT AGAACCCGGT TACGGTTATG TCCGAATCAC TTTCTTCCAA
GGGCCCGTGG AAAACCAGTT GCGTGATGCG ATTGATAAAT TGAAAAAAGA ATCGCAAGGT
CCTTTGAAAG GTTTAGTCCT CGATCTGCGT AATAATCCCG GCGGCCTGCT CGATGTCAGC
GCCCAAGTGG CGGACAGTTT CCTTGATGCG AGTAAGATGC ACCGCTATAA CGACCTCATC
GTTTACACAA AAGGACGCGT TCCGGGTGCC GATATTCAAA TCAAAGCGAC GCCTGGCGAT
CTCATTCCCC ACACACCGAT GGTCGTACTG ATCAACGGCG GATCGGCCTC TGCTTCAGAA
ATTGTGGCTG GCGCTCTTCA AGATTACAAA CGCGCTATTA TCATGGGAAC ACCCAGCTTC
GGGAAAGGGT CGGTCCAAAC CGTTTTACCC ATTGGGAAAG AGGACGCGAT TAAACTAACG
ACTGCTTTGT ATTACACCCC GGCAGGCCGC GAAATTCAGG CCAAAGGCAT TATACCGAAT
GTTGCGGTTC CGGAATTCAG TATTACGCCT CCTAAATCAC AGTTAACATT GGATGAAGCC
GATTTCCAAA ACCATTTGCC CAATGACGGC GCGGCTTCCA CTAAGGCAAA TCCCACAACG
GCCGAAGAAG AGAAAAATTT ATTACAAACC CAACTGCAAT TGGCGAAAAC CGATTATCAG
CTATATCAAG CTTTAATGAT GTTACAAGGT CTTCAGGTGG TTAAGCATTA G
 
Protein sequence
MSLKRKIIAI VVAAFISPGL TTVFAFSPPF LPKTLISTPA EDKNDELSRK DVERFVTAIA 
LVHQYYIKNV SNKKLLDSAI SGMMANLDPH SSYLDNNDLK ELKTTVSGEF VGVGIELTVS
KDGLLKVISP LEDSPAARAG IQPNDYIVKI DDQLVQNMSL PEAVSRIKGK KETTVKLTVL
RKSANKPLIF SIQREPIHLV SVKSKTLEPG YGYVRITFFQ GPVENQLRDA IDKLKKESQG
PLKGLVLDLR NNPGGLLDVS AQVADSFLDA SKMHRYNDLI VYTKGRVPGA DIQIKATPGD
LIPHTPMVVL INGGSASASE IVAGALQDYK RAIIMGTPSF GKGSVQTVLP IGKEDAIKLT
TALYYTPAGR EIQAKGIIPN VAVPEFSITP PKSQLTLDEA DFQNHLPNDG AASTKANPTT
AEEEKNLLQT QLQLAKTDYQ LYQALMMLQG LQVVKH