Gene COXBURSA331_A1196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCOXBURSA331_A1196 
Symbol 
ID5794189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCoxiella burnetii RSA 331 
KingdomBacteria 
Replicon accessionNC_010117 
Strand
Start bp1081486 
End bp1082841 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content47% 
IMG OID641330634 
Productprotease Do 
Protein accessionYP_001596934 
Protein GI161831404 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TATCCAAAAT TATTCTCAGC AGTATTTTTG CGGGGCTCCC ATTACTCCTG 
CCCGTCAGTA GTTACGCTCA CTTACCCTCC GCTGTCGAAG GAAAAACAAT ACCCAGCCTT
GCACCAATGT TGAATAAGAC CACACCGAGC GTCGTCAACA TTGCCGTGGA AAAGCTGATT
CCTCAAACGC CAAACCCCTT ACAACCCGAA ATGGATCAAA ACACAGCACC AACGAAAGTC
TTAGGCGTAG GCTCGGGTGT AATCATAGAC GCCAAAAAAG GCTATATCGT AACAAATGCT
CATGTCGTCA AAGACCAAAA AATCATGGTG GTGACGCTTA AAGATGGTCG CCGTTATCGA
GCGAAAGTCA TCGGAAAAGA TGAAGGGTTT GATCTGGCTG TGATTCAAAT TCACGCGAAC
CATTTGACCG CACTTCCCAT CGGAAATTCA GATCAATTAA AAGTGGGTGA TTTCGTCGTC
GCCGTGGGAA GCCCTTTTGG CTTAACTCAA ACAGTCACTT CCGGCGTCAT TAGCGCCTTG
AATCGCCAAG AACCGCGTAT CGATAATTTT CAAAGCTTTA TTCAAACCGA CGCGCCGATT
AATCCCGGCA ATTCCGGCGG GGCTTTAATC GATTTAGAGG GCAAATTAAT TGGTATTAAT
ACAGCGATTG TCACCCCGTC CGCGGGAAAT ATCGGCATCG GCTTTGCCAT TCCCAGCGAC
ATGGTCAAAA GCGTGGCCGA ACAATTAATT AAATATGGAA AAGTCGAACG CGGCATGCTC
GGCGTAACGG CTCAAAATAT TACCCCGGAA TTAGCGGACG CCCTAAATTT AAAACATAAC
AAAGGAGCGC TGGTAACCAA AGTGGTTGCT GAAAGTCCAG CGGCTAAAGC CGGGGTTGAG
GTGCAGGATA TTATTGAATC TGTCAACGGT ATTCGGATTC ATAGTTCAGC ACAACTCCAC
AACATGCTCG GGCTGGTGCG TCCAGGAACT AAGATTGAAC TAACCGTATT GCGCGACCAT
AAGGTTCTGC CTATAAAAAC GGAAGTAGCC GATCCTAAAA AAGTGCTATT GCAACGCGAA
CTGCCCTTCC TCGGCGGCAT GCGTATGCAG AAATTCAACG ACCTAGAGCC CGATGGCACT
ATTTTGCAAG GTGTTTTAGT TACCGGCGTG GACGATAGCA GCGATGGAGC GCTCGGCGGG
TTAGAGCCCG GCGATATCAT TATCAGTGCT AATGGCCAAT TAACGCCCAC GGTCGATGAG
CTAATGAAAA TCGCTGAAGG CAAGCCAAAG GAGTTGTTAC TGAAAGTGGC GCGGGGCGCG
GGACAATTAT TTTTAGTTAT CCAACAATCA CAATAA
 
Protein sequence
MKKLSKIILS SIFAGLPLLL PVSSYAHLPS AVEGKTIPSL APMLNKTTPS VVNIAVEKLI 
PQTPNPLQPE MDQNTAPTKV LGVGSGVIID AKKGYIVTNA HVVKDQKIMV VTLKDGRRYR
AKVIGKDEGF DLAVIQIHAN HLTALPIGNS DQLKVGDFVV AVGSPFGLTQ TVTSGVISAL
NRQEPRIDNF QSFIQTDAPI NPGNSGGALI DLEGKLIGIN TAIVTPSAGN IGIGFAIPSD
MVKSVAEQLI KYGKVERGML GVTAQNITPE LADALNLKHN KGALVTKVVA ESPAAKAGVE
VQDIIESVNG IRIHSSAQLH NMLGLVRPGT KIELTVLRDH KVLPIKTEVA DPKKVLLQRE
LPFLGGMRMQ KFNDLEPDGT ILQGVLVTGV DDSSDGALGG LEPGDIIISA NGQLTPTVDE
LMKIAEGKPK ELLLKVARGA GQLFLVIQQS Q