Gene COXBURSA331_A0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCOXBURSA331_A0437 
SymbolthiC 
ID5793188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCoxiella burnetii RSA 331 
KingdomBacteria 
Replicon accessionNC_010117 
Strand
Start bp373106 
End bp374752 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content45% 
IMG OID641329959 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001596278 
Protein GI161830153 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.753273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCTGC ATCGCACTTT CCTTCTGCGT GTTTATTTGA AGGAGAGTGC TGTGATTACT 
TATCCTGCTT CCAAAAAGAT TTATTGCCAA GGCAAGATTT TCCCCACTAT TCGCGTAGGA
ATGCGTGAAA TTCAATTGAC GAACGGTGAT TCGCTAACCC TTTACGATAC CTCAGGTCCG
TATTCTGACC CGAATATCTC TATCAAATCG CCGCAGGGAC TTCCCCGCTT GCGCGAGCCT
TGGATTAAAG TGCGCCCGAG GAAAACGCAA TTGGCTTTCG CAAAAGAAGG CGTCATTACA
CCAGAAATGG AATACGCTGC GATTCGTGAA AATCAAAAAA GAGAGTTGAA GAAAAACACG
GATCAAGAAC GCAAACGTCG TTTGCAAGGA AATTCGTTGA GTGCGCGAAT TCCTAACCCA
ATTACGCCGG AATTTATTCG AAATGAAATC GCGTGCGGTC GTGCCATCTT GCCTGCGAAT
ATTAATCATC CCGAAAGCGA GCCTATGATT ATTGGTCGCC ATTTTTTAGT TAAAGTAAAC
GCGAATATTG GGAACTCGTC GCTTACGTCA TCTGTGGAGG AAGAAGTTGA AAAATTAATT
TGGGCATTGC GTTGGGGGGC GGATACGGTG ATGGATCTAT CGACGGGAAA AAAAATCAAA
GAAATTCGTG AAACAATTCT TCGTCATTCT CCGGTACCTA TCGGTACCGT GCCGTTGTAT
GAAGCATTGG AAAAAGTGGA TGGAGATGTA AAAGCCCTAA CCTGGGAAAT TTTCCGCGAC
ACCTTAATTT CGCAAGCTGA ACAAGGAGTT GATTATTTCA CGATTCATGC AGGTGTTTTA
AATCGTTTTA TTCCATTGAC CCAAAAACGC GTCACGGGAA TTGTTTCTCG CGGTGGCTCT
CTCATGGCCA AATGGTGTCT TTTGCACCGA GAGGAGAATT TTCTTTATAC GCATTTTACT
GAGATTTGTG AAATTATGCG CGCTTATGAT GTGAGCTTTT CGCTAGGGGA TGGGTTGCGC
CCAGGCTCTA TTGCTGATGC CAATGATGAA GCGCAATTTG CAGAATTGAA AATTCAAGGC
GAATTAAATC GCATTGCGTG GAAATACGGT GTGCAGGTGA TGAATGAAGG ACCAGGGCAT
ATCCCTCTAA ACCTCATCGA AGAAAATATG ACGAAACAGT TGGCTTATTG CCGGGAAGCG
CCATTTTATA CCCTGGGACC ATTAACTACC GATATTGCGC CTGGTTATGA TCATATCGGT
AGCGCTATTG GCGCTGCTTT TATCGCTTGG CAAGGGTGCG CGCTCTTGTG CTATGTCACG
CCGAAAGAAC ATTTGGGATT ACCGAACAAA CAAGACGTCA AAGAAGGACT GATTGCTTAT
AAAATTGCGG CCCATGCCGC AGATTTAGCA AAAGGACACC CAGCGGCTCG ACAGCGAGAT
TATTTATTAT CGCAAGCGCG GTTTGAATTC CGCTGGCACG ATCAATTTAA TTTAGCGTTA
GACGCTGAAA CAGCGCGCCT TTTTCATGAT GAGACTTTGC CAAAAGAGGC TGCTAAACAC
GCTCATTTTT GTTCATTGTG CGGCCCTAAA TTTTGCGCTT ACAAAACGAG CCACGAGGTT
AGGGATACCT TACAAAAGGT AACGTAA
 
Protein sequence
MRLHRTFLLR VYLKESAVIT YPASKKIYCQ GKIFPTIRVG MREIQLTNGD SLTLYDTSGP 
YSDPNISIKS PQGLPRLREP WIKVRPRKTQ LAFAKEGVIT PEMEYAAIRE NQKRELKKNT
DQERKRRLQG NSLSARIPNP ITPEFIRNEI ACGRAILPAN INHPESEPMI IGRHFLVKVN
ANIGNSSLTS SVEEEVEKLI WALRWGADTV MDLSTGKKIK EIRETILRHS PVPIGTVPLY
EALEKVDGDV KALTWEIFRD TLISQAEQGV DYFTIHAGVL NRFIPLTQKR VTGIVSRGGS
LMAKWCLLHR EENFLYTHFT EICEIMRAYD VSFSLGDGLR PGSIADANDE AQFAELKIQG
ELNRIAWKYG VQVMNEGPGH IPLNLIEENM TKQLAYCREA PFYTLGPLTT DIAPGYDHIG
SAIGAAFIAW QGCALLCYVT PKEHLGLPNK QDVKEGLIAY KIAAHAADLA KGHPAARQRD
YLLSQARFEF RWHDQFNLAL DAETARLFHD ETLPKEAAKH AHFCSLCGPK FCAYKTSHEV
RDTLQKVT