Gene Caci_8813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8813 
Symbol 
ID8340206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp10217265 
End bp10219070 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content68% 
IMG OID644961903 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003119467 
Protein GI256397903 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0249011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTG TCAATGATGT GTCCGCATCG TCTGAATCCA CCGAATCATC CGATTCCGCT 
GGTACCTCTG ATTCCTCTGG TTCCTCAGCT TCGAGCAAGA CCTACCTGCC CTCCGCCGTG
CGCCCCGAGC TGCGCGTGCC GATGCGCCAG ATCGCGCTGA CCAACGGCGA CTCGGTCGTC
CTGTACGACA CCTCCGGTCC CTACACCGAC CCCGAGGTGC GCACCGACGT CCGCTTCGGC
CTGCCGGCGC TGCGCGCTCC GTGGATCGCC GAGCGCGGCG ACACCGCGGA GTACGACGGC
CGGACCTGGC AGCCGACCGA CGACGGGCTG AAGTCGGCCG ACCTGCGCAA CCTCGACGCC
GTGTTCTCCG GTGGACGCAA GCCGGTGCGC GGGACGGAGG AGCGGGGCGC GGTCACGCAG
CTCGCCTATG CCCGGCGCGG CCTCGTCACC GCCGAGATGG AGTACATCGC GGTGCGGGAG
GGGGTCACCG CGGAGTTCGT GCGGGACGAG GTCGCGCGGG GGCGGGCGGT CATCCCGGCC
AACGTCAACC ACCCCGAGGC CGAGCCGATG ATCATCGGCC GCCACTTCCT GACCAAGGTG
AACGCCAACA TCGGCAACTC CTCGGTCGCC TCCTCGATCG AGGAGGAGGT GGACAAGATG
GTGTGGGCCA CGCGCTGGGG CGCCGCCACC GTGATGGACC TGTCCACCGG CCGGAACATT
CACACCACCC GCGAATGGAT CCTGCGCAAC AGCCCGGTCC CGATCGGCAC CGTGCCGATC
TACCAGGCGC TGGAGAAGGT CAACGGCAAG GCCGAGGACC TCACCTGGGA GGTGTTTCGC
GACACCGTGA TCGAGCAGTG CGAGCAGGGC GTGGACTACA TGACGATCCA CGCCGGCGTG
CTGCTGCGCT ACGTCCCGCT GACCGCCAAC CGCAAGACCG GCATCGTCTC GCGCGGCGGC
TCGATCATGG CCGCCTGGTG CCTGGCGCAC CACGAGGAGA ACTTCCTCTA CACGAACTTC
AGGGAACTGA CGCAGATCCT GGCGCGCTAC GACGTCACCT ACTCCCTCGG CGACGGCCTG
CGCCCCGGCT CCATCTATGA CGCCAACGAC GCGGCCCAGT TCGCCGAACT GACCACCCTC
GGCGAACTGT CGAAGATCGC CCGCGAGCTC GGCGTCCAGG TGATGATCGA GGGCCCGGGC
CACGTCCCGA TGCACAAGAT CAAGGAGAAC GTCGAGCTCC AGATGGAGCT CTGCGACGAG
GCGCCCTTCT ATACCCTCGG CCCGCTCACC ACCGACATCG CCCCCGGCTA CGACCACATC
ACCTCCGCCA TCGGCGCGGC GATGATCGGC TGGTACGGCA CCGCGATGCT CTGCTACGTG
ACGCCCAAGG AACACCTGGG CCTGCCCAAC CGCGACGACG TCAAGCAAGG CCTGATCGCC
TACAAGATCG CCGCCCACGC CTCCGACCTC GCCAAGGGCC ACGAAGGCGC CCAGCGCTGG
GACGACGCAC TGTCCGACGC CCGTTTCGAA TTCCGCTGGG AAGACCAGTT CAACCTGGCC
CTGGACCCCG ACACCGCCCG CGCCTACCAC GACGAGACCC TGCCGGCCGC CCCCGCGAAG
ACCGCGCACT TCTGCTCCAT GTGCGGCCCG CACTTCTGCT CCATGCAGAT CAGCCGCAAC
ATCGCGGAGC AATACGGCGA CCAGATGGCC GCCACCGACG ACGGCGAGAT CAAGGCCGGC
ATGGACGCGA AGTCCGCAGA GTTCCTCGCC TCCGGCGCGC AGGTCTACCT GCCTCTCGCG
GACTGA
 
Protein sequence
MTAVNDVSAS SESTESSDSA GTSDSSGSSA SSKTYLPSAV RPELRVPMRQ IALTNGDSVV 
LYDTSGPYTD PEVRTDVRFG LPALRAPWIA ERGDTAEYDG RTWQPTDDGL KSADLRNLDA
VFSGGRKPVR GTEERGAVTQ LAYARRGLVT AEMEYIAVRE GVTAEFVRDE VARGRAVIPA
NVNHPEAEPM IIGRHFLTKV NANIGNSSVA SSIEEEVDKM VWATRWGAAT VMDLSTGRNI
HTTREWILRN SPVPIGTVPI YQALEKVNGK AEDLTWEVFR DTVIEQCEQG VDYMTIHAGV
LLRYVPLTAN RKTGIVSRGG SIMAAWCLAH HEENFLYTNF RELTQILARY DVTYSLGDGL
RPGSIYDAND AAQFAELTTL GELSKIAREL GVQVMIEGPG HVPMHKIKEN VELQMELCDE
APFYTLGPLT TDIAPGYDHI TSAIGAAMIG WYGTAMLCYV TPKEHLGLPN RDDVKQGLIA
YKIAAHASDL AKGHEGAQRW DDALSDARFE FRWEDQFNLA LDPDTARAYH DETLPAAPAK
TAHFCSMCGP HFCSMQISRN IAEQYGDQMA ATDDGEIKAG MDAKSAEFLA SGAQVYLPLA
D