Gene Caci_5104 details


Gene Information

Locus tag: Caci_5104
Symbol:
ID: 8336458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5861426
End bp: 5862640
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 72%
IMG OID: 644958203
Product: ROK family protein
Protein accession: YP_003115805
Protein GI: 256394241
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
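
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `coding_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5861426
END_BP = 5862640
GENE_LENGTH_BP = 1215
PROTEIN_LENGTH_AA = 404


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons check the record's stated lengths against its coordinates.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both stated lengths agree with the coordinates: the span covers 1215 bp, which is 405 codons, or 404 residues plus one stop codon.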


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.288839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.92903
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCAGG CCGGTGCGCA GACGACGCGC GATCTTCGCC GTCGTAGCCG CGCCACCTTG 
CTGTCGTGTA TCTACCTCGG GCGGGCGGTG AGCCGTCCGG AATTGGCGCG GTTGGCCGGG
ATGAGCTCGG CGGCGGTGAG CAATGTCGTG TCGGATCTGA TCTCCGACGG GCTGGTGGCC
GAGGCCGGGT CGGTGGACTC CAACGGTGGG CGGCCGCGCA CGATGCTCGC GGCGCGGCCC
GGGTTCGGCT ACGCGGTCGG CGTCGACATC GGCGAGACTC ACATCCACGT GGTGCTGTTC
GACTGGACGC TGTCCACCCT GGCGACCTCC ACGCACGAGA TCCGCGTCGG ACGCCTGGAT
CCGGATGTCG TGGTGCGCCT GGTCGTCTCC GGCGTGCGCT CCCTGCTGGA CAGCACCGGC
GTTCCGCACG AGCGGCTGCT CGGTATCGGT ATCGGCGTCC CCGGCGCGGT GCAGGAGGGC
GAGCGCGGCG TGGTCCACGC ACCGACGCTC GGCTGGTCCG GCGTACCGCT CGGCGACGCA
CTGCGAGCAG AGCTCGACGC GCCGATCCTC ATCGACAACT GCGCACGCAC CCTCGGCCAG
GCTGAGGCAT GGCGCGGCGC GGGACGCGAT GCACGCCGCG CGGTCGTCGC CCTGTGGGGC
GTGGGCGTCG GCGCCGCGAT CGCCGAAGGC TCCTCCCTTG CCGAAAGCGG CTCCAGCTCC
ACCAGCGAGT GGGGCCACGC GGTGATCGAA GCCCGCGGCC GCGCCTGCCG CTGCGGCTCC
CACGGCTGCC TCGAGGCCTA CGTCGGCGCC ACGGCGATCC TCGACGCGTA CCTGGCCCAC
CCCGCCGGCA AGCCCTTCAC CAGCGACGGC ACCGAAGCCA AAATGGCCGA ACTCGCCGCC
CGAGCCACCA CCGGCGCCGA CGAAGCCGCC ACCGCCACCT TCGACGAAGC AGCCGAGTAC
CTGGGCATCG GCGTCGGCAA CCTGATCAAC ATGATCAACC CCGACCAGGT CATCCTCGCC
GGCTGGGTAG GCGAACAACT GGGCCCCCTC CTCATGCCCG CCATCCGCGA AGCCGCCCGC
CGCCACGCCC TCCCCTACCT CTTCGACCAA ACCCGCATCG ACGTCGGCGA ACTGGGCCCG
GGCGCGGTAG CCCTCGGCGC CGCAACCCTG CCGGTGGCGC GACTGCTGGC AGCAGGCGGA
CACTTCGCAA GCTGA
 
Protein sequence
MIQAGAQTTR DLRRRSRATL LSCIYLGRAV SRPELARLAG MSSAAVSNVV SDLISDGLVA 
EAGSVDSNGG RPRTMLAARP GFGYAVGVDI GETHIHVVLF DWTLSTLATS THEIRVGRLD
PDVVVRLVVS GVRSLLDSTG VPHERLLGIG IGVPGAVQEG ERGVVHAPTL GWSGVPLGDA
LRAELDAPIL IDNCARTLGQ AEAWRGAGRD ARRAVVALWG VGVGAAIAEG SSLAESGSSS
TSEWGHAVIE ARGRACRCGS HGCLEAYVGA TAILDAYLAH PAGKPFTSDG TEAKMAELAA
RATTGADEAA TATFDEAAEY LGIGVGNLIN MINPDQVILA GWVGEQLGPL LMPAIREAAR
RHALPYLFDQ TRIDVGELGP GAVALGAATL PVARLLAAGG HFAS