Gene Caci_6688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6688 
Symbol 
ID8338052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7707532 
End bp7708752 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content76% 
IMG OID644959782 
ProductROK family protein 
Protein accessionYP_003117375 
Protein GI256395811 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.395851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGAC TGTGGCATTG TCAAGACATG CCCGCGCCCA CCAGCCCCGC CGGCCCGGCC 
AGAGCGACCG GCTCCGCCGA CCGCGCCGTC CCCAACCCCG CCCGGCAGGG GAGCATCCGC
AACGCCAACC TGGCGCTGCT CTACGGCCTG ATCCTGGACG CCCCGGCGCC GCTGTCCCGC
GCCGCGCTGG CCGCCACCAC CGGTGTGACC CGCGCCACAG CCTCCGCGCT GGCCGACACG
CTGCTGGAGG CGGGACTGGT CGCGGAAGTC TCACCGCCGC CGGCCACCGG CGCGGGCCGT
CCGGCCGCCG GCCTGGTCCC GGCCGCCGAG GGCCCGGCCG GGCTCGGGCT GGAGATCAAC
GTGGACTACC TGGCGGCCTG CGTGGTGGAC CTGACCGGCG CCGTCCGCGC CACCGTCATA
TCCGCGGGCG ACCAGCGCGA CCGCTCGGTG TCGGAGGTGC TGGCCGATCT GGCCGGGCTG
GCGCGCCAGG CCGTCTCGGA GGCCGGGCTG ACCGTCGCCG GCGCCGCGGT CGCCGTCCCG
GGTCTGGTCG AGGCGCCGCA CGGACGGATC CGGAGCGCGC CGAACCTGGT GTGGCAGGAC
GTGGAGATCG GCGCGGCGCT GCGCAGCGCG CTGCCGGAGA CGCCGTTCGA GCCGGTCGTC
GGGAACGAGG CGGATTTCGC AGCCCTGGCC GAGGCGCACG GGGTTTTCGA CGGGGACGCG
GACGGCCCGG CGGCGCCGCT GACCGACTTC CTGTACGTCT CGGGCGAGAT AGGCGTCGGC
GCGGGCGTCA TCCTGGACCG CGAGCTGTTC CGCGGCGCGC GGGGGTGGGC CGGCGAGATC
GGGCACGTCA CGGTCCAGCC CGAGGGGGTC CAGTGCCGCT GCGGCGCGCG GGGCTGTCTG
GAGACTGTCG CAGGACTCGA AGCGCTGCGC CGCGACGGAC CCGAAGCCGC TGCTTCGGCA
CTCGGCCGGG CGGCAGCGGC CGCGGTGAAC CTGCTGGATC TGCCGGCGGT CGTCCTCGGC
GGCGCCTATG CCCGGCCGGA GTTCGCCGCG CTGGTTCCGG GGGTGGAGAA GGCACTGGCC
GACCATGTGA TCTCGGCGCG ATGGGCTCCG GTCGCCGTGC ACGTGTCGCG GCGCGGAACC
GCGGCGGCGG TGACCGGCGC GGCGACGGCG GTCATCCGGC GGGTGCACGC CGATCCGGCG
GCTTGGATGG CGGCACGCTG A
 
Protein sequence
MDGLWHCQDM PAPTSPAGPA RATGSADRAV PNPARQGSIR NANLALLYGL ILDAPAPLSR 
AALAATTGVT RATASALADT LLEAGLVAEV SPPPATGAGR PAAGLVPAAE GPAGLGLEIN
VDYLAACVVD LTGAVRATVI SAGDQRDRSV SEVLADLAGL ARQAVSEAGL TVAGAAVAVP
GLVEAPHGRI RSAPNLVWQD VEIGAALRSA LPETPFEPVV GNEADFAALA EAHGVFDGDA
DGPAAPLTDF LYVSGEIGVG AGVILDRELF RGARGWAGEI GHVTVQPEGV QCRCGARGCL
ETVAGLEALR RDGPEAAASA LGRAAAAAVN LLDLPAVVLG GAYARPEFAA LVPGVEKALA
DHVISARWAP VAVHVSRRGT AAAVTGAATA VIRRVHADPA AWMAAR