Gene Information |
Locus tag | Caci_4416 |
Symbol | |
ID | 8335770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5016004 |
End bp | 5017194 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644957519 |
Product | ROK family protein |
Protein accession | YP_003115121 |
Protein GI | 256393557 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCCC GCCTGTCCGG CGACCTGCGC CGTGCGAACC GCGTGGAAGT GATGCGGAGC TTTTACGGCG GCCGCACGCT GACCCGGGGC GACGTCGCCG CGCAGCTCGG CGTGTCGGTG GCGACCGCCG GGACGATCAT CGGGGAGCTG ACCGCCGCCG GGCTGCTGGC CGAGACGCAG AGCCCGCGCT CCGGCGGCGG CCGTCCGGCC AGCCAGCTGA CGATGCGCGC CGGGGCGCCC TACCTGGTCG GCGTGGACCT CGCCGAGACC TACGTGATCG CCGAGATCTT CGACCAGGCG ATGACGCGCG TCGGGCACTT CCAGACCCCG GTGGCGCCGA CCGACGACGA CCCGGATTCG GTGGTCGGGC ACGTCGTGGA CAGCGTCCAG GGCGTCATCG CCGCCCTGGA CGGCGTCAGC GCGGCCGACG TCGCCGGGGT CGGCGTCAGC CTGCCCGGCC AGGTGGACCG CGAGGGCGGG GTCTCGGTGC ACGCGCCGAA CTGGGGCTGG CACGGCGTGC CGTTCACCTC CCTGTTCCAC AAGCGCTGCG ACCTGCCGGT CCTGCTGGAC AACCCGCTCA AGGCGATAAC CCTCGCCGAG ATGATGTTCG GCGAGGCCGG CGACCACGAC GACGCCGTGG TGGTGAACCT GGGCACCGGC GTGGGCCTCG GCGTGGTCGC CGAGGGCCGG CTGCTGCGCG GGCGCACCAA CACCGCCGGG GAGTGGGGAC ACACGATCCT GGTCGCCGAC GGACTGCCCT GCCACTGCGG CAGCCGCGGC TGCGTCGAGG CCTACGTCGG CGCCGCCGCC CTGCTGGACC TGCTCACCGA CGTCGATCCG GACAGCCCGC TGCTGGTCCC CGGCGACCAG GCGGCGACCG TGGCCCGGCT CGCCGAGGCC GTCGCGAGCG CCGATCCGGT CGCGGTGGCG ACCCTGGAGC GCTTCGCCCG ACCGCTGGGC ATGGCGCTGG CCAACGCCGT GAACATGCTC AACCCCGAAC TCCTCGTGGT CGGCGGCTGG GTCAGCGCCG CCTTCGGCGA GCCGCTGCTG GCCGCGGTCG AGCCGGTGGT CAAGCAGTTC TCCCTGGCCG TCCCCTACGA CGCGGTCACG CTGGCCGCCT CCCGCATCGC CGACAACCCG GTGTCCCTGG GCATGGCGGT GCTCGCCTTC GAGACGTTCG TCCTGCCCTA G
|
Protein sequence | MRPRLSGDLR RANRVEVMRS FYGGRTLTRG DVAAQLGVSV ATAGTIIGEL TAAGLLAETQ SPRSGGGRPA SQLTMRAGAP YLVGVDLAET YVIAEIFDQA MTRVGHFQTP VAPTDDDPDS VVGHVVDSVQ GVIAALDGVS AADVAGVGVS LPGQVDREGG VSVHAPNWGW HGVPFTSLFH KRCDLPVLLD NPLKAITLAE MMFGEAGDHD DAVVVNLGTG VGLGVVAEGR LLRGRTNTAG EWGHTILVAD GLPCHCGSRG CVEAYVGAAA LLDLLTDVDP DSPLLVPGDQ AATVARLAEA VASADPVAVA TLERFARPLG MALANAVNML NPELLVVGGW VSAAFGEPLL AAVEPVVKQF SLAVPYDAVT LAASRIADNP VSLGMAVLAF ETFVLP
|
| |
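The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the database record: the helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(5016004, 5017194)
    print(length)                  # 1191
    print(protein_length(length))  # 396
```

Both values agree with the Gene Length and Protein Length fields of the record. `gc_percent` accepts the space-grouped gene sequence as printed above, so the GC content field can be checked the same way.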