Gene Caci_4416 details


Gene Information

Locus tag: Caci_4416
Symbol:
ID: 8335770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5016004
End bp: 5017194
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 74%
IMG OID: 644957519
Product: ROK family protein
Protein accession: YP_003115121
Protein GI: 256393557
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
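
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5016004
END_BP = 5017194
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.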


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACCCC GCCTGTCCGG CGACCTGCGC CGTGCGAACC GCGTGGAAGT GATGCGGAGC 
TTTTACGGCG GCCGCACGCT GACCCGGGGC GACGTCGCCG CGCAGCTCGG CGTGTCGGTG
GCGACCGCCG GGACGATCAT CGGGGAGCTG ACCGCCGCCG GGCTGCTGGC CGAGACGCAG
AGCCCGCGCT CCGGCGGCGG CCGTCCGGCC AGCCAGCTGA CGATGCGCGC CGGGGCGCCC
TACCTGGTCG GCGTGGACCT CGCCGAGACC TACGTGATCG CCGAGATCTT CGACCAGGCG
ATGACGCGCG TCGGGCACTT CCAGACCCCG GTGGCGCCGA CCGACGACGA CCCGGATTCG
GTGGTCGGGC ACGTCGTGGA CAGCGTCCAG GGCGTCATCG CCGCCCTGGA CGGCGTCAGC
GCGGCCGACG TCGCCGGGGT CGGCGTCAGC CTGCCCGGCC AGGTGGACCG CGAGGGCGGG
GTCTCGGTGC ACGCGCCGAA CTGGGGCTGG CACGGCGTGC CGTTCACCTC CCTGTTCCAC
AAGCGCTGCG ACCTGCCGGT CCTGCTGGAC AACCCGCTCA AGGCGATAAC CCTCGCCGAG
ATGATGTTCG GCGAGGCCGG CGACCACGAC GACGCCGTGG TGGTGAACCT GGGCACCGGC
GTGGGCCTCG GCGTGGTCGC CGAGGGCCGG CTGCTGCGCG GGCGCACCAA CACCGCCGGG
GAGTGGGGAC ACACGATCCT GGTCGCCGAC GGACTGCCCT GCCACTGCGG CAGCCGCGGC
TGCGTCGAGG CCTACGTCGG CGCCGCCGCC CTGCTGGACC TGCTCACCGA CGTCGATCCG
GACAGCCCGC TGCTGGTCCC CGGCGACCAG GCGGCGACCG TGGCCCGGCT CGCCGAGGCC
GTCGCGAGCG CCGATCCGGT CGCGGTGGCG ACCCTGGAGC GCTTCGCCCG ACCGCTGGGC
ATGGCGCTGG CCAACGCCGT GAACATGCTC AACCCCGAAC TCCTCGTGGT CGGCGGCTGG
GTCAGCGCCG CCTTCGGCGA GCCGCTGCTG GCCGCGGTCG AGCCGGTGGT CAAGCAGTTC
TCCCTGGCCG TCCCCTACGA CGCGGTCACG CTGGCCGCCT CCCGCATCGC CGACAACCCG
GTGTCCCTGG GCATGGCGGT GCTCGCCTTC GAGACGTTCG TCCTGCCCTA G
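
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The sketch below strips the whitespace and computes GC content, assuming that the GC content field means the percentage of G and C bases over the whole gene. Only the first line of the sequence is used as sample input, so its result is not expected to equal the whole-gene figure. The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence shown above, used only as sample input.
FIRST_LINE = "ATGCGACCCC GCCTGTCCGG CGACCTGCGC CGTGCGAACC GCGTGGAAGT GATGCGGAGC"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq), "bases,", round(gc_percent(seq), 1), "% GC")
```

Passing the full 1191-base sequence through the same two functions would give the figure to compare with the GC content field.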
 
Protein sequence
MRPRLSGDLR RANRVEVMRS FYGGRTLTRG DVAAQLGVSV ATAGTIIGEL TAAGLLAETQ 
SPRSGGGRPA SQLTMRAGAP YLVGVDLAET YVIAEIFDQA MTRVGHFQTP VAPTDDDPDS
VVGHVVDSVQ GVIAALDGVS AADVAGVGVS LPGQVDREGG VSVHAPNWGW HGVPFTSLFH
KRCDLPVLLD NPLKAITLAE MMFGEAGDHD DAVVVNLGTG VGLGVVAEGR LLRGRTNTAG
EWGHTILVAD GLPCHCGSRG CVEAYVGAAA LLDLLTDVDP DSPLLVPGDQ AATVARLAEA
VASADPVAVA TLERFARPLG MALANAVNML NPELLVVGGW VSAAFGEPLL AAVEPVVKQF
SLAVPYDAVT LAASRIADNP VSLGMAVLAF ETFVLP
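
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is one way to do that without external libraries. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meanings). The table is built from the compact 64-letter string in T, C, A, G order, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record, used as sample input.
FIRST_LINE = "ATGCGACCCC GCCTGTCCGG CGACCTGCGC CGTGCGAACC GCGTGGAAGT GATGCGGAGC"

if __name__ == "__main__":
    print(translate(FIRST_LINE))
```

The 20 residues produced from the first sequence line correspond to the first two blocks of the protein sequence above (MRPRLSGDLR RANRVEVMRS).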