Gene Caci_7220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7220 
Symbol 
ID8338588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8389480 
End bp8390688 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content72% 
IMG OID644960301 
ProductROK family protein 
Protein accessionYP_003117890 
Protein GI256396326 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.501197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTCCC GCGCCCAGAA AACGACTCGT GACCTGCGGT GGCACAACCG CGCCGATCTG 
CTGACACGGC TGTACCTGGG CGAGGCAACA AATCGCAACG ATCTGGCCCG GGCGTCCGGA
CTCAGCGCGG CGACGATCAG CAACGTCGTC TCGGATCTGA TCGGCGACGG ACTCGTCGGC
GAGAACGGCT CGCAGAGCTC GGCCGGCGGC CGTCCGCGCT CGCTGCTGCG CGTCCTGCCA
GCGTTCGGTC ACGTCGTCGG CATCGACATC GGCGAGACCG AGATCCGGGT CGGGCTCTTC
GACTGGACCC TGCATCCGGT CGCCGAAGAG GCGCGGCCCG TGGACATCCC GCGCGTCCCG
CCGCAGCAGG TCGCCGACCA GGTTCTGTCC GAGATAGCAG CGGTCACCGC ACGCGCCGGG
ATCGCTGTGG ACGACCTGCT CGGCGTCGGC ATCGGCGTGC CCGGCGCCGG CGGATCGGTG
ATCCACGCGC CCACGCTCGG TTGGTCCGCG GTCCCGCTCG CCGGTCTGCT CCGCGACCGC
CTCGGCTTCA CCCCCGACAT CGACAACGGC GCCATGGCGC TCGGTCAAGC CGAAGCCTGG
CGCGGAGCCG CACGAGGCGC CGAACGTGCG GTGGCCCTCC TGCTCGGTAC CGGCGCCGGC
GGAGCGCTCT CGCTCGCCGC CGGTCCCGGC GGCCGAGCGC GCAGCTTCAC CATGGAGTGG
GGACACACGG TCGTCGACCT CGAAGGTCCC CACTGCCGCT GCGGAGCACG CGGCTGCCTG
GAGACCTACA TCGGCGCCGA GGCGATCCTC GCGCGCTACG CCGCGACGCC GGGCAGCACT
CCGCTGGCCG AAGACGGCGT CGAAGCCCAG CTGTCCGAAC TCGTCGCCCG CGCCTCCCAG
CACCACGAAT CCGCGGCGCT CGAGGTTCTG GACGCCACGG CGACCTACCT CGGCGTGGGA
ATCAGCAACC TGATTAACCT CGTCGCCCCG GACCGGGTGA TCATCTCCGG CTGGGCCGGC
GCGTTGCTCT GCGACGCGGC CCTGCTTCCC GCCGTCCGGC GCGTCGTGCG CCGGCACGCC
CTGCCCTACC TGCAGGAATT CACGCGCATC GAGCCGGGCG AACTCGGTCC CTCGGCGACG
GCACTCGGAG CGGCGACGCT TCCGGTCGCG CGGCTGCTGG CCGACGGCGG CCGACGCGAG
GAGCGCTGA
 
Protein sequence
MTSRAQKTTR DLRWHNRADL LTRLYLGEAT NRNDLARASG LSAATISNVV SDLIGDGLVG 
ENGSQSSAGG RPRSLLRVLP AFGHVVGIDI GETEIRVGLF DWTLHPVAEE ARPVDIPRVP
PQQVADQVLS EIAAVTARAG IAVDDLLGVG IGVPGAGGSV IHAPTLGWSA VPLAGLLRDR
LGFTPDIDNG AMALGQAEAW RGAARGAERA VALLLGTGAG GALSLAAGPG GRARSFTMEW
GHTVVDLEGP HCRCGARGCL ETYIGAEAIL ARYAATPGST PLAEDGVEAQ LSELVARASQ
HHESAALEVL DATATYLGVG ISNLINLVAP DRVIISGWAG ALLCDAALLP AVRRVVRRHA
LPYLQEFTRI EPGELGPSAT ALGAATLPVA RLLADGGRRE ER