Gene Caci_3248 details

Gene Information

Locus tag: Caci_3248
Symbol:
ID: 8334601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3583492
End bp: 3584691
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 644956393
Product: glycosyltransferase, MGT family
Protein accession: YP_003113996
Protein GI: 256392432
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
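
The coordinates and lengths in this record are mutually consistent, which can be checked with a short sketch. The helper name cds_lengths is hypothetical. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The record does not state either convention, but its numbers agree with both.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, prot_len = cds_lengths(3583492, 3584691)
print(gene_len, prot_len)  # 1200 399
```

The result matches the Gene Length (1200 bp) and Protein Length (399 aa) fields.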


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGGT TCCTGTTCGT CAGCCTCCCG CTGACCGGCC ACGTGAACCC GATGGCGGCG 
GTCGCGAAAA CGCTGTCCGC GCAAGGGCAC GACGTCGCCT GGGCCGGCTC GGAGTCCTAT
CTGCGCCCCC TGGTGGGACC GGACGCGGTG ATCCAGCAGA TCCCGCTGCG TCCGCACCGC
CGCCAGGCCG ATCGCGGGAT GGCTGCGGCG AAGACCCGGT GGGAGGAGTA CATCGTCCCG
CACTGCAAGG TCTCGCTGAA GGGCGTCGAC AAGGCGGTGA AGGCTTTCGC GCCGGACGTG
ATGGCGGTGG ACCAGCACGC CGTCGCCGGC GCGCTGGTCG CGCACCGCTA CGGCCTGCCG
TGGGCCTCGA TGGCGCCGAC GACGATGGAA CTGACCCGGC CCTATCGCGC GCTGCCGAAG
GTCGAGGCCT GGATCCAGGG CCACCTGTCG GCGCTGTGGA CCGGCGCGGG TCTGCCGGGG
GAGCCGCCGC ACGATCTGCG GTTCTCCCCG GATCTGCTCA TCGCTTTCAC CGGTACGGCA
CTGACCGGTC CGCTGTCCTG GCCGGACAAC GCGGTCCTGG TCGGTCCGGC ACTCGCCGAG
CGTCCCGCCG ATCTCGACTT CCCCCGGGAT TGGCTGGACC CCGCCAAGAA GCTCGTCCTG
ATCTCGATGG GCACGCTGGC CGCCGAGACC TCGCACGGCT TCTACGAACG CGCCGTGGAA
GCGGTGCGTC CCCTCGGCGA TCGCGTCCAA GTCCTGCTGA CCGCGCCGCC GGAGACCATC
CCCGATCCGC CCGAGCACGT CCTGGTCCGC ACGCGTGTGC CGGTCCTGGA GTTGATGCCG
AGGCTGGACG CGGTGGTCTC GCACGGCGGA CTGAACACCG TCTGCGAGTC GCTGGCGCAC
GGCGTGCCGC TGGTCGTCGC GCCGATCAAG GGCGACCAGC CGATCAACGC CTCGCAGGTG
GCGGCGGCCG GAGCCGGAGT CCGCGTGAGC TTCGCCCGGG TGCGTCCCGA GGCGCTGCGC
GCGGCGATCG TGTCAGTGCT CGAAGATCCG TCCATCCGCG CGTCGGCGGC AGCCGTGCGC
GACTCGTTCG CCGCCGCCGG GGGTGCCGCC GCCGCGAGCG CGCGGCTGGC CCGTCTCGCC
GACGCCGGCC AAGACGCCTC ACGGCGAGAA AGGGAGGGAT CCCTTGAAAA TCTTTCGTGA
 
Protein sequence
MSRFLFVSLP LTGHVNPMAA VAKTLSAQGH DVAWAGSESY LRPLVGPDAV IQQIPLRPHR 
RQADRGMAAA KTRWEEYIVP HCKVSLKGVD KAVKAFAPDV MAVDQHAVAG ALVAHRYGLP
WASMAPTTME LTRPYRALPK VEAWIQGHLS ALWTGAGLPG EPPHDLRFSP DLLIAFTGTA
LTGPLSWPDN AVLVGPALAE RPADLDFPRD WLDPAKKLVL ISMGTLAAET SHGFYERAVE
AVRPLGDRVQ VLLTAPPETI PDPPEHVLVR TRVPVLELMP RLDAVVSHGG LNTVCESLAH
GVPLVVAPIK GDQPINASQV AAAGAGVRVS FARVRPEALR AAIVSVLEDP SIRASAAAVR
DSFAAAGGAA AASARLARLA DAGQDASRRE REGSLENLS