Gene Caci_8502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8502 
Symbol 
ID8339882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9861562 
End bp9863094 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content71% 
IMG OID644961589 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003119166 
Protein GI256397602 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.129361 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGA TTCCGTGGGG GGATTTCGGC ATGGCGGTCG GGTCAAGCTT GGAAAGTCCA 
GTCGTGCGGG TACACCTCGC ACGGCCCTGG GCGACGAGAT ATCTAAGGAG CGCCATCACG
GCGCTCCTCC TAGCGATGGA CTGGACCTGC GCGACAGCGG CCACCTGGCT CGCCGTCCCG
GTCCAGGACG ACTGGCCGGC GTTGGTCGCC GTGCCCGTCG CATGGGTCGG CGCCGCGGGC
GCGCACCGGC TCTACGAACG GCGCCATGTA GGACCGGGGA CCGAGGAGTA TCACAGGATT
CTGCGGGCCT GCCTGGCCGC GATGGCAGCG CTGTCGGCGC TGTGCGCGCT GGTGGCCCGC
AACGACCGTT TGGTGCGCAG TGTTCTGGTC GCCGTGCCGA TCGCGGCGGT GGGCTCGCTT
CTGGCCCGCA AGGCGTTCCG TTCGTTGCAG GCCCGGCACG CCGGGCTTGC TTCGCGTCCG
GCGCTTCTGG TCGGCAGCTC CATTCAATGC TCGGCGATGG CGGCGATCCT GCGGCGCGAG
CGTTCGGCGC TGCGCGCGGT CGCGGCGCTG AACGTCGCCG GACCGAGCCA GGGGACGGCG
CCGCTGTCCC AGGCTTCGGG CTCGGTGCCG CCGACACCGC AGGACCCGCT GGACCCGTTG
AACGTCGGCA CGCTGACCGC GCACGGCGGG CCCAGCGACG CCGAGCAGGT CGCAACGGCC
CTGGAGGTGA CCGGCTGCGA GGTCGTGGTG CTGATGCCCG GTCCGCATCT GGGAGCGGCG
GCGTTGAGCG CTCTGGGCTG GCGGCTGGCC AGCCTGGGCG TGGACATCCT GGTCGCCCCG
TTCCTCACCG AGATCGCGCC GGCGCGGCTG GCGGTCCGGC GCGACGGCGG CGTCCCGCTG
TTCCACGTCC GCGCGCCGCG GCTCTCGCGC GGCGCGCGGG TCCCGAAGGA GCTCGGCGAG
CGGGTGATGG CGGCGATCGG CCTGCTGCTG CTGGCGCCGA TCTTCCTGGC GGTATCGCTG
GCGGTGCTGC TCGGCGACGG GCGGCCGATC TACTTCCGGC AGACGCGGGT CGGGCTCAAC
GGGGAGCACT TCGTCCTCTA CAAGTTCCGC ACCATGTCCA CCGGCGCGGC ACAGGCGAAG
AAGGAACTCG CGCACCTGAA CGTCAACTCC GACGGTCTGC TGTTCAAGAT GCGGCGGGAC
CCGCGGGTGA CGAAGGTCGG CGCGGTGCTG CGCCGGTACT CGCTGGACGA GCTGCCGCAG
CTGCTCAACG TCGTGCGCGG CGACATGGCC CTGGTCGGGC CGCGTCCGCC GCTGCCGGAA
GAGGCCGCCA AGTACAGCGA GGAGGTCCGG CGCCGGCTGC TGGTCAAGCC GGGCCTGACC
GGGCTGTGGC AGGTGAGCGG ACGTTCCGAC CTGGCGTGGG CGGACGCGGT GCGGTTGGAC
CTGGGATACG TCGAGAACTG GTCGCTGGGC CTGGACGCGG AGATCTTGCT CCGCACCGGC
TCGGCGGTCG TCAAGGGCAA AGGAGCTTAC TGA
 
Protein sequence
MDEIPWGDFG MAVGSSLESP VVRVHLARPW ATRYLRSAIT ALLLAMDWTC ATAATWLAVP 
VQDDWPALVA VPVAWVGAAG AHRLYERRHV GPGTEEYHRI LRACLAAMAA LSALCALVAR
NDRLVRSVLV AVPIAAVGSL LARKAFRSLQ ARHAGLASRP ALLVGSSIQC SAMAAILRRE
RSALRAVAAL NVAGPSQGTA PLSQASGSVP PTPQDPLDPL NVGTLTAHGG PSDAEQVATA
LEVTGCEVVV LMPGPHLGAA ALSALGWRLA SLGVDILVAP FLTEIAPARL AVRRDGGVPL
FHVRAPRLSR GARVPKELGE RVMAAIGLLL LAPIFLAVSL AVLLGDGRPI YFRQTRVGLN
GEHFVLYKFR TMSTGAAQAK KELAHLNVNS DGLLFKMRRD PRVTKVGAVL RRYSLDELPQ
LLNVVRGDMA LVGPRPPLPE EAAKYSEEVR RRLLVKPGLT GLWQVSGRSD LAWADAVRLD
LGYVENWSLG LDAEILLRTG SAVVKGKGAY