Gene Caci_3686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3686 
Symbol 
ID8335039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4125945 
End bp4126925 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content67% 
IMG OID644956826 
Productglycosyl transferase family 2 
Protein accessionYP_003114429 
Protein GI256392865 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.82646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGGC GGGCATTTCG GGACGCGGAA TCAGAGGGCG ATCTTGCTGA TGTCCGGGAT 
GCCCCGCTTT CGCGGATGAC TGTGGACATC GCTATACCGG TTTATAATGA GGAGAGGGCG
CTTCCTGGGT GTATCGAGAC ATTGTGGACG TATCTCAGTG AGCGGTTTCC GTTCGCGTGG
GAAATCACGA TCGTCGACAA CGGGAGTACC GACGGTACGC TGCTCGCCGC TGAGGGGCTT
GCCTCGGCGT GGCCGTATGT GAGTGTGCTG CACCAGGACC GGAAGGGTAA GGGGCTGGCC
GTGCGTACGG CGTGGCTCGC CAGTACGGCG GATGTCGTCG CATATATGGA TGTGGACCTG
TCGACCGGGC TTGATGCGCT GCTGCCGATG GTTGCCTCGC TTGTCAATGG ACACGCCGAT
ATTGCGGTGG GGTCGCGGCT GGCTTCGGGC GCGCGGGTGA TTCGGGGTGT GAAGCGGGAT
ATCACGTCGC GGGGGTACAA CGCGCTGCTG CGACTGGTTC ACGGGGTGCG GTTCACGGAT
GCGCAATGCG GGTTCAAGGC GGCTCGGGCG GAAGTCGTCG TGCCGCTATT GCGGCGGGTG
CGGGACAATG GCTGGTTCTT TGACACCGAA TTGCTGCTGC TGGCCGAGCT CAACGGGCTC
CGGTTGCACG AGGTCGCCGT CGACTGGGTC GATGACGTCG CGAGCCAGGT GGCGATTCCG
CGCGTCGCGT CGGGCAACCT GCGCGGGATG CTGCGGCTCG CGCGCCTGCG GCTCCTCGGC
GCGGCGACGG TGAGCGGCCT TCCGCAGCGG CCCGCGCCGA CCGCGACGCA TCCCGACGCG
GTCCTGGGCC AGGCTCGCCG GGTCACGAGC CGACGCCGGT ACGCCAGCCG GGCGCCGGTC
CATCTGGCAG GCGCCGCCGC GCTGGTGGCG TATGTGATCA CTCGGTTCGT CGTCGCCGGT
CCCGGACGGC GCGCGCCGTA A
 
Protein sequence
MSRRAFRDAE SEGDLADVRD APLSRMTVDI AIPVYNEERA LPGCIETLWT YLSERFPFAW 
EITIVDNGST DGTLLAAEGL ASAWPYVSVL HQDRKGKGLA VRTAWLASTA DVVAYMDVDL
STGLDALLPM VASLVNGHAD IAVGSRLASG ARVIRGVKRD ITSRGYNALL RLVHGVRFTD
AQCGFKAARA EVVVPLLRRV RDNGWFFDTE LLLLAELNGL RLHEVAVDWV DDVASQVAIP
RVASGNLRGM LRLARLRLLG AATVSGLPQR PAPTATHPDA VLGQARRVTS RRRYASRAPV
HLAGAAALVA YVITRFVVAG PGRRAP