Gene Caci_5074 details

Gene Information

Locus tag: Caci_5074
Symbol:
ID: 8336428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5826215
End bp: 5827471
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 71%
IMG OID: 644958173
Product: UDP-N-acetylglucosamine
Protein accession: YP_003115775
Protein GI: 256394211
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase
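
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5826215
end_bp = 5827471
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and adds no residue to the protein.
protein_from_span = codons - 1

print(span, remainder, protein_from_span)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.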


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.838773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.526532
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGC GACGGGGATC CGCGCCGCAC CGGATCGCGA CGATCAGCGT GCTGACATCG 
CCGCTGGCGC AGCCGGGCGG AGGCGACGCC GGAGGGCTCA ACGTCTATGT GGTCGAGACC
GCCCGACGCT TCGCCGAGAC TGGCGTCCAG GTGGACATCT TCACCCGGGC CGCCGCTCCG
CGCCTGCCGC CGATCGTCGA ACTCTGCGAC GGCGTCGTGG TGCGGCACGT ACCGGCCGGT
CCGCCGCGCG AGGTGGACAA GGGCGCTTTG CCCAGAGTCC TCGGCGAGTT CACCGCCGGC
ATGCTGCGCG CCCCGGGGGA CTACGACGTG GTGCACGCCC ACCACTGGCT CTCCGGGCGC
GTCGGCGCTC TGGTGGCCCG CTCCCGGGGT GTTCCGCTCG TCCAGTCGAT GCACTCGCTC
GGGCTGGTGA AGAACGCGGT CCTGCCCGGC GAGGACGGAT CGGCGCCGCC GGCTCAGATA
GCGGGAGAGA GCGCGGTGAT CGCCGCCGCC GACCGCCTCG TCGCCAACAC CGCGCAGGAG
GCGGATCAGC TCATCGCGCT CTACGGCGCG GCTCCCGAAC GCGTGCACAC CGTGCATCCC
GGCGTGGATC TGGAGCTGTT CCGGCCGGGT GATCGAGACC AAGCCCGAGC CCGGCTCGGT
CTTCCTCATG ACGCCTTCGT CCTGCTGTTC GCCGGGCGCG TGCAGCGGCT CAAAGGTCCT
GACATCCTCA TGCGCGCCGC CGCGCAGTTG CTGCATGCGG ACCTTGACCT TGCTCAGCGC
CTCGTGGTGG CCTTCGTCGG CGGTCCGAGC GGTGAATTGC AAGCAGACCC AGACCAGCTC
ACGAAGCTCG CGACGGATCT GGGGATCGGC GAGCAGGTGC GCGTGGAACC ACCGTGTCCG
CATCCGGAAC TCGCCGACTG GTATCGCGCC GCGACCCTCG TCGTCGTCCC GTCGCGGGCT
GAGACCTTCG GTCTGGTGGC GGTGGAGGCG CAGGCTTGCG GGACGCCGGT GGTCGCCGCG
GCGGTCGGCG GTTTGCAGAC CGCCGTGCGA GCAGGGGTCT CAGGAGTCCT GGTGGAAGGA
CACGATCCGG CGCGGTACGC GGAGGTGATC AGAGCTCTGA TCGACGATCC GGCGCGGCTG
ACGGCGTTGC GGGCGGGCGC GTTGCAGCAC GCCGCCGGAT TCGGCTGGAG CGAGGCCGTG
GACCGGCTGC TCGCGGTCTA CCGGTCCGCT ATCGAAGGCG GTCGCGGACG GCCGTGA
 
Protein sequence
MTARRGSAPH RIATISVLTS PLAQPGGGDA GGLNVYVVET ARRFAETGVQ VDIFTRAAAP 
RLPPIVELCD GVVVRHVPAG PPREVDKGAL PRVLGEFTAG MLRAPGDYDV VHAHHWLSGR
VGALVARSRG VPLVQSMHSL GLVKNAVLPG EDGSAPPAQI AGESAVIAAA DRLVANTAQE
ADQLIALYGA APERVHTVHP GVDLELFRPG DRDQARARLG LPHDAFVLLF AGRVQRLKGP
DILMRAAAQL LHADLDLAQR LVVAFVGGPS GELQADPDQL TKLATDLGIG EQVRVEPPCP
HPELADWYRA ATLVVVPSRA ETFGLVAVEA QACGTPVVAA AVGGLQTAVR AGVSGVLVEG
HDPARYAEVI RALIDDPARL TALRAGALQH AAGFGWSEAV DRLLAVYRSA IEGGRGRP
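
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch rather than the database's own method. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, no special start-codon handling is included.

```python
from itertools import product

# Codon order used by NCBI genetic code listings: bases T, C, A, G,
# first base varying slowest and third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
first_bases = "ATGACGGCGC GACGGGGATC CGCGCCGCAC"
print(translate(first_bases))  # MTARRGSAPH
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content listed in the record.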