Gene Noca_4584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4584 
Symbol 
ID4598682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4853655 
End bp4855295 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content78% 
IMG OID639779193 
Productglycosyl transferase family protein 
Protein accessionYP_925766 
Protein GI119718801 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCCG ACCTCGCCGC ACCGGCCCTG GTGCTCGTCG GCCCCGGGGT CGACCTGGCG 
ACCGGCTCCG GCCCGGTGCC CGAGGGCGTG ACCCGCGTGG CGCTCGGGGA CGAGGTCCCG
GACGGGGTCT GGGCCAGCGT GCTCCTGGCG GTCGCTGACG AGGCCGCGCT GCGGGTGGCC
GTCTCCCGGC TGCCCCGCCT CGGCCGGACC AAGCACGTCG GGTGCCACCT GGCGTCGGCG
ACCGGCCCGG TGACCACGGT GCTCCGTCCG GAGTGGCCGC CGCTGGCCAA CCTGGTCGCG
GAGACCGCGG ACGGCGGCGC GCACACCCGG CTGACCTTCC GGCGACCCGC CCCGGCCGGG
CCGGTGCTCG TCGAGCTCGC ACGGCACACC GGCACCCGGG TCGTGACCGG CAACCACGGG
CTGGTCGTCA CCGGTGCCCC GGTGCCGGTG GACCCGACCG CCGTCACCGA CGAGGTCCCG
GCCGCGGTGG TCGTCGGCGG CCGCGCCGAC CTGCCCGAGC ACCCGGTCCT GGGCCGGGCG
CCGGTCGCGC TGCACCTGCG CGGCGAGCCG GCCGGCCCGC TCGACGAAGC CCCCCTCGAT
GAGGCCCTGC TCAACCCGGT CGGCTTCCGC CGCGACTGGG ACCGCGGGCC GGTCCCGCTG
CCCGCGGGCG ACCCGACCCC GGAGCTGGTC GCCGCCGTCC GCGACGCCCA GGCCGTCGTG
GTGCCCGACG ACGCAGATCC CCGCACGGTG GCCGGGCTGG CGATGGCCGG TGTGCCGCTG
CTCGGCGCTG GCTACCCGGG CATCGACCCG GTCGACCCGG CCGACCTGGA CGACCCGCTG
CGCCGCGAGG AGCACTCGGT CCGGCTGCGC CGGGCCGCCC TGCGCGAGCA CTCGCACCTG
GCCTGGCGGC ACCGGGTCGC CCGGCGCGCC GGGCTGCGGG CAGCGGCGTA CCCCTCGGTC
AGCGTGCTGT TGCCGACCCG CCGACCCGAG CAGCTCGCCT TCGCGCTCGA CCAGGTCGCC
CGCCAGCGCG GCGTCGCCGA GCTCGAGCTG GTGCTGGCCA CCCACGGGTT CGAGGCCGAC
CCCGGGCTGG TGCGCGAGCG GCTGGGCGAG CGGCCGGTCA CGCTGCTGCC GCTGCCGGCC
GACACGGTCT TCGGTGACGT GCTCCGCGCC GCGACCGACG CCGCCACGGG CGACGTGGTC
GTGAAGATGG ACGACGACGA CTGGTACGGG CCGGACCTCC TCGCCGACCT GCTGCTCGCC
AAGCACTACT CCGGCGCGGA CCTGGTCGGC ACGCCCGCGG AGCTGGTCTA CCTGGAGCCG
ATCCGCACCA CCGTGCGCCG CCGCGGCCCG AGCGAGAGCT TCGGCGCCGT GGTCGCCGGC
GGCACGATGA CCCTCGACCG GGCGCTGCTG CGCGCGGTGG GCGGGTTCCG CAGCGTCCCG
CGGCACGTCG ACGCCCGGCT CCTCGAGGAC GTCCGCGCCG CGGGGGGCAG CGTGTACCGG
ACCCAGGGCC TCGGGTACGT CCTGCGGCGT ACGACGCACG GCCACACCTG GGACACCGGA
CTCGGCTACT TCCTGACCCG GCAGAGCGTC GCCGCGCAGT GGCGCGGCTT CCGGCCCAGC
AGACTCCTGG AGGCAGGGTG A
 
Protein sequence
MLADLAAPAL VLVGPGVDLA TGSGPVPEGV TRVALGDEVP DGVWASVLLA VADEAALRVA 
VSRLPRLGRT KHVGCHLASA TGPVTTVLRP EWPPLANLVA ETADGGAHTR LTFRRPAPAG
PVLVELARHT GTRVVTGNHG LVVTGAPVPV DPTAVTDEVP AAVVVGGRAD LPEHPVLGRA
PVALHLRGEP AGPLDEAPLD EALLNPVGFR RDWDRGPVPL PAGDPTPELV AAVRDAQAVV
VPDDADPRTV AGLAMAGVPL LGAGYPGIDP VDPADLDDPL RREEHSVRLR RAALREHSHL
AWRHRVARRA GLRAAAYPSV SVLLPTRRPE QLAFALDQVA RQRGVAELEL VLATHGFEAD
PGLVRERLGE RPVTLLPLPA DTVFGDVLRA ATDAATGDVV VKMDDDDWYG PDLLADLLLA
KHYSGADLVG TPAELVYLEP IRTTVRRRGP SESFGAVVAG GTMTLDRALL RAVGGFRSVP
RHVDARLLED VRAAGGSVYR TQGLGYVLRR TTHGHTWDTG LGYFLTRQSV AAQWRGFRPS
RLLEAG