Gene Noca_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1669 
Symbol 
ID4600048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1774808 
End bp1776280 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content72% 
IMG OID639776268 
Productdiacylglycerol O-acyltransferase 
Protein accessionYP_922869 
Protein GI119715904 
COG category[R] General function prediction only 
COG ID[COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain 
TIGRFAM ID[TIGR02946] acyltransferase, WS/DGAT/MGAT 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.445844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGGAC GACAGCGGTT GAGCGGCCAG GACGCGCTCT GGTTGGCGAT GGACAAGCCC 
GGCAACCTGA TGGTCGTCGA CAGTCTCTTC TGGACGGCCG AGCCGATCGA CTGGGACCGC
TTCCGTGAGG TGATGAGGGA GCGCTTCTGG GAACGGTACG ACGTGGTCCG CAGCGTCATC
GTGCGCGACG AGGACGGCGC CCTGTGCTGG GAGGAGGTGC CCGAGGCCGA CCTCGACGAC
CGCTTCGAGC AGGTGGTGCT CCCGGCGCCC GGGGGTGACG CGGAGCTGCA GGACCTGATC
GCCGCGCAGC GGGTGCTCCC GCTCGATCGC GGCGAGCCGC TGTGGCGTGC GGTGCTCGTG
GACGGGTTCC ACGGCGGCAG CGCCGTGCTG TTCCGGGGCC ACCACTCGAT CGCCGACGGG
ATCCGGATGG TCCAGCTCGT GCTGCGGGTC TTCGACTGCA GCCCCGACGG CGAGGACCCC
GGCCCAGCGC GGAAGACGGC CAGGAAGACG GTGCGGAAGG CGCCTGATGC CGCTCGGACG
CCGGTCCCGC GCCGGGGCGA CACCTCCCTG ACCGGCCGTG CGGTGGCCGC TGCGACCACC
TCCCTGCAGG TCGCCCGGAG CGCGATGACC AACCCGGTAG GGGCCGCGCA CAGCGCCCTC
ACGCTCTCCG AGGCCATGCT CGGCCGGTTC GGTGCCCTGC CGGTGGTCTC GGCGCTCCCC
GGCGACGTCG ACGCTGCTCG CAAGCTGGTC CTGGGGACCC GCAACGACAC CACCGCGTGG
AGCGGGACGG TGGGGGACCG CAAGGCGATC GCCTGGACCG CTCCGTTGTC CCTCGCCGAG
GTCAAGGCGG TCGCGCACGC GCACGGGGCG ACCGCGAACG ACGTGCTGGT CAGCTGCGTC
GCGCAGTCGC TGCGGGCGTA CCTCGAGGCC CATGACGCGG TGTGCCACAG CGTCACGTGG
GACGTGCCGG TCAACCTCAA GCCGTTCGAC CCGGACCTGC CCGTCGAGCT CGGCAACGGG
TTCGCGCTGG TGCAGCTCGA GCTGCCGACG AACATCGACG ACCCGGTCCG CGCCCTCGAC
GTGGTGCGGC GCCGGATGAG CCGGATCAAG AACGGCCACG AGGCGGTCGT CGACTACGGG
ATCCAGGCCG CCATCGGTCG GATGAGCACG GCCCTCTACC GCGCGACCAT CGACCTCCTG
GCCAACCGGG CGGTCGGAGT GCTGACGAAC GTGCCGGGGC CGCAGGTGCC GCTCTACATC
GCCGGGCGGA AGGTCGAGGC GATGCTGGGC TGGGCGCCGC TGACGGCCGA CCAGGCGATG
AGCCTGACGA TCTACAGCTA CGACGGCAAG GTGTTCGTCG GCCTCGCCGC CGACGCCGGC
CTGGTGCCGG ACCACCAGCA GGTGGTCGAC GGTTTCGCCC AGGCGTTCGC GCGCCTGGTC
GAGCGGACCG AGGCGACCCG GCGCGCCGGC TGA
 
Protein sequence
MGGRQRLSGQ DALWLAMDKP GNLMVVDSLF WTAEPIDWDR FREVMRERFW ERYDVVRSVI 
VRDEDGALCW EEVPEADLDD RFEQVVLPAP GGDAELQDLI AAQRVLPLDR GEPLWRAVLV
DGFHGGSAVL FRGHHSIADG IRMVQLVLRV FDCSPDGEDP GPARKTARKT VRKAPDAART
PVPRRGDTSL TGRAVAAATT SLQVARSAMT NPVGAAHSAL TLSEAMLGRF GALPVVSALP
GDVDAARKLV LGTRNDTTAW SGTVGDRKAI AWTAPLSLAE VKAVAHAHGA TANDVLVSCV
AQSLRAYLEA HDAVCHSVTW DVPVNLKPFD PDLPVELGNG FALVQLELPT NIDDPVRALD
VVRRRMSRIK NGHEAVVDYG IQAAIGRMST ALYRATIDLL ANRAVGVLTN VPGPQVPLYI
AGRKVEAMLG WAPLTADQAM SLTIYSYDGK VFVGLAADAG LVPDHQQVVD GFAQAFARLV
ERTEATRRAG