Gene Namu_2040 details


Gene Information

Locus tag: Namu_2040
Symbol:
ID: 8447649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2250281
End bp: 2251261
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 71%
IMG OID: 645041166
Product: 3-oxoacyl-(acyl-carrier-protein) synthase III
Protein accession: YP_003201412
Protein GI: 258652256
COG category: [I] Lipid transport and metabolism
COG ID: [COG0332] 3-oxoacyl-[acyl-carrier-protein] synthase III
TIGRFAM ID: [TIGR00747] 3-oxoacyl-(acyl-carrier-protein) synthase III
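
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, but the numbers are consistent with both. The function names are hypothetical and exist only for illustration.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
start_bp, end_bp = 2250281, 2251261
gene_length_bp, protein_length_aa = 981, 326

assert cds_span(start_bp, end_bp) == gene_length_bp
assert protein_length(gene_length_bp) == protein_length_aa
```

Both assertions hold: 2251261 - 2250281 + 1 = 981 bp, and 981 / 3 = 327 codons, one of which is the stop codon, leaving 326 residues.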


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0180217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0109261
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCA CCATTCGCAC CGCTGCCGGC TCCCCCGGGT CCAAGATCGT CGGTCTGGGT 
CACTACCGGC CCGACCGGGT GGTCACCAAC GACGATCTCG CCCAGATCAT GGACACCAAC
GACGAGTGGA TCCAGGCCCG GGTCGGCATC GCCGAGCGCC GGTTCGCCGC CGCCGACGAG
TCGGTGGCCT CGATGGGCGC CCAGGCCGGC GCCAAGGCCC TGGCCGAGGC GGGCCTGCAG
CCCGAGCAGA TCGACACCGT GATCACCGCG ACCTGCAGCC TGGACTCCCC CGTCCCGCAC
GCCTCGACCC AGATCGCCAG CCTGCTGGGC ATTCACGCGC CGGGTTCGTT CGACCTCAAC
GCGGCCTGCG CCGGCTTCTG CTACGCGATC GCGGCCGCCG ACCAGGCGGT GCGCACCGGT
GCCTCGCGCA ACGTGCTGGT GGTCGGCTCG GAGAAGCTGA CCGACTGGAC CAAGCGGGAC
GACCGGGCGA CGGCGATCAT CTTTGCCGAC GGGGCCGGCG CCGTGGTGGT TTCGGCCGCC
GACGAGCCGG GCATCGGCCC GGTCGTTTGG GGTTGCGACG AGGACCACAC CCAGACCATC
CGGATCGAGG GCCGCAACGG CCATTTCATC CAGGAGGGCC AGACGGTCTT CCGCTGGGCC
ACCTCCGCGA TCGCCCCGGT GGCGATCCGC GCGGCGGCGG CGGCCGGCGT CGCACTGGAC
GAGATCGACG TGCTGGTCAC CCATCAGGCG AACCTGCGGA TCATCGACGG CATCGCCAAG
AAGATCATCA GGGAAGGCGC GCGCCAGGAT CTCAAGGTCG GCCGGGACAT CGTCACCACC
GGCAACACCT CCTCGGCGTC CATCCCGATC GCGCTGGACC GGATGCGCGC CGCCGGCGAG
GTCTCCTCGG GCCAGGTCGT GCTCTCGGTC GCCTTCGGCG CGGGACTCAC CTACGCCAGC
CAGGTGTTCG TCTGCCCCTG A
 
Protein sequence
MSATIRTAAG SPGSKIVGLG HYRPDRVVTN DDLAQIMDTN DEWIQARVGI AERRFAAADE 
SVASMGAQAG AKALAEAGLQ PEQIDTVITA TCSLDSPVPH ASTQIASLLG IHAPGSFDLN
AACAGFCYAI AAADQAVRTG ASRNVLVVGS EKLTDWTKRD DRATAIIFAD GAGAVVVSAA
DEPGIGPVVW GCDEDHTQTI RIEGRNGHFI QEGQTVFRWA TSAIAPVAIR AAAAAGVALD
EIDVLVTHQA NLRIIDGIAK KIIREGARQD LKVGRDIVTT GNTSSASIPI ALDRMRAAGE
VSSGQVVLSV AFGAGLTYAS QVFVCP
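
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before the length or GC content reported in the record can be recomputed. The following is a minimal sketch of that step; the helper names are hypothetical, and the GC calculation assumes a simple (G + C) / total-bases definition, which may differ from how the database derived its 71% figure.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (simple definition, assumed)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, as formatted in the record.
first_line = "ATGAGCGCCA CCATTCGCAC CGCTGCCGGC TCCCCCGGGT CCAAGATCGT CGGTCTGGGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick check that no blocks were lost in copying.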