Gene Namu_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3126 
Symbol 
ID8448740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3446975 
End bp3448195 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content73% 
IMG OID645042207 
Productsecretory lipase 
Protein accessionYP_003202448 
Protein GI258653292 
COG category[I] Lipid transport and metabolism 
COG ID[COG2267] Lysophospholipase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0242733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0408746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCCCC GGAGCCGCTG GATCGCTGCG CTGGCCACCG CCGGCCTGCT GCTGGGCGGC 
TGCGCGCAGT CGACCGACTC CCGGTTGGAG CAGGGGGCGC TGGAGGAGGC GACCGAGCAG
TCGTTCTACG CGCTGCCCGA CCCGATCCCG GCTGGCGGTC CGGGTGAGGT GGTCCGCACC
GAACAGCTGC AGTCGGCCCC GGCCGGCACC ATCGCCTGGC GGGTCATGTA CCACTCCACC
GACGTCACCG GGGCGTCGAT CCTGACCTCG GCGGTGGTGA TCGCACCGAC CGCGCCGTGG
CCCGGCGGGG GTCCGCGGCC GGTGGTGGCC TGGGGTCATC CGACGACCGG GATGGCCGGC
CACTGCGCGC CCTCGACCGG CGTCGATCCG TTCGACCTGA TCGAGGGCAT GACCGATCTG
CTGAACGCCG GCTACGCGGT CGCCGCGGCC GACTACCCGG GGATGGGGGT GGCCGGTCCC
GACGCCTACC TGGTGGGGGT CAGCGAGGGC AACAGCGTGC TGGATGCGGT CCGCGCCGCG
CAGCACATCG AGCAGACGGG CGCCACCGCG GCCGGTGACG TGCTGCTGTG GGGGCATTCG
CAGGGCGGCC ATGCGGTGCT GTTCGCCGCG CAGCAGGCGG CCGGCTACGC CCCGGAGTTG
AAGGTGCGGG CGGCCGCGGT GGCCGCGCCC GCGACCGAGC TCGGCGCCCT GCTCAACGAC
GACATCGGCG ACGTCTCCGG GGTGTCCCTT GGCTCCTATG CGTTCCAGAC CTACCAGAGC
GTCTACGGGC CGAGCATCCC CGGGATGAGC CTGACCCAGG TGCTCACCGA TGCCGGCGCC
GCGGCCACGC CGCAGATGGC GGCGCTGTGC CTGATCGGCC AGAACAGCGA GCTGCACGCG
ATCGCCGGGC CGCTGGTCGG CCAGTACCTG CGCAGCGACC CGACCACGAC GGCGCCGTGG
TCGGACATCC TCGCGCAGAA CACCCCGGGC GGCGTGCCGA TCACCGTGCC GCTGCTGGTC
GCCCAGGGCG AGGCCGACGA GTTGGTGCAT CCGGCAGCGA CGCAGCAGTT CGTGACGCAG
CAGTGCGCCA AGGCCGCGCA CGTGATCTTC AAGCAGTTCC CGGGCATCGG GCACGGCGAG
ATCGCGCTCA CCGCCCTGCC GGACGTGCTG AGCTTCTTCG CCGCGGTGCG GGCCGGCTCG
ACCCCGGCCA GCACCTGTTA G
 
Protein sequence
MNPRSRWIAA LATAGLLLGG CAQSTDSRLE QGALEEATEQ SFYALPDPIP AGGPGEVVRT 
EQLQSAPAGT IAWRVMYHST DVTGASILTS AVVIAPTAPW PGGGPRPVVA WGHPTTGMAG
HCAPSTGVDP FDLIEGMTDL LNAGYAVAAA DYPGMGVAGP DAYLVGVSEG NSVLDAVRAA
QHIEQTGATA AGDVLLWGHS QGGHAVLFAA QQAAGYAPEL KVRAAAVAAP ATELGALLND
DIGDVSGVSL GSYAFQTYQS VYGPSIPGMS LTQVLTDAGA AATPQMAALC LIGQNSELHA
IAGPLVGQYL RSDPTTTAPW SDILAQNTPG GVPITVPLLV AQGEADELVH PAATQQFVTQ
QCAKAAHVIF KQFPGIGHGE IALTALPDVL SFFAAVRAGS TPASTC