Gene Namu_4702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4702 
Symbol 
ID8450332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5229598 
End bp5231118 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content71% 
IMG OID645043742 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003203967 
Protein GI258654811 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGA GCTTGGCGGC CATCCTGTCC GAATCCGCGT CGCGCTACCC CGATCGGGAC 
GCGGTGGTGA TGGGGCCGCA ACGGATCGGG TACGCCACCC TGTGGCAGGA GTCTCGCCGG
TACGCCGCGG TGCTGCGCGA GCGCGGCGTC GGCCCGGGCG ACCGGGTGGC CCTCCTGTTG
CCGAACGTGC CCGACTTTCC CCGGGTCTAC TACGCCGTGC TGTCCCTGGG CGCCGTCGTC
GTGCCGGTGC ACGCGCTGCT GGTCGCCCGG GAGATCGGTT TCGTGCTGAC CGACTCCCAG
GCCTCGCTGC TGGTCGCCGC CGGGCCGCTG CTGGCGCAGG GGCTGCCCGG GGCGGAGCAG
GCCGGGGTGC CGGTGCTGGC GGTGCTCGGC GGGCCCGAGG GCGTCGACCG GCTGGACCTG
CTGGCCGCCG ACGTCGAGCC GATCCGCACC TACGTCCAGC GCGAACCGTC GGACGAAGCG
GTGATCCTGT ACACCTCGGG CACCACCGGC TCGCCCAAGG GTGCGGTGCT CACCCAGCTG
AACATGGCGA TGAACGCCAT GATCAGCGCG ACGACCGTGC TGGATCTGAC GCCCGAGGAC
GTGATCCTGG GCTGCCTACC CCTTTTCCAC TCGTTCGGCC AGACCTGCTC GATGAACGCC
GGCTTCTACG CGGGCAGCAC GTTGGTGCTG CTGCCGCGCT TCGACGGGGC GGCCGCACTC
GAGCTGATCG TGGGCGAGAG CGTGAACGTG TTCATGGGCG TGCCCACCAT GTACATCGGC
CTGCTGGCCG CCGCCCGGGA GGACGAGCGT CGGCCGGTGC TGCGGCGGGC GGTCTCCGGC
GGGGCGAGCC TGCCGGTGGC CGTCATCGAC GCGTTCAAGC GGGTGTTCGA GGCCGACATC
TACGAGGGGT ACGGGCTGTC CGAGACCTCG CCGGTGGCCA CCTTCAACCA GGCCGTGTTC
GGCCGCAAGC CGGGCACGGT CGGCCGCGCG ATCTGGGGCA CCGAAGCGGA GATCGCCGAC
CCGGCGATCG AGGACCGGAT CGCGCTGCTG CCGCAGGGCG AGGTCGGCGA GGTGGTGCTG
CGCGGCCACA ACATCTTCGC CGGCTATCTG AACAACCCGC AGGCCACCGC GGCCGCCGTG
GTCGACGGCT GGTTCCGCAG CGGCGATCTG GGGGTCAAGG ACGCCGACGG GTTCCTCTCG
ATCGTCGACC GGAAGAAGGA CCTGATCATC CGCGGCGGGT TCAACGTCTA CCCGCGCGAG
GTGGAGGAGG TGCTGGCCAG CCACCCCGGG ATCGCCCAGG TCGCCGTCGT CGGGGTGCCC
GACGCCACCC ACGGTGAGGA GATCTGCGCC GTCGTCGTGC GCTCGCCGGA GGGACAGGAC
CTGGACGCCG ACACCCTGAT GACCTGGTCC CGGGAGAAGT TGGGCCGGCA CAAGGTGCCC
CGGCGGGTCG AGTTCGTCGA GACGTTGCCG CTGGGCCCCA GCGGCAAGAT CCTCAAGCGG
GAACTGATCA AGCAGCTGTA G
 
Protein sequence
MSLSLAAILS ESASRYPDRD AVVMGPQRIG YATLWQESRR YAAVLRERGV GPGDRVALLL 
PNVPDFPRVY YAVLSLGAVV VPVHALLVAR EIGFVLTDSQ ASLLVAAGPL LAQGLPGAEQ
AGVPVLAVLG GPEGVDRLDL LAADVEPIRT YVQREPSDEA VILYTSGTTG SPKGAVLTQL
NMAMNAMISA TTVLDLTPED VILGCLPLFH SFGQTCSMNA GFYAGSTLVL LPRFDGAAAL
ELIVGESVNV FMGVPTMYIG LLAAAREDER RPVLRRAVSG GASLPVAVID AFKRVFEADI
YEGYGLSETS PVATFNQAVF GRKPGTVGRA IWGTEAEIAD PAIEDRIALL PQGEVGEVVL
RGHNIFAGYL NNPQATAAAV VDGWFRSGDL GVKDADGFLS IVDRKKDLII RGGFNVYPRE
VEEVLASHPG IAQVAVVGVP DATHGEEICA VVVRSPEGQD LDADTLMTWS REKLGRHKVP
RRVEFVETLP LGPSGKILKR ELIKQL