Gene Namu_5043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5043 
Symbol 
ID8450674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5626815 
End bp5628026 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content76% 
IMG OID645044079 
ProductOrn/DAP/Arg decarboxylase 2 
Protein accessionYP_003204303 
Protein GI258655147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTTC TCGCCGCCGT CGGGGAACCG ACGCCCTCGC CCGATGCCCG GCCCGCCGAG 
CCGGACGCCA CCCCCGTCCT GCGGGTGGAC TGTGCCGTCG TGCAGGCCCG GTACCGGCGG
TTGGCCGCCG TCCTGCCCGG CGTCGACCTG CACTATGCGG TCAAGGCCAA TCCGAGCCCG
CCGGTTCTGC GTACCCTGGC CGGTCTCGGC GCGCGGTGGG ACGTGGCCAG CCCGGGTGAG
ATCGAGGCGG TTCTGGCGGT CGATCCGGAT CCCCGCCACC TCTCGTACGG CAACCCGATC
AAGAAGTCGA GCGACATCGC CGCGGCCGCC CGTCGCGGCG TGCGCCGCTA CACCGTGGAC
AGCCCGGCCG AGCTGGCCAA GGTCAGCGCG CACGCGCCCG GCGCGCAGAT CCTGGTCCGG
CTGAGCACGT CCGGGGCCGG CGCCGACTGG CCGTTGGGCG GCAAGTTCGG CTGCCCGGAG
CGGGAGGCGC GGACCCTGCT GCGCACCGCG GCCGCGGCCG GCCACGAGGT CGGGATCAGC
TTCCACGTCG GCACCCAGCA GCGTGATCCC GGGGCCTGGG ACGGGCCGCT GGCCGCCGCC
GGCCGGCTGG ACCGGGGCCT GCGGGGGGCC GGCGCCCGGC TGAGCACCGT CAACCTCGGC
GGCGGATTCC CGGCCGGCAT GCTGGGCCGG ACCCAGAGCG CGGCCCGGTA CGGGCAGGCG
ATCCGAGACG CGGTGCGGCG GGCCTTCGGG GCGCAGCCGC CGGCCCTGAT GGCCGAGCCC
GGCCGGTTCC TGGTGGCCGA TGCCGGGACG CTGGTCAGCG AGGTCGTGCT GGTCAGCGAC
CGCGGGGGCG AGCGGTGGGT CTACCTGGAC GCCGGGCTGT TCACCGGACT GGTCGAGGCC
TACGGCGAGA GCCTGCGCTA CCGGCTGGCC GTCGAACGCA CCGGCGGGCC GTTCGCCTCG
GCCACCACCG AGGCCATCCT GGCCGGCCCG ACCTGCGACA GCCTGGACGT GCTGTACCGG
CGGCACCGGT ACCGGCTGCC GGCCGATCTG CGACCCGGCG ACCGGGTGCA TTTCCTGTCC
GCCGGCGCCT ACACGGCCAG CTACTCCACC GTCGGCTTCA ACGGCTTCGC CCCGCTTCGG
GTGGAGTTCA CCGGGTCGAC CGGCGCAGGG TCGGCCGGCG CAGGGTCGGC CGGCGCGGGG
CCGACGGGAT GA
 
Protein sequence
MGLLAAVGEP TPSPDARPAE PDATPVLRVD CAVVQARYRR LAAVLPGVDL HYAVKANPSP 
PVLRTLAGLG ARWDVASPGE IEAVLAVDPD PRHLSYGNPI KKSSDIAAAA RRGVRRYTVD
SPAELAKVSA HAPGAQILVR LSTSGAGADW PLGGKFGCPE REARTLLRTA AAAGHEVGIS
FHVGTQQRDP GAWDGPLAAA GRLDRGLRGA GARLSTVNLG GGFPAGMLGR TQSAARYGQA
IRDAVRRAFG AQPPALMAEP GRFLVADAGT LVSEVVLVSD RGGERWVYLD AGLFTGLVEA
YGESLRYRLA VERTGGPFAS ATTEAILAGP TCDSLDVLYR RHRYRLPADL RPGDRVHFLS
AGAYTASYST VGFNGFAPLR VEFTGSTGAG SAGAGSAGAG PTG