Gene Namu_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1421 
Symbol 
ID8447017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1568293 
End bp1569837 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content75% 
IMG OID645040552 
Producthistidine ammonia-lyase 
Protein accessionYP_003200811 
Protein GI258651655 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.466068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGAGCC CTCTGCAGAT CGGAACCCAG CCCCTGACCC AGGCCGACGT GGTCGCGGTC 
GTGCGGCACG CCCGCCCGGT CGTCCTCGGC CCCGACGCGC TGGCCGCGAT GGCCGCCAGC
CGGGCCGTCG TGGACACCCT GGCCGGCGAC GCGCACCCGC ACTACGGCAT CTCCACCGGG
TTCGGTGCCC TAGCCACGAC GTCCATCCCG CCGTCCCGGC GGACGGCGCT GCAGCAGTCG
CTGATCCGGT CGCACGCCGC CGGCAGCGGT GAGCCGGTCG AGCGGGAGGT GGTCCGCGGC
CTGATGCTGC TGCGGCTGGC CACCCTGGCC CGCGGTCGCA CCGGTGTCCG GCCGGCCACT
GCCGCCCTGC TGGCCGCCAC CCTGTCCGCG GGGATCACCC CGGTCGTCCC CGAATACGGT
TCGCTGGGGT GCTCGGGCGA CCTGGCCCCG CTGGCCGCCG TCGCCCTGAC CCTGATCGGC
GAAGGGGAGG TGCACGACGC GCGGGGTCGG CGCCGCCCGG CGGCCGATGC GCTGGCCGAG
GCCGGTCTGA CCCCGGTGAC GTTGGCCGAG AAGGAGGGCC TGGCCCTGAT CAACGGCACC
GACGGCATGC TGGGCATGCT GGTGCTGGCC CTGCACGACC TGGCCGGTCT GCTGGACGCG
GCCGACCTGG CCGCCGCGAT GTCGGTGGAA GCCCTGCTGG GCACCGACCG GGTGTTCGCC
GCAGACCTGC AGCGGTTGCG CCCGCAGGCC GGACAGGCGG TCAGTGCGGC CCGGATCGCG
GCCGCGCTGG CCGGCTCGCC GATCGTTGCC TCGCACGCCG GGCCCGAGGA CACCCGGGTG
CAGGACGCCT ACTCGCTGCG CTGTGCCCCG GCCGTGCACG GCACCGCGCG GGACACGGCG
GGGTACGCCG CCGCCGTCGC CGATCGGGAG CTGGCCTCCT CCATCGACAA CCCGGTCGTG
CTGCCGGACG GACGGGTCGA GTCCAACGGC AACTTCCACG GCGCCCCGAT CGCGGCCGTG
CTGGACTTCC TGGCCATCTC GGTGGCCGAC GTGGCCAGCA TCTGCGAGCG GCGGACGGAC
CGGATGCTCG ACCGCACCCG GTCGCACGGG CTGCCGCCGT TCCTGGCCCA CGAGGTCGGC
GTGGACTCCG GGCTGATGAT CGCCCAGTAC ACCCAGGCCG GCATCGTCAG CGAGCTCAAG
CGGCTGGCCG TCCCGGCCTC GGTCGACTCG ATCCCCTCGT CGGCCATGCA GGAGGACCAC
GTTTCCATGG GCTGGCACGC GGCCCGCAAG CTGCGCCGCG CGGTGGACGG GCTGCGCCAG
GTCATCGCCA TCGAGATGCT CGCCGCGGCA AGAGCTCTGG ACCTGCGCGC GCCGCTGGCC
GCGGGCCCGG TGACCGGCGC GATGCGCGAG GTCATCCGGA CGGCGGTGCC CGGCCCGGGG
CCGGATCGTC ATCTGGCGCC GGAGATCGAG GCCGTGGTAG CACTGTTGGC GTCGGGAGCC
ATCCTCGCCG CCGGGTCGCC GGCCGCACCC GGCCCGGTCC GATGA
 
Protein sequence
MTSPLQIGTQ PLTQADVVAV VRHARPVVLG PDALAAMAAS RAVVDTLAGD AHPHYGISTG 
FGALATTSIP PSRRTALQQS LIRSHAAGSG EPVEREVVRG LMLLRLATLA RGRTGVRPAT
AALLAATLSA GITPVVPEYG SLGCSGDLAP LAAVALTLIG EGEVHDARGR RRPAADALAE
AGLTPVTLAE KEGLALINGT DGMLGMLVLA LHDLAGLLDA ADLAAAMSVE ALLGTDRVFA
ADLQRLRPQA GQAVSAARIA AALAGSPIVA SHAGPEDTRV QDAYSLRCAP AVHGTARDTA
GYAAAVADRE LASSIDNPVV LPDGRVESNG NFHGAPIAAV LDFLAISVAD VASICERRTD
RMLDRTRSHG LPPFLAHEVG VDSGLMIAQY TQAGIVSELK RLAVPASVDS IPSSAMQEDH
VSMGWHAARK LRRAVDGLRQ VIAIEMLAAA RALDLRAPLA AGPVTGAMRE VIRTAVPGPG
PDRHLAPEIE AVVALLASGA ILAAGSPAAP GPVR