Gene Noca_4398 details


Gene Information

Locus tag: Noca_4398
Symbol:
ID: 4596916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4650257
End bp: 4651858
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 75%
IMG OID: 639779008
Product: histidine ammonia-lyase
Protein accession: YP_925582
Protein GI: 119718617
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
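
The coordinate and length fields above are linked by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumed to be present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4650257, 4651858)
print(length)                           # 1602
print(expected_protein_length(length))  # 533
```

Both results agree with the Gene Length and Protein Length fields of the record.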


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCACG ACTCCCACCC GTCGGTCGGC GTCGGCGTCG GACCGGTCTC CTTCGCCGAG 
CTCCGCGCCG TGGCGCGCGA CGGCGCCCCG GTCCACCTGA CCGACGACGC ACTGGCGGCG
ATCGCCCGGG CGCGCGCGGT GGTCGAGGAG CTGGCCGCCT CCGAGACTCC CGTGTACGGC
GTCTCGACGG GCTTCGGCGC CCTGGCGACG CGGCACATCC CCGCCGAGAT GCGCGCCCAG
CTGCAGCGCT CCCTGGTCCG CTCGCACGCC GCCGGCTCCG GCCCCGAGGT GGAGCGCGAG
GTGGTCCGGG GGCTGATGCT GCTGCGGCTC TCGACGCTGG CCACCGGACA CACCGGCGTC
CGGGTCGAGA CCGCCCGCCT GCTCGCCGGC CTGCTCGAGC ACGGCATCAC ACCTGTGGTG
CGCGAGTACG GCTCGCTCGG CTGCTCCGGT GACCTCGCCC CGCTGGCCCA CTGCGCCCTG
GCCTTGATCG GTGAGGGCGA GGTCCGCGAC GCGTCCGGCG CGCTGCTGCC GGCCGCCGAC
GCGCTGGCCG CCGTCGGGCT GGAGCCGGTC GAGCTCGCCG CCAAGGAGGG CCTCGCGCTG
ATCAACGGCA CCGACGGGAT GCTCGGCATG CTGGTGCTGG CCATCGAGGA CCTGCGGATG
CTGCTGCGCA CCGCGGACAT CGCCGCCGCC ATGTCGGTGG AGGGCCAGCT CGGCACCGAC
CGGGTCTTCG CCGCGGAGCT CCAGGCGATC CGGCCGCACC CGGGCCAGGC GCGCTCGGCC
GCGAACCTCA CCGCGCTGCT CGCCGACTCA GGCGTGGTGG CGTCGCACCG CGGCCCGGAC
TGCAACCGGG TCCAGGACGC CTACTCCCTG CGCTGCTCGC CCCAGGTGCA CGGTGCCGCC
CGCGACACCG TCGAGCACGC GGCGACGGTC GCCACCCGCG AGCTCGCCTC GGCCGTGGAC
AACCCGGTGG TCGTCTTCGA CGACCTGGGC GGGCGGGGGA CCGGGGGTCT GGGGGGCGGA
GCCCCCGGGC GGGTCGAGTC GAACGGGAAC TTCCACGGGG CGCCGGTCGC CTACGTCCTC
GACTTCCTCG CGATCGTCGC GGCCGACGTG GCCTCGATCA GCGAGCGTCG TACCGACCGG
TTCCTCGACA AGGCGCGCAA CCACGGGCTG CCGCCGTTCC TCGCCGACGA CCCCGGGGTC
GACAGCGGGC ACATGATCGC GCAGTACACC CAGGCCGCGA TCGTCTCCGA GCTGAAGCGG
CTCGCCGTGC CGGCCTCGGT CGACTCGATC CCCTCCAGCG CGATGCAGGA GGACCACGTG
TCGATGGGGT GGTCGGCCGC CCGCAAGCTG CGCCGCTCGG TCGACGGGCT GACCCGCGTC
GTCGCGATCG AGGTGCTCAC CGCGGCCCGG GCGCTCGACC TGCGCCGACC GCTCGAGCCG
TCGCCGGCCA CCGGTGCCGT CATCGGGCTG CTGCGCGGCG CCGGGGTCGC CGGCCCCGGA
CCCGACCGAC ACCTCTCGCC CGAGATCGAG ACCGTGGTCG GCCTGGTCTC CTCCGGCGCC
GTACTCCATG CTGCCGAGAC CGTGATCGGA GAACTGTCGT GA
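
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGAACCACG ACTCCCACCC GTCGGTCGGC GTCGGCGTCG GACCGGTCTC CTTCGCCGAG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base fragment
```

Passing the whole gene sequence to `clean_sequence` and then to `gc_percent` gives the figures to compare against the record.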
 
Protein sequence
MNHDSHPSVG VGVGPVSFAE LRAVARDGAP VHLTDDALAA IARARAVVEE LAASETPVYG 
VSTGFGALAT RHIPAEMRAQ LQRSLVRSHA AGSGPEVERE VVRGLMLLRL STLATGHTGV
RVETARLLAG LLEHGITPVV REYGSLGCSG DLAPLAHCAL ALIGEGEVRD ASGALLPAAD
ALAAVGLEPV ELAAKEGLAL INGTDGMLGM LVLAIEDLRM LLRTADIAAA MSVEGQLGTD
RVFAAELQAI RPHPGQARSA ANLTALLADS GVVASHRGPD CNRVQDAYSL RCSPQVHGAA
RDTVEHAATV ATRELASAVD NPVVVFDDLG GRGTGGLGGG APGRVESNGN FHGAPVAYVL
DFLAIVAADV ASISERRTDR FLDKARNHGL PPFLADDPGV DSGHMIAQYT QAAIVSELKR
LAVPASVDSI PSSAMQEDHV SMGWSAARKL RRSVDGLTRV VAIEVLTAAR ALDLRRPLEP
SPATGAVIGL LRGAGVAGPG PDRHLSPEIE TVVGLVSSGA VLHAAETVIG ELS
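
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons, which does not matter here because this gene begins with ATG. The function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first, second, third base
# each cycling through T, C, A, G); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last 12 bases of the gene sequence above.
print(translate("ATGAACCACG ACTCCCACCC GTCGGTCGGC"))  # MNHDSHPSVG
print(translate("GAACTGTCGTGA"))                      # ELS
```

The two outputs match the first ten and last three residues of the protein sequence listed above.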