Gene Noca_0655 details

Gene Information

Locus tag: Noca_0655
Symbol:
ID: 4599518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 694654
End bp: 695829
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 74%
IMG OID: 639775254
Product: fumarylacetoacetate hydrolase
Protein accession: YP_921868
Protein GI: 119714903
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
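
The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(694654, 695829)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both results agree with the Gene Length and Protein Length fields of the record.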


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGACCT GGGTCCCCGG CGCGGCCGGC TCCGGGTTCG ACGTCGACCA CCTGCCGTAC 
GGCGTGTTCT CCCGCGCCGG GGAGGAGCCG CGGGTCGGCG TCCGCATCGG CGACCAGGTG
CTCGACCTGG CACCGGTCGC CGCCGCCGAG ATGCTCGACG TCCACCGCGT CCTCGAGGAG
CGCTCGCTGA ACGCGCTGAT GGCGGAAGGG CGCCGGGTCC GCGAGTCGGT GCGCGCCTGG
GTCACCGGGC TGCTCAGCGA CGAGGCCGAG CGCGACCTGG TCGAGCCGCA CCTGGTCCCG
CTCGCAGAGG TCGCCCTGCA CCTGCCGGTC GAGGTGGCCG ACTACGTCGA CTTCTACGCG
AGCGAGCACC ACGCCTCCAA CGTCGGCCGG ATCTTCCGCC CCGACGCCGA GCCCTTGCTG
CCGAACTGGA AGCACCTGCC GGTCGGCTAC CACGGTCGCG CCGGCACGGT GGTCGCGTCC
GGCACCCCCG TCGTCCGCCC GTGCGGGCAG CGGCGCGGCG ACGCAGGCCC GACGTACGGC
CCCTCGACAC GGCTCGACAT CGAGGCCGAG CTCGGCTTCG TGGTCGGCAC GCCGTCCGAG
CTCGGGACTC CGGTGCGGTA CGACGCGTTC GCCGACCACG TGTTCGGCGT GGTCGGCCTC
AACGACTGGT CGGCGCGCGA CATCCAGGCC TGGGAGTACG TACCCCTGGG GCCGTTCCTC
GGCAAGTCCT TCGCGACCTC GGTGTCCCAG TGGGTGACGC CGCTCGAGGC GCTCGACGCG
GCGTGGGTCG ACCTGCCCGG CCAGGACCCG GAGCCGTTGC CCTACCTGGG CCCCGGCGCC
ACCCGGGGGC TCGACATCGA CGTCGAGGTG GTCGTCAACG GCGAGGTCGT CAGCCGGCCG
CCCTACCGCA CGATGTACTG GTCGCCGGCG CAGCTGCTCG CCCATCTCAC GGTGAACGGA
GCGAGCCTGC GCACCGGCGA CCTCTACGCG TCGGGCACCA TCAGCGGCCC GGAGCCGGAC
CAGCGCGGCT CACTGCTGGA GCTGGGCTGG GGCGCCGACG ACGCGTTCCT CGACGACGGC
GACGAGGTCG TGCTCCGCTA CGCCGCACCC GGCACGTCCG GCGGCCGGAT CACGCTCGGT
GAGGTGGCCG GGGTCATCGC GCCCGCACGC ACCTGA
 
Protein sequence
MRTWVPGAAG SGFDVDHLPY GVFSRAGEEP RVGVRIGDQV LDLAPVAAAE MLDVHRVLEE 
RSLNALMAEG RRVRESVRAW VTGLLSDEAE RDLVEPHLVP LAEVALHLPV EVADYVDFYA
SEHHASNVGR IFRPDAEPLL PNWKHLPVGY HGRAGTVVAS GTPVVRPCGQ RRGDAGPTYG
PSTRLDIEAE LGFVVGTPSE LGTPVRYDAF ADHVFGVVGL NDWSARDIQA WEYVPLGPFL
GKSFATSVSQ WVTPLEALDA AWVDLPGQDP EPLPYLGPGA TRGLDIDVEV VVNGEVVSRP
PYRTMYWSPA QLLAHLTVNG ASLRTGDLYA SGTISGPEPD QRGSLLELGW GADDAFLDDG
DEVVLRYAAP GTSGGRITLG EVAGVIAPAR T
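
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 allows for an initiation codon. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed; it is not the tool that produced this record. The helper names are hypothetical, the codon string is the usual TCAG-ordered amino-acid table (shared by the standard code and table 11), and the start-codon set is the one commonly listed for table 11, included here as an assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; the first codon becomes M if it is an allowed start codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        # str.index raises ValueError on any base other than T, C, A, G.
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 18 bases of the gene sequence, with a TGA stop appended for illustration.
print(translate_cds("GTGAGGACCT GGGTCCCC TGA"))  # MRTWVP
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the result to be compared against the protein sequence and GC content fields of the record.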