Gene Noca_1011 details


Gene Information

Locus tag: Noca_1011
Symbol:
ID: 4599672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1063015
End bp: 1064223
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 75%
IMG OID: 639775610
Product: 4-carboxymuconolactone decarboxylase
Protein accession: YP_922217
Protein GI: 119715252
COG category: [R] General function prediction only
              [S] Function unknown
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
        [COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit
TIGRFAM ID: [TIGR02425] 4-carboxymuconolactone decarboxylase
            [TIGR02427] 3-oxoadipate enol-lactonase
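
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions, though they agree with the reported values. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1063015, 1064223)
print(length, protein_length_aa(length))  # 1209 402
```

Both results match the Gene Length and Protein Length fields of this record.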


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.101506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCCA CCGTCACCGC GGTCCGGATG ACCCTGACCC AGCACCGTTC GGACCTGCCG 
CTGCTCGTGC TCGGCCCGTC GCTCGGCACC ACGGCCACCA CGCTGTGGAG CGCCTGTGCG
GGCCACCTCG CCGACGCCTT CGACATCCTC GCCTGGGACC TGCCGGGCCA CGGCCACAAC
CGGGCCGCCG CAGAGGCGTT CACGATGGCC GGGCTCGCCG CAGGCGTGCT GCACGTCGTC
GACGAGGTGC TCGCCGAGCG AGGCGAGCCG GGCGGTTCGT TCCACTACGC CGGCGACTCG
GTCGGCGGCG CCGTCGGGCT GCAGCTGCTC CTCGACGCCC CGGAGCGGGT CACCGCGGCG
GTGCTGCTGT GCACCGGCCC GCAGATCGGC ACCGCCGAGT CGTGGACCGA GCGGATCGAG
AAGGTGCGGT CCTCGGGCAC CTCGAGCCTG GTGTCGGCCT CCGCCGAGCG CTGGTTCGCG
CCGGGGTTCC TCGAGCGCGA CCCCGAGTGC GGGTCGGCGC TGCTGCACGC ACTCCAGGAC
GCCGACGACA AGGGCTACGC CCAGGTGTGC GGTGCGCTGG CCGACTTCGA CGTGCGCGAC
CGGCTCGGCG AGATCGCGGC TCCGGTGCTC GCCGTGGCGG GTGCGGACGA CGTCGTCTGC
CCGCCGGAGC TGCTCCAGTC GGTCGCCGAG GGGGTGCAGC GCGGCCGGGC GGTCACGCTG
GGAGGGGTCG CCCATCTCGC GCCGGCCGAG GCGCCCGACG AGGTGGCCCG GCTGCTGCGC
CGGCACCTGC TCGGCGAGCA GCCCGTCGAC GCCCGGCCGG TGCCGAGCGA CGAGCGGTAC
GCCGCCGGGC TCGCCGTACG CCGCGAGGTG CTCGGCGACG AGCACGTCGA CCGGGCCACC
GCGGCCGTCA CCGACCTGAC CGGGGACTTC CAGGAGCTGA TCACCCGCTA CGCCTGGGGC
GAGATCTGGA CCCGGCCCGG CCTGGACCGG CGCAGCCGCT CGATGATCAC CCTCACCGCG
CTGGTAGCCC GCGGCCACCA CGAGGAGCTC GCGCTGCACC TGCGCGCCGC CCTGCGCAAC
GGCCTCAGTG TCGCCGAGAT CAAGGAGGTC CTGCTCCAGA CCGCGGTCTA CTGCGGCGTC
CCCGACGCCA ACACCGCCTT CCGGATCGCC CAGGAGGTGC TGGGGGAGGA CGGCGCCGGC
TCGCTGTAA
 
Protein sequence
MNPTVTAVRM TLTQHRSDLP LLVLGPSLGT TATTLWSACA GHLADAFDIL AWDLPGHGHN 
RAAAEAFTMA GLAAGVLHVV DEVLAERGEP GGSFHYAGDS VGGAVGLQLL LDAPERVTAA
VLLCTGPQIG TAESWTERIE KVRSSGTSSL VSASAERWFA PGFLERDPEC GSALLHALQD
ADDKGYAQVC GALADFDVRD RLGEIAAPVL AVAGADDVVC PPELLQSVAE GVQRGRAVTL
GGVAHLAPAE APDEVARLLR RHLLGEQPVD ARPVPSDERY AAGLAVRREV LGDEHVDRAT
AAVTDLTGDF QELITRYAWG EIWTRPGLDR RSRSMITLTA LVARGHHEEL ALHLRAALRN
GLSVAEIKEV LLQTAVYCGV PDANTAFRIA QEVLGEDGAG SL
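
The sequences above are listed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how the listing could be cleaned, its GC fraction computed, and the gene translated for comparison with the protein sequence. The function names are illustrative. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = clean_sequence("ATGAACCCCA CCGTCACCGC GGTCCGGATG")
print(translate(first_blocks))  # MNPTVTAVRM
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.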