Gene Noca_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0657 
Symbol 
ID4599520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp696778 
End bp697992 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content71% 
IMG OID639775256 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_921870 
Protein GI119714905 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCACT ACCGACGGAT CGGTGCGGTC CCGCCGAAGC GGCACACGCA GGCCCGCGAC 
CCCGACGGCC GGCTCTACTA CGAGGAGCTG ATGGGCGAGG AGGGATTCTC CTCGGACTCG
TCGCTGCTCT ACCACCGCGG CGTGCCCTCC GCCATCGTCG CCGCGGAGCC GTGGGAGCTC
CCGGACCAGA GCCGCACGCC CAACCACCCG CTCAAGCCGC GGCACCTGCG GCTCCACGAC
CTCGAGACCG GCGGCGACGC CGTGACCAGC CGTCGGCTCG TGCTCGCCAA CGCCGACGTG
CGGATCTCCT ACGTGGTCGC CGGGTCCGAG CCCTCCGCCT ACTACCGCAA CGCGATCGGC
GACGAGTGCG TGTACGTCGA GTCCGGGGCC GGCGTGGTCG AGACCGTGTT CGGCGTGGTG
GGCTACCGCG CCGGCGACTA CGTCGTGATC CCGCGCGCGA CCACGCACCG GTGGGTGCCG
GCCCCTGGGT CCGAGGACCC GAGCCGGCTC TACGCGATCG AGGCGAACAG CCACATCGCC
CCGCCCAAGC GCTACCTGTC CCGCTACGGC CAGCTGCTCG AGCACGCGCC GTACTGCGAG
CGGGACCTGT ACGGCCCGAC CCAGCCGTTC ACGGCCGACG GCGGCGACGT CGACGTCCTC
GTCAAGCACC GGACCGGCGG CGGGATCGTC GGGACCCGGA TGACCTACGC GACGCACCCC
TTCGACGTGG TCGGTTGGGA CGGCTGCCTG TACCCCTACA CGCTCAACAT CGAGGACTAC
ATGCCGATCA CCGGCAAGGT GCACCAGCCG CCACCGGTGC ACCAGGTCTT CGAGGGGCAC
AACTTCGTGG TCTGCAACTT CCTGCCGCGC AAGGTCGACT ACCACCCGCT CGCGATCCCG
GTGCCGTACT ACCACTCCAA CGTCGACAGC GACGAGGTGA TGTTCTACGT CGGCGGCGAC
TACGAGGCGC GCAAGGGCTC CGGCATCCGC ATCGGGTCGA TCTCGCTGCA CCCCGGCGGA
CACGCCCACG GGCCGCAGCC CTCGGCGATC GAGGCCTCGC TCGGGGTGGA GTACTTCGAG
GAGTCGGCGG TCATGGTCGA CACCTTCGCC CCCCTCGACC TCGGCGAGGC GGGCCTCGCG
GTCGAGGACC CGGCGTACGC GTGGAGCTGG GCCGGGCGGG GGCCCGAGGA CCCGCCGGTC
TTCTCCAACT CGTGA
 
Protein sequence
MAHYRRIGAV PPKRHTQARD PDGRLYYEEL MGEEGFSSDS SLLYHRGVPS AIVAAEPWEL 
PDQSRTPNHP LKPRHLRLHD LETGGDAVTS RRLVLANADV RISYVVAGSE PSAYYRNAIG
DECVYVESGA GVVETVFGVV GYRAGDYVVI PRATTHRWVP APGSEDPSRL YAIEANSHIA
PPKRYLSRYG QLLEHAPYCE RDLYGPTQPF TADGGDVDVL VKHRTGGGIV GTRMTYATHP
FDVVGWDGCL YPYTLNIEDY MPITGKVHQP PPVHQVFEGH NFVVCNFLPR KVDYHPLAIP
VPYYHSNVDS DEVMFYVGGD YEARKGSGIR IGSISLHPGG HAHGPQPSAI EASLGVEYFE
ESAVMVDTFA PLDLGEAGLA VEDPAYAWSW AGRGPEDPPV FSNS