Gene Noca_4067 details


Gene Information

Locus tag: Noca_4067
Symbol:
ID: 4596581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4293643
End bp: 4295049
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 76%
IMG OID: 639778673
Product: MmgE/PrpD family protein
Protein accession: YP_925251
Protein GI: 119718286
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID:
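
The coordinate and length fields above agree with each other, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `cds_lengths` is illustrative and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4293643, 4295049)
print(gene_len, protein_len)  # 1407 468
```

The result matches the listed gene length of 1407 bp and protein length of 468 aa.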


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.530176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCTC GAACCCTCGC CCGCTGGATC ACCTCGGAGC TCGCCCTACC CGGAGAGGTC 
GAGGAGGCGG CCCTGCGCCA CCTCCTCGAC GGGCTGGGGA CCGCGGTGGC CGCAGCCCGG
ACCGGCGCGG CCGACCCCGC GGTCGCCGTG GCCACGAGGC TCGCCGGACC TCCCGAGGCG
ACGATCCTCG GGGGCACCAC CCAGGTGAGT GCTCCCGCCG CCGCGCTGGC GAACGGGACC
CTCGTGCACG CCCTGGACTT CGACGACACC CACGCGGGCG GTCTGGTGCA CGCGACGGCC
GTGGTGCTGC CCGCCGCGCT CGCGGTCGGC GAAGAGGTCG GTGCCAGCGG GCGCCGGGTC
CTCGACGCCG CGGTCGTCGG CTACGAGGTG GCCTGCCGGG TCGCCGCGGC CGCACCCCAC
GGCTTCCACG CCCGAGGACT GCACGCCACC ATGGTGGCCG GCGTCTTCTC ATCCGCGGCG
ACGGCTGCCC GTCTCTACCG TCTCGACGCG GACACCACGA CCCAGGCACT CGGCATCGCC
GGCAGCCAGG CCGGCGGGCT CCTGGCCTTC CTCGGCACCG GCGCCAGCAC CAAGCAGCTG
CACCCGGGCT TCGCCTCCCA GGCCGGGATC CTCGCCGCGC GGCTGGCGGC GGCCGGCGCG
ACCGGACCCG AGACCGTGTT CGACGGCCCG CACGGCATCT ACGACGCACT GGCCACCGGT
GACGTCGATC GCTCCGTCAT CCTCGGTGGC CTGGGCCGGA CCTGGGAGAC CACCCGGATC
GGCATCAAGC CCTGGCCGGC CTGCCAGCTC TCGCACGCGA CCATGGCGGC CGCACGCGAT
GCCCTGTCGC GGGCCGGGGT GTCCGCGGAC GCCGTCGTCT CGGTGCGTGC GCGGGTTCAC
CCGGACTCCG CCGCGGTCGT CTGCGCCGAG GACCGCGACC TGGCCCACCC GGCCAGCCCG
TACGCCGCGA AGTTCTCCCT CCCCTGGACG GTGGCGGCCA TGCTGCTCGA CGGCCGGGTC
GACGTCACCA CCTACGCGCC GGGCCAGCTC GGCCGCCGCG ACGTCTCGGA ACTGGCGGCG
CGGGTCGCGT GGGACGTGGT CGACCCCGGC GCCGTCGCGG CGGACAGTCC CGGCGACGTG
GTCCTGACCC TGGTGGACGG TCGCGAGGTC GCCGGGCACG TCGACCGGAG CCCCGGAGGG
GGGTCGGCAC CGCTCGCCGA CACCGACCTG ATGGCCAAGC TGACCGGCAA CGTCGGGCCG
CTCGCGTCCC GGCTCACGGC CGCCGTGCGC CGGCTCCCCA CTGCGACCGA CCTGGCCGCC
GTTCTGCACC TGGTCGCGGC GGCGTCCGCA CCCGAGGGCG TCCGCGGTGC CGAACCGTCC
CGACCGATCG AACCTCCGGA GGCCTGA
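
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks used above; `gc_content` is an illustrative helper name, and the string passed in the usage lines is only the first line of the sequence, not the full gene.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (block separators, line breaks) is ignored.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage on an excerpt; pass the full gene sequence to compare with the record.
excerpt = "ATGACTGCTC GAACCCTCGC CCGCTGGATC ACCTCGGAGC TCGCCCTACC CGGAGAGGTC"
print(f"{gc_content(excerpt):.0%}")
```

Running it on all 1407 bases, rather than the excerpt, gives the figure to compare against the GC content field.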
 
Protein sequence
MTARTLARWI TSELALPGEV EEAALRHLLD GLGTAVAAAR TGAADPAVAV ATRLAGPPEA 
TILGGTTQVS APAAALANGT LVHALDFDDT HAGGLVHATA VVLPAALAVG EEVGASGRRV
LDAAVVGYEV ACRVAAAAPH GFHARGLHAT MVAGVFSSAA TAARLYRLDA DTTTQALGIA
GSQAGGLLAF LGTGASTKQL HPGFASQAGI LAARLAAAGA TGPETVFDGP HGIYDALATG
DVDRSVILGG LGRTWETTRI GIKPWPACQL SHATMAAARD ALSRAGVSAD AVVSVRARVH
PDSAAVVCAE DRDLAHPASP YAAKFSLPWT VAAMLLDGRV DVTTYAPGQL GRRDVSELAA
RVAWDVVDPG AVAADSPGDV VLTLVDGREV AGHVDRSPGG GSAPLADTDL MAKLTGNVGP
LASRLTAAVR RLPTATDLAA VLHLVAAASA PEGVRGAEPS RPIEPPEA
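
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows. It assumes the standard amino-acid assignments, which table 11 shares with the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACTGCTC GAACCCTCGC CCGCTGGATC"))  # MTARTLARWI
```

The same function applied to the final bases of the gene, `CGACCGATCG AACCTCCGGA GGCCTGA`, yields `RPIEPPEA` and stops at the TGA codon, matching the end of the protein sequence.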