Gene Noca_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3749 
Symbol 
ID4598611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3969313 
End bp3970293 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content73% 
IMG OID639778357 
Productaldo/keto reductase 
Protein accessionYP_924936 
Protein GI119717971 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCAGC GCACCCTCGG CCGGACCGGG CGTCCGGTCT CCGTGGTCGG ACTCGGCACC 
TGGCAGCTCG GCGCGGACTG GGGGGACGTC TCCGAGGACG ACGCCCTCGC CGTCCTCGGC
GCCTCGGTCG ACGCCGGTGT CACGTTCTTC GACACCGCCG ACGTGTACGG CGACGGGCGC
AGCGAACAGG TGATCGGCCG GTTCCTGCGC GAGCACCCCG AGGTCGTCGT GGCCACCAAG
ATGGGTCGTC GCGTCGAGCA GCTGCCCGAG CACTACACGC TCGAGAGCTT CCGGGCCTGG
ACCGACCGGT CGCGACGCAA CCTCGGCGTC GACACCCTCG ACCTCGTGCA GCTGCACTGC
CCCCCGAGCG CGGTCATCGA CGCGGACGCG ACGTACGACG CGCTCGACAC GCTGGTCGCC
GACGGCGCGA TCGCGGCGTA CGGCGTGAGC GTCGAGACCG TCGACCAGGC ATTGTCCGCC
ATCGCGCGCC CGCACGTCGC GTCGATCCAG ATCATCCTCA ACGCGTTCCG CCTCAAGCCG
TTGGACCGGG TGCTGCCGGC GGCGGCGGAG GCCGGGGTCG CGATCATCGC CCGGGTGCCG
CTCGCGTCCG GCCTGTTGTC GGGTCGCTAC GACGAGCACA CGACGTTCGC CCCGGACGAC
CACCGCAGCT ACAACCGCGA CGGCAGCGCC TTCGACGTGG GGGAGACGTT CTCGGGCGTC
GACTACGAGA CCGGCGTCCG CGCGGCGCAG GAGTTCTCGC AGCTGGTGCG TGACCTGGAC
CTGACGCCCG CGCAGGCGGC GATCGCGTGG GTGGTGCAGC AGCCGGGCGT CACCACGGTG
ATCCCGGGCG CCCGCAACGC CGAGCAGGCG CGCGCCAACG CGGTCGCCGG ACTGGCCGGG
CCGCTGCCCG GCTCCGTCCT GGACGGGGTC ACGCGGATCT ACGACACGAG GCTCCGCGCG
GCGATCCACG ACCGCTGGTA G
 
Protein sequence
MEQRTLGRTG RPVSVVGLGT WQLGADWGDV SEDDALAVLG ASVDAGVTFF DTADVYGDGR 
SEQVIGRFLR EHPEVVVATK MGRRVEQLPE HYTLESFRAW TDRSRRNLGV DTLDLVQLHC
PPSAVIDADA TYDALDTLVA DGAIAAYGVS VETVDQALSA IARPHVASIQ IILNAFRLKP
LDRVLPAAAE AGVAIIARVP LASGLLSGRY DEHTTFAPDD HRSYNRDGSA FDVGETFSGV
DYETGVRAAQ EFSQLVRDLD LTPAQAAIAW VVQQPGVTTV IPGARNAEQA RANAVAGLAG
PLPGSVLDGV TRIYDTRLRA AIHDRW