Gene Noca_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0401 
Symbol 
ID4597713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp430624 
End bp432582 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content72% 
IMG OID639775016 
Productshort chain dehydrogenase 
Protein accessionYP_921631 
Protein GI119714666 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCATTCG CGATGTCCCC GGCGGCGTCG GCCGCTCTGA CCGATCTCGT CAGGGTGTCC 
CGGCTCCTGG GTGCCGACCC CGCGCTCGTC CTTCACGGCG GCGGGAACAC GAGCCTGAAG
GCCACGACCA CCGACCTGAC CGGGTCGACC GTGGACACCC TCTTCGTCAA GGCGAGCGGA
CACGACCTCG CCTCGATCGA CGCGGCCGGC TTCGCACCGC TCCGGCTGGC ACGCCTGCGC
GAGATCCTGC CGCCCGTCGT CGTTCCCGAC GACCTGCTCA TCAACGAGCT GCGCTGTGCG
CTGCTCGACG CGGCGGCACC GGATCCCTCG GTGGAGACCC TGCTGCACGC GCTGCTGCCC
CGACCGGCCG TCATCCACTC ACACGCCGAC GCGTTGCTCG CGGTCGCCAA CACACCCGAT
GGCCGCGAGC GGCTCGCCGA CCTCTTCGGG GATCGCGTGA TCCTGGTGGA CTACGCCATG
CCGGGACCAG ACCTGGTGGC CGCCTGCGCG GCCGCCTGGA GCCGGGAGGG CGGCAACCAC
ACCCAGGGCG TCGTCGTGCT GGGGCACGGC CTGTTCACCG TCGGCTCCAC GCCGGACGAG
GCGTACCAGC ACCACGTCGA TCTGGTCACG GACGTGGCCG GCCTCCTGCG CGCATCCTCG
ACGCGCGGCC AGTCGCAGCC GGCTGCGCCC CTGCCGCCTG CAGACCCGCT GGCCGTGGCC
CGGCTGCGGC AGTCCCTGAG CCGGGCCGCC GGGCACCCGA TGGTGGTCAC CTCCCACCGC
GACAACGAGG TTGCGTCCTT CGTCGGGGAC GACCGCCTGC TCGCGGCCGC TCAGCGCGGG
CCGCTCACCC CCGACCACGT CATCTGGACC AAGCACAGCC CGATGATCGG CACCGATCTG
GACGCCTTCG TCCGGGCCGA GGACGACTAC TTCACCCGGC ACGCCACGCG CCGCGGCCGC
ACCCTGCTGC GGCTCGATGG GGCGCCCCGG GTGGTGCTCG ATCGCACCCT CGGCATGCTC
ACCGCGGGAC GAACGGCAGG CGAGGCGGCG CGAACCGCGG ACATCTACCG GCACACCCTC
GGCGCGATCG AGCTGGCCGA GGCCCTCGGC GGCTACCGGC CGGTCGAGGA GGAGCACGTC
TTCGACCTCG AGTACTGGTC GCTGCAGCAG GCGAAGCTCT CCCGCAAGGA CAGCCATCGC
CCGCTCTCCG GCCAGGTGGC GGTCGTGACC GGCGCGGCCT CGGGCATCGG GCACGCGTGC
GCGGTCAGCC TGCTGGAGTC GGGCGCCAGC GTGGTGGGCT GGGACGTCAG CCCATCGGTA
CGCGACGCGA TCTCCTCACC GGAGTGGCTC GGGCTCGCCG TCGACGTCAC CGACTCCGCT
GCGGTGACCG AGGCCATCCA GCGCGGGGTG GAGGCCTTCG GTGGGGTGGA CATCCTCGTC
GTCGCGGCGG GCATCTTCCC CACCAGCGCC AACCTCGGCG AGATGTCGAT GCAGGCCTGG
CGCCGGGCGA TGGCGGTGAA CGTCGACGCG GTCGCGGACC TCTACGGACA GATCCACCCG
TTCCTCGCCG CAGGCACCCC TTACGGCCGC GTCGTGCTGA TCGCGTCGAA GAACGTCGCC
GCCCCGGGCC CTGGCGCCGC GGCCTACTCC GCGTCCAAGG CAGCGGTGAC CCAGCTGACC
CGGGTCGCGG CCCTCGAGTG GGCCCCCCAT GGCATCCGCG TCAACATGAT CCACCCCGAC
GCCGTGTTCG ACACGGGCCT GTGGACACCG GAGCTGCTCG AGGCGCGAGC CGAGCACTAC
GGCATGACGG TCGAGCAGTA CAAGCGTCGC AATCTCCTCT CCACGGAGGT CACGAGCAAG
GCGGTCGGGG ACCTGGCCGT CACCCTGGCG GGTCCCGCCT TCGCCGCCAC GACCGGCGCC
CAGATTCCCA TCGACGGCGG CAACGAGCGG GTCATCTGA
 
Protein sequence
MPFAMSPAAS AALTDLVRVS RLLGADPALV LHGGGNTSLK ATTTDLTGST VDTLFVKASG 
HDLASIDAAG FAPLRLARLR EILPPVVVPD DLLINELRCA LLDAAAPDPS VETLLHALLP
RPAVIHSHAD ALLAVANTPD GRERLADLFG DRVILVDYAM PGPDLVAACA AAWSREGGNH
TQGVVVLGHG LFTVGSTPDE AYQHHVDLVT DVAGLLRASS TRGQSQPAAP LPPADPLAVA
RLRQSLSRAA GHPMVVTSHR DNEVASFVGD DRLLAAAQRG PLTPDHVIWT KHSPMIGTDL
DAFVRAEDDY FTRHATRRGR TLLRLDGAPR VVLDRTLGML TAGRTAGEAA RTADIYRHTL
GAIELAEALG GYRPVEEEHV FDLEYWSLQQ AKLSRKDSHR PLSGQVAVVT GAASGIGHAC
AVSLLESGAS VVGWDVSPSV RDAISSPEWL GLAVDVTDSA AVTEAIQRGV EAFGGVDILV
VAAGIFPTSA NLGEMSMQAW RRAMAVNVDA VADLYGQIHP FLAAGTPYGR VVLIASKNVA
APGPGAAAYS ASKAAVTQLT RVAALEWAPH GIRVNMIHPD AVFDTGLWTP ELLEARAEHY
GMTVEQYKRR NLLSTEVTSK AVGDLAVTLA GPAFAATTGA QIPIDGGNER VI