Gene Noca_0637 details

Gene Information

Locus tag: Noca_0637
Symbol:
ID: 4596054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 674153
End bp: 675613
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 64%
IMG OID: 639775236
Product: purine catabolism PurC domain-containing protein
Protein accession: YP_921850
Protein GI: 119714885
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism; [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
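
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 674153
end_bp = 675613

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1461 486
```

Both results agree with the Gene Length and Protein Length fields in the table.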


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGAA CCATCACATG GGCGCACACC TGTGAGGTGG CCGACCCCTG GAACTGGCTG 
GGAAACGGTG ATCTCCTGAT GACCGACGGC TACAGCTTCC CGTCGGACGC CGCCGCCCAG
GTCAACTTCA TCAGGCAGCT CGCGGACGCA AGCATTGTTG GCCTGGCTCT CGGCGAAGGG
TTCGCTGCCC CCCCACTAAC CGCCGAGGCC ATCGCCGCAG CGGACTCGAT CGACTTTCCC
ATCCTGATGA CGGCGCGCAA CGTCCCGTTC GTCACGATCG CCCGCTTGGT GGCGGAGGCC
AGCAGCGGAC GTGCACATTC GAGGGCAGCC AAGGTCCTGC GCCTCTATGA CGTCCTGCGT
CGAACCCATC AGGGAGCCAT GGTGGACGAC CTACTGGAGC GCCTCGGCGG CGAGCTTCAC
GCCAGCCTGC ATGTGATCGA ACTTCGGACG GGCAGCAGCC TCCTACCTGC TCCGACCGGG
CTACCGCCAG AGGTGCGCGA AGCGGCCCTC AAGAGGGCAC AGTCGGAGAA GGGACGCCTC
CCGGCCTTCA ACCGCCTGGC TGACGACCAC GTTTCAGCGC TTCTCGTACC AGTAGGAACG
CGGGGAAGTG CCGGCCTCGT GGTGCGCGCC TGGCCCGAAG GTGAGGTCCC CGACCTACTC
CTCACGCAGC ACGCAGCAAT GATCGTGGAA CTTGAGGTCG AGCGACGTGC AGCTCAGGCA
TCGCGAATCA GAGCGCGAGG AGCGGATCTC ACTCGCCGGA TGCTCGACGG CACGATCACA
CCCGAGGCAG CGGCGAGTCA GATCCGACTT CATCGGCTCG GCGACGGTCC TTGGCGGGTA
ACCCTGTGGC AGGAGGAAAA CGATGATCAC TCGACACACC CTCGAGCGAT CGGGCTTGCG
GAAGGGCTGG AGTACGTCCA GTGGCCCCAT CTGCACTTGC CGGTGGGAGA ACTCCACCTG
ATCGTTGTCG ACGACGATCG GTTCCTAGCG GGGCTTGAGC TTGACTTCGT TGACGCCACG
GTGGGGGCTA GTCAACCCGT CGCGTCGCTC ACCCGCCTCT CGGATGCGTT CCGGGAGGCT
CAATGGGCAC TTGAGAGCGC TCGCGCAGCC TCTGTGCAGA GCGCGATTTA TGGATCCCAC
GGTTCCTACT TCATGCCGAA TACCGTGGCA GAAGGCGAAG CGGCGGTTCA GCGCCTCCTC
GGCCCGATCA TCAAATACGA CGAGGAACAC GGAGCCAATC TGCTCGGCTC GCTTCAGGTC
TACTTCGAGG TCAATCGATC ATGGCAGGAG GGGGCACGCC GACTTGGCAT CCACAAGCAG
ACTCTGGTCT ATCGACTCAA GAAGATCGAA GAGATGACCG GCGCCGACCT CCGAGACTTT
GGCGTCCAGG CCGAGTTGTA CCTTGCGCTG CGCACCCGCC AACTTCTCAG CACTACCGGG
CGGGGAGCAC TCGGCGGATA G
 
Protein sequence
MQRTITWAHT CEVADPWNWL GNGDLLMTDG YSFPSDAAAQ VNFIRQLADA SIVGLALGEG 
FAAPPLTAEA IAAADSIDFP ILMTARNVPF VTIARLVAEA SSGRAHSRAA KVLRLYDVLR
RTHQGAMVDD LLERLGGELH ASLHVIELRT GSSLLPAPTG LPPEVREAAL KRAQSEKGRL
PAFNRLADDH VSALLVPVGT RGSAGLVVRA WPEGEVPDLL LTQHAAMIVE LEVERRAAQA
SRIRARGADL TRRMLDGTIT PEAAASQIRL HRLGDGPWRV TLWQEENDDH STHPRAIGLA
EGLEYVQWPH LHLPVGELHL IVVDDDRFLA GLELDFVDAT VGASQPVASL TRLSDAFREA
QWALESARAA SVQSAIYGSH GSYFMPNTVA EGEAAVQRLL GPIIKYDEEH GANLLGSLQV
YFEVNRSWQE GARRLGIHKQ TLVYRLKKIE EMTGADLRDF GVQAELYLAL RTRQLLSTTG
RGALGG
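
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the tool that produced this record: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). Only the first 60 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases, 20 codons).
first_line = "ATGCAACGAA CCATCACATG GGCGCACACC TGTGAGGTGG CCGACCCCTG GAACTGGCTG"
print(translate(first_line))  # MQRTITWAHTCEVADPWNWL
```

The output matches the first 20 residues of the protein sequence above (MQRTITWAHT CEVADPWNWL).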