Gene Noca_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4149 
Symbol 
ID4596663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4383460 
End bp4384989 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content73% 
IMG OID639778755 
Productpurine catabolism PurC domain-containing protein 
Protein accessionYP_925333 
Protein GI119718368 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGCCGA CCGTTCAGGA CCTGCTCGAC CTCCCGGTGC TGCGGCGCGC CCGGCCCGAG 
GTGGCCGTCG GCAGCCGCCT GGACGAACGG GAGGTGCGCT GGGTGCACAC CTCCGAGATC
TATGAGATCT CACCGTTGCT CAAGGGCGGC GAGGTGCTCT TGACCACCGG GCTCGGCCTG
GTGGGCGTCG GGGCGGGAGC GGTGGCGGCG TACGTCGAGG CGCTGGCTCG CCAAGGCGTC
GCGGCGTTGG TGCTCGAGGT CGGCCGGACG TTCACCCACC CGCCCGAGGC GCTGGTGGCA
GCGGCCCGGG CGCATGACCT GCCTCTCGTG CTGCTCCACG GAGTGGTCCC GTTCATCGAT
GTCACGGAGA CGGTCTACCC GATGCTGATC GGCGGTGAGG TCGAGCAGCT GCGCGAGCTC
GAGCGGGCCT CGACCAGGTT GCACGAGGCG CTCACCACGG GGGCTGGGCC GGGTGAGCTG
CTGGCGCTGG TCGGTGACAT CTGCGGCTCG CCGGCCGGGC TGTACACCTC GGCCGGTGAC
CTGCTGGGCG GGGAAGACGT GCGCTCCCGG GAGGGTGGGA CGCTCGAGTT CGACGTGGGT
CGGAGTCCTT GGGCGGTGCT CGCCCTGCCA GCGTCGCAAC GCTCGACCCA CACCGGCGTG
AGGCGGTTGG CCGAGTTGTG CGCGACCATG ATCGACATCC GCCTGGGCGC CATGTTCCGA
GCCGGCCACC GCCCCGGGGC CGACGCCGAC CTGGTCCGCA TGCTGGCGGC GGGACAGTAC
CTCTCCAGCG CGGACATCGA GGTGCAATCA CGCGCCGCCG GACTCGTCGT GCGCCCGGGT
TGGCGAGCCG TCGGCCTCGC GGTGGACCTG CGCCTGCCAA GCTCGCTGCG CCCCGGTCTC
AACGCCACCA TCGAGGCCGC GCGGTCGGTG TTCGGAGTGT CCGCGGTGGC CGAGCTGGAC
CGCGAGTTCG TCGTGGCGAC CACGGTACGG CCAGCGGAGC TGCGCTCGCG CCTGTCCTCG
TTCGTCGACG CCTTGGAGCG CGAGCTGCAC GCCGCCGTGG GCACAGCGGC GATCCGGGTC
TCGGCGGGCC ATCCGGTTGG AGACGCCGCG GGTTTGGCTC GATCGCTGCC GGCCGCCCTC
GACGCGCTCC ACCTGGCCCG CCGGCTCGGC CTGGGGTCAC GGACGGTGCT CGCCAGCGAC
CTGGGGGTCT TCCACCTGCT CTCGAGCGCC ACCGCGGACG TCGAGCTCGA GCGGTTCGTC
CAGGAGCAAC TCGGCGCGTT GCTCGAGCAC GATGCGCGGC ACGGCTCGGA CCTCGTCCAG
ACCCTGGACG CATACCTGGA GGCGGGCCTG GGCAAGACGG CGGCGGCCCA GGCACTCGGG
ATCCGCAGGC AGACCCTGTA TGCACGCTTG GAGCGGATCA GCAGGCTGCT CGGCGGCCTG
GACATCGAGG CGCGTCAGGC GCGCACAGCG CTCGACCTCG CACTGGTGAG CTGGCGCCTG
AGGACCTCGG CCGTCACCGG CCGGCCCTGA
 
Protein sequence
MAPTVQDLLD LPVLRRARPE VAVGSRLDER EVRWVHTSEI YEISPLLKGG EVLLTTGLGL 
VGVGAGAVAA YVEALARQGV AALVLEVGRT FTHPPEALVA AARAHDLPLV LLHGVVPFID
VTETVYPMLI GGEVEQLREL ERASTRLHEA LTTGAGPGEL LALVGDICGS PAGLYTSAGD
LLGGEDVRSR EGGTLEFDVG RSPWAVLALP ASQRSTHTGV RRLAELCATM IDIRLGAMFR
AGHRPGADAD LVRMLAAGQY LSSADIEVQS RAAGLVVRPG WRAVGLAVDL RLPSSLRPGL
NATIEAARSV FGVSAVAELD REFVVATTVR PAELRSRLSS FVDALERELH AAVGTAAIRV
SAGHPVGDAA GLARSLPAAL DALHLARRLG LGSRTVLASD LGVFHLLSSA TADVELERFV
QEQLGALLEH DARHGSDLVQ TLDAYLEAGL GKTAAAQALG IRRQTLYARL ERISRLLGGL
DIEARQARTA LDLALVSWRL RTSAVTGRP