Gene Noca_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1688 
Symbol 
ID4599727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1794738 
End bp1796213 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content77% 
IMG OID639776287 
ProductGntR family transcriptional regulator 
Protein accessionYP_922888 
Protein GI119715923 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.82306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCTGC CGGTCGACCT GGACGCGCGC GCCGACCGGG CGAGCGCGAT CTATCGCGCG 
CTGCTCGAGG CGATCCGCGC CGGCCGGGTC GGTGCCGGTG ACCGGCTGCC GCCGACCCGC
ACCCTGGCCC GCGACCTGGG GGTCGCGCGC AACACGGTTG CCACCGCGTA CGAGCGGCTC
GCCGCCGAGG GCCTCCTCGA CGCCCGCGTG GGCGCGGGCA CCTACGTCAC CGACCTGGCC
GCGCCGGTGC CGGCGCCGCG CCGCCCCGGC TCGCTGCACC CACGGGCCGG CTGGTCGTTC
CGGCCGCTGC CGGTGAGCGG CGAGCAGCCG GCACCGCCGT ACGACTTCCG GGTCGGCATC
CCCGACGCGT CGCTGTTCCC GTTCGACACC TGGCGTCGAC TGGTGGCCGC GGAGCTCCGC
GCCGGGGCGC ACCGCCCGGG CACCTACGCC CACCCAGCGG GCCTGCCGCA GCTGCGGGCC
GCCATCGTCC GCTACCTCGC CCTGGCCCGC GGCGTCGCAG CCGAGGCCGA CGACGTCGTG
GTGACCCACG GCACCCAGCA GGCCCTCGAC CTGGTCGCCC GGGTGCTGCT CGAGCCAGGC
GACGTCGTCG CGGTCGAGGA CCCCGGCTAC CCGTTCGCGC GCGAGCTGTT CGCGTCGCAC
GGCGCCCGCG TGGTGCCCGT CCCGGTCGAC GCGGAGGGCC TGGTCGTCGA GCGGGTCCCG
GAGCGGGCCC GGCTGGTGTT CAGCACCCCC TCACACCAGT TCCCGCTCGG TCCGCCGCTC
TCGCTGGCCC GGCGCCAGGC GCTGCTCGAG CTCGCCAACC GACACCGGGT CGCGATCGTC
GAGGACGACT ACGACAGCGA GTTCCGGTTC ACCGATCGCC CGCTCGAGAC GCTGCACGCG
ATGGACCGGC ACGGCCGGGT CGTCTACGTC GGCACCTTCT CGAAGTCGCT GCTCCCGGCC
CTGCGGGCGG GCTACCTGGT CGCTCCCGAG CCGCTGCGCG AGGCGCTGCT CGGGGCCCGC
CAGCTGGCGG ACGGCCACGG CGGTCCGGCC GAGCAGGCCG CGCTCGCCCA CTTCGTGGCC
GACGGGCTCC TCGCCCGGCA CCTCAGGCGG GCTCGGGCGA CGTACGCCGA GCGTCGCGAG
CTGGTCCGGT CCGGGCTGGA GCGGCTGCTC GCGGACCGCC TCGAGGTGGT CCCGTCGGCA
GCCGGCCTGC ACGTCGCCGC CACGTTCCGC GACGCCGAGG TCGACGACGC GGCGGTCGCG
GAGGCGGCGC TGGCGGCCGG CGTCGCGGTC GAGCCGCTCT CGGCGTACGC CGTCGGGCCG
GACGTCCCGC CGGGCCTGGT GCTCGGCTAC GGCGCCGCAG GCACCGCCAC GATCAGGCCG
GGTCTGGAGC GGCTCGCCCG GCTCGTCGCG TCCACGCCAT CCAGGCCACC GCGGCCGCGC
CGACCAGGAG CTGCGGCAGC AGGTAGCGCC CGGTGA
 
Protein sequence
MDLPVDLDAR ADRASAIYRA LLEAIRAGRV GAGDRLPPTR TLARDLGVAR NTVATAYERL 
AAEGLLDARV GAGTYVTDLA APVPAPRRPG SLHPRAGWSF RPLPVSGEQP APPYDFRVGI
PDASLFPFDT WRRLVAAELR AGAHRPGTYA HPAGLPQLRA AIVRYLALAR GVAAEADDVV
VTHGTQQALD LVARVLLEPG DVVAVEDPGY PFARELFASH GARVVPVPVD AEGLVVERVP
ERARLVFSTP SHQFPLGPPL SLARRQALLE LANRHRVAIV EDDYDSEFRF TDRPLETLHA
MDRHGRVVYV GTFSKSLLPA LRAGYLVAPE PLREALLGAR QLADGHGGPA EQAALAHFVA
DGLLARHLRR ARATYAERRE LVRSGLERLL ADRLEVVPSA AGLHVAATFR DAEVDDAAVA
EAALAAGVAV EPLSAYAVGP DVPPGLVLGY GAAGTATIRP GLERLARLVA STPSRPPRPR
RPGAAAAGSA R