Gene Noca_3373 details

Gene Information

Locus tag: Noca_3373
Symbol:
ID: 4598771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3573131
End bp: 3574567
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 77%
IMG OID: 639777980
Product: GntR family transcriptional regulator
Protein accession: YP_924561
Protein GI: 119717596
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
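
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
# Minimal consistency check for the coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon excluded from the protein.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate interval."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids encoded, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3573131, 3574567)
print(length, protein_length_aa(length))  # prints: 1437 478
```

Both values match the Gene Length and Protein Length fields reported above.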


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.576212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGCA CCGTGAGCGC CGGCCGGGTG GCCGCCCTGG TCGGCGACTT CGACCGCTCC 
CCCGCCTACG CCGGCCTCGC CGACGCCCTG GTCCTGCTGA TCGGCGACGG CCGGATCGCG
CTGGACACCC GGCTGCCGAG CGAGCGGGAG CTGACCGAGG CGCTCGGGCT CTCGCGCACC
ACCGTGACCC GGGCGTACAC CGCGCTGCGG GAGGCGGGGT ACGCCGAGGC GCGACGCGGC
TCGGGCACGT TCACGCGGGT GCCGGGCGGG CCGGCGCGCG CACACGACCG GGCCCTGCTG
CCGCGGCCCG GCGACGACGA GGCGATCGAC CTGAACTGCG CGGCGCCGTC CGCCCCGGCG
GGCCTGGCGA AGGCGTACGC CGAGGCCGCG GCCGAGCTGC CGGCGTACCT CGGCGGCCAC
GGCTACTTCC CCGCCGGGCT CCCCCAGCTC CAGCAGGCGA TCGCGGCGAC GTACGAGGCG
CGCGGCCTGC CGACCGCGCC CGACCAGATC ATGGTCACGC CCGGCGCGCT GTCGGCGGCG
TCGATCGTGG CGCAGGCGTT CACCTCGCCC GGCGACCGGG TGCTGGTGGA GTCGCCGGTG
TACCCGAACG CGATCGACGC GCTGCGGCAC GGCGGCGCCC GCCTCACGCC GGTGCCGGTC
GACCCCGAGG GCTGGGACCT CCCCGCCGTC GGCGCCGCGC TGCGGCAGAC GGCACCGCGG
CTGGCCTACC TGATCGCGGA CTTCCAGAAC CCCACCGGCC ACCTGATGAC CGAGGCGCAG
CGCGAGGAGT ACGCCGGCCA CCTGCGCCGG GCGCACACCA CCGCGATCGT CGACGAGGCC
CACCAGTGGC TCCCGCTCGA GGGCCAGGAC ATGCCGCGCC CGTTCGCGGC GTACGCCCCC
GACACGATCA CGATCGGCAG CGCCAGCAAG GGCTTCTGGG GCGGCCTGCG GCTGGGCTGG
ATGCGGGTCC CACCCGGCCG GATGGACCGG CTCACCCAGG CCCGGGTGAG CATGGACCTC
GGCGCCCCGG TGATGGAGCA ACTGGTGCTG GTCCGGCTGC TCGCCGAGGC CGACGAGGTG
CTCGCCGCGA ACCGGGCGCG GCTCCGTGCC CAGCGCGACG CGCTGGTCGC GGCGGTGCGT
GCGCAGCTCC CCGAGTGGAC GTTCCGCGTG CCGTCCGGCG GGTTGGCGCT GTGGTGCCGG
CTCCCCGCCG CCTCGGGCTC GGCGGTCGCG GCCGAGGCCG AACGCCTGGG CGTCATCATC
CCGCCCGGCC CGGTCTTCGC CGTCGAGGGC GGTCTCGACC GGTTCGTGCG GATCCCGTGG
ACGCGGCCGG CCGAGGACCT CGTCGACGGT GTCGCCCGGC TGGCCGAGGC CTGGGCCGTG
GTGCGCGAGC GGCCGGCGTC CGGGCCGGGT GTGTCCGGGC GGGTGATGGT GGCCTGA
 
Protein sequence
MNRTVSAGRV AALVGDFDRS PAYAGLADAL VLLIGDGRIA LDTRLPSERE LTEALGLSRT 
TVTRAYTALR EAGYAEARRG SGTFTRVPGG PARAHDRALL PRPGDDEAID LNCAAPSAPA
GLAKAYAEAA AELPAYLGGH GYFPAGLPQL QQAIAATYEA RGLPTAPDQI MVTPGALSAA
SIVAQAFTSP GDRVLVESPV YPNAIDALRH GGARLTPVPV DPEGWDLPAV GAALRQTAPR
LAYLIADFQN PTGHLMTEAQ REEYAGHLRR AHTTAIVDEA HQWLPLEGQD MPRPFAAYAP
DTITIGSASK GFWGGLRLGW MRVPPGRMDR LTQARVSMDL GAPVMEQLVL VRLLAEADEV
LAANRARLRA QRDALVAAVR AQLPEWTFRV PSGGLALWCR LPAASGSAVA AEAERLGVII
PPGPVFAVEG GLDRFVRIPW TRPAEDLVDG VARLAEAWAV VRERPASGPG VSGRVMVA
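
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship on the first block of three 10-base groups. It assumes the amino acid assignments of table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons); `CODON_TABLE` and `translate` are hypothetical names chosen for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying
# fastest, paired with the amino acid string for translation table 11
# (identical in amino acid assignments to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence, copied from the record.
first_groups = "ATGAACCGCA CCGTGAGCGC CGGCCGGGTG"
print(translate(first_groups))  # prints: MNRTVSAGRV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence, with its spaces and line breaks, to the same function should reproduce the complete protein under the same assumptions.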