Gene Noca_4635 details


Gene Information

Locus tag: Noca_4635
Symbol:
ID: 4596091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4915766
End bp: 4916971
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 76%
IMG OID: 639779244
Product: ROK family protein
Protein accession: YP_925817
Protein GI: 119718852
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
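
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length = gene_length_bp(4915766, 4916971)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Both results match the Gene Length and Protein Length fields of this record.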


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.892035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGCTC CGTCGACCAC CCGCGGCGGC GACCTGCTGC CCGCCGCGAT CCTGGGTCTG 
CTCGGCAGCC GCGGATCCTC GTCCCGGGCC GACATCGCCC GCCTCCTGCG GGTCAGCCCG
GCGGCCGTCA CCCAGGCCAC CAAGGGGCTG ATCGCTCGCG GGCTGGTCGC CGAGCTCGCG
GCCGAGCCGT CGCGCGGCGG CCGGCCGGCC CGGCTCCTGG GCCTGGTCCG GGAGGCGGCC
AGTGCTATCG GGGTCAAGGT GACCGCCGAC CACGTGGCCA CGGTGCGGGT CACGCTCGAC
GGCCTGGTCG AGGCCTACAG CACCCGCCCC TTCGACCCCT GGGCGCCCGA CGCCCTCGAC
CGCCTCGGCC GGCTGCTGGC CGACGCCGTC GCGGCCCACG AGGGTGCCCT TCTCGGCGTC
GGGGTCGGCG TACCCGGCTC CGTGGACGCA CAGGCCTCGG GGGTCGTGAC CGCTCCGACC
CTGGGCTGGG CCGAGCTGCC CGTCGGTGCC CACCTGCGCG CCGAGCTTGG CGTCCCCGTG
CTGCTGGACA ACGACGTCAA CACCCTCGCG GCTGCCGAGC GGCTGTACGG CGTCGGTCAG
GACGCCGCGT CGTACGTCGT CGTCACGATC GGGCGGGGCA TCGGCTGTGG CGTGGTCGTC
GACGGGTCCA TCTACCGCGG TGCCCGCGGC GGGGCTGGGG AGATCGGACA CATCCCGGTC
GCCGACGGAC CCGACTGCGC CTGTGGGGGC GTCGGCTGCC TGGAGGCGCT GATCGGCGAG
GACGCGCTGG TCCGGCGCGG GCGCGAGGAG GGTCTGATCG GTCCCGCGCA GGGCATCGCC
GAGCTGGCCG GCGCCGCCGA CGACGGCATC GCGGGCGCGC TCGAGCTGTT CGCGCTCGCC
GGACGCCTGC TCGGCCGGGC GCTCGCCGGC GTGGTCCACA CCATCGACCC GGGGGTGCTG
GTCATCCAGG GCGAGGGCGT GACGGCCTGG CGGCACTGGC AGTCGCCCTT CGAGACGTCG
TTTCGCCGGC ATCTGATGCC GAGCCGCCGA TCTCTGCGCT ACCAGGTGCA CGCCTGGTCG
GAGCAGCAGT GGACCCTGGG GGCCGCCAGT CTGGTGCTCG CCGCCCCGTT CGACTCGACC
GACACGACCG GCGAGCAGGG CCGCCTGGTG CGGGCCCGTC TGCAGGACCC CGAGGGCGGT
GCCTGA
 
Protein sequence
MPAPSTTRGG DLLPAAILGL LGSRGSSSRA DIARLLRVSP AAVTQATKGL IARGLVAELA 
AEPSRGGRPA RLLGLVREAA SAIGVKVTAD HVATVRVTLD GLVEAYSTRP FDPWAPDALD
RLGRLLADAV AAHEGALLGV GVGVPGSVDA QASGVVTAPT LGWAELPVGA HLRAELGVPV
LLDNDVNTLA AAERLYGVGQ DAASYVVVTI GRGIGCGVVV DGSIYRGARG GAGEIGHIPV
ADGPDCACGG VGCLEALIGE DALVRRGREE GLIGPAQGIA ELAGAADDGI AGALELFALA
GRLLGRALAG VVHTIDPGVL VIQGEGVTAW RHWQSPFETS FRRHLMPSRR SLRYQVHAWS
EQQWTLGAAS LVLAAPFDST DTTGEQGRLV RARLQDPEGG A
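
As a rough cross-check between the two sequences above, the first row of the gene sequence (60 bp) can be translated and compared with the first 20 residues of the protein sequence. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which start codons are permitted).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    return 100 * sum(base in "GC" for base in dna) / len(dna)


first_row = clean("ATGCCTGCTC CGTCGACCAC CCGCGGCGGC GACCTGCTGC CCGCCGCGAT CCTGGGTCTG")
first_residues = clean("MPAPSTTRGG DLLPAAILGL")

print(translate(first_row) == first_residues)  # True
print(gc_percent(first_row))                   # 75.0
```

The GC value printed here covers only the first 60 bp, so it is not expected to equal the 76% reported for the whole gene. Passing the full gene sequence to the same functions would extend the check to the complete record.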