Gene Noca_4488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4488 
Symbol 
ID4597007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4744508 
End bp4745632 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content75% 
IMG OID639779099 
ProductROK family protein 
Protein accessionYP_925672 
Protein GI119718707 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.520196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCGG TCGGACGACC GCTCCGGCCC CGCGGCAAGC TCCTCCAGGA GGACGCCCGG 
AGGCACCACC GCTCGTTGCT GCTCCAGCAG CTGTTCCGCG AGGGGCCGGC CAGCCGCGCG
GACCTGGCCC GGACCACCCG CCTGACCCGG GTCACGGTTT CCGACCTGGT GGGCGAGCTG
GTGGGCGAGG GCCTGGTCGA GGAGCTCGGT GCGCCGGCCG AGGCGCGGGT CGGCAAGCCG
CCAACGCTGG TCGGGCTGGC GCCGGACGCG AGTCACATCA TCGGGCTGGA CCTCTCGGCC
GACGACCGGA TGACCGGCGC CGTGGTGAAC CTCCTCGGCC AGGTTCAGGC CCGTCACGAG
ATCGAGATCG GCGACGCGCA GGGCGAAGCG GCGGTCCGGC TCGTGCACCG CCTCGCGGCC
GAGCTGATCG CGATGACCGA CCGGCCGGTG CTCGGCGTCG GCGTGGGCAG CCCCGGCGTG
GTCGACGCCG CCGGCACCGT CATCGACGCC CCCAACTTCG CGTGGACCGA CACCCCGCTG
TCCACCACCC TGGCCGCCGC GCTCGGCGTA CCGGTCTTCG TGGCCAACGA TGCCAACACC
GCGGTCCTCG GGGAGCACAC CTTCGGCCAG ACCGGCGACG GCGGCCTGAT GGTGCTCCGG
GTCGGCATCG GCGTCGGCGC CGGGCTGGTG CTCGGGGGTT CGCTCCTCCA CGGCCACCTC
GGCGCCGCCG GCGAGATCGG CCACGTCACC GTCGACCCCG ACGGCGACGT GTGCGCCTGC
GGACGCCGCG GCTGCCTGGA GACGATCCTG GCCGCGCCCC GCCTGCGGCG CCGGCTCGCC
GAGCCCGGTG CGGACCGGGA CGCCGTGCTC ACCGAGGTGG GTGAGCGGCT CGGCGTCACC
CTGGCGCCGG TCGTCGGCAC CCTCAACATC CACGAGCTGG TGCTGAGCGG CCCGACCGAG
CTGCTGGACG GCCCGCTGCG TGCGGCGGCC GACCGGGTCG TGCGCGAGCG GACCATGCCG
GTCAGCTCCG CGGGCCTGAC GGTCCGCACC TCCACGCTCG GCGCGGACGT GGTGTTGATC
GGCGCCGCGG TCCTCGTCCT CTCGGGACAG CTGGGCGTGT CGTGA
 
Protein sequence
MSPVGRPLRP RGKLLQEDAR RHHRSLLLQQ LFREGPASRA DLARTTRLTR VTVSDLVGEL 
VGEGLVEELG APAEARVGKP PTLVGLAPDA SHIIGLDLSA DDRMTGAVVN LLGQVQARHE
IEIGDAQGEA AVRLVHRLAA ELIAMTDRPV LGVGVGSPGV VDAAGTVIDA PNFAWTDTPL
STTLAAALGV PVFVANDANT AVLGEHTFGQ TGDGGLMVLR VGIGVGAGLV LGGSLLHGHL
GAAGEIGHVT VDPDGDVCAC GRRGCLETIL AAPRLRRRLA EPGADRDAVL TEVGERLGVT
LAPVVGTLNI HELVLSGPTE LLDGPLRAAA DRVVRERTMP VSSAGLTVRT STLGADVVLI
GAAVLVLSGQ LGVS