Gene Noca_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1923 
Symbol 
ID4596370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2049787 
End bp2051322 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content78% 
IMG OID639776521 
Productstage II sporulation E family protein 
Protein accessionYP_923120 
Protein GI119716155 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.963771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACCGGG AAGGAATCAG GCAGCGCGAC CGGTACGTCG TCACCGCCAT GCAAGACGCC 
CTCCTGCCGG TCGGCGTGGT CGCGCTGCCC AGGGTCGACG TGGCGGCCCG GTACCTGCTC
TCCGAGGATG ACCGGTCGGC CGGTGGTGAC TGGTTCGACA CGGTCCCCCT GGGCGACGGC
CGGGTCGGGC TGGTCGTCGG CGACGTCGTG GGCCAGGGCG TCACCGCGGC GTCGACCATG
GGCCAGCTGC GAGCGGTGCT CACCGCCGCG CTGCTCGCTC GGGAGGACCC CGTCGAGGCG
CTCGAGGAGG CGTCCCGGTT CGCCGCCGCC CTGCCCGAGG CGCGCGGTGC GACCGCGGTC
GTCGCGCTCG TCGACCCGGC GGCCGGCGAG CGGGGGTCCC TGCGCTACTG CACCGCGGGC
CACCCGCCGC CGCTGGTGGT GCGCGCCGAC GGGTCCGCCG CCTTCCTGCC GGTCACCGGG
GGCGGCCCGC TGGGGAGCGG TACGCCGCTG CAGGCAGCGG AGGCCGCCCT CGGCCCGGGC
GACCTGCTGC TGCTCTACAC CGATGGCCTG CTGGTCCGCC CGGACGCGGC GGCGGCGTCG
AGCCTGGACG ACCTGCTCGC GGCGGCCACC GACGCCTGCC GGGAGGCCGC ACCGGCCGCG
GCCGGGGCCG TCGCCGAGGC CGCCTGCGAG CTCACGCTGG AGCGGTTGAC GCGGCGCACC
GGGGTCGCCG ACGACGTCAC CGTGCTGGCC GCGCAGCTGC TCCCCAGCCG CACCGAGGAC
CTGTTGCTCG ACCTGCCCGC GCTGCCGGAC ACCGTCCGGG TGGTCCGCCT CGAGCTCGGC
GGGTGGCTGG CCGACCTGCG GCTGTCCTCG ATCGACCAGC TCGCCCTCCA GCAGGCGGTC
GGGGAGCTGG TGGGCAACGC GGTCGAGCAC GCCTACCCTC CCGGCTCGCA GGCGCTGCAC
ACCTCGGTGC GGGTCCGAGC CACCCACACC GATGCGGGGT GCATCGAGGT CGACGTGTCG
GACCGGGGCC GGTGGCGGCC GCGCCCACCC GGCGGGACGC ACCGCGGCCT GGCCCTGGCC
AGCGGCTTCG TCGACCACTT GGAGCGGGTG CCGGCCGACA CGGGGACCCA TCTGCGGGTG
CGGCACCGGG CCCTGCGCCC GGCGCACCTG CTCACCGCGA CGCCCGGCGG TGCGGGCGCG
GAGACCCGGG CGGCGCGGGA GCAGCCGGGC CTCAGCGTCC GGCGCAGCGC GAGTGGCGAC
CTGGCGCTGA GCGGCGCCGT CGACCCACTG AGCGTCGACC GCCTCGCGGC AGAGGTCCGC
CGCTCGACGG CCGGGCCGGA CCGCCCGGTC GTGGTCGACC TCTCGGCGGT GACGCTGCTG
TGCAGTGGCG CGGTGCAGGT GCTCAGCGAC GCCGTCCCCG GCGCCGGTTC CGCCCATGCC
CCCGTGGTGC TGCGGACACC GGCCGGGAGC GTGGCGGAGC TGGTGCTCGA GCTGACCCGG
GTGCCCTACG AGACGATCGA GGGCCCGCCC GGCTGA
 
Protein sequence
MDREGIRQRD RYVVTAMQDA LLPVGVVALP RVDVAARYLL SEDDRSAGGD WFDTVPLGDG 
RVGLVVGDVV GQGVTAASTM GQLRAVLTAA LLAREDPVEA LEEASRFAAA LPEARGATAV
VALVDPAAGE RGSLRYCTAG HPPPLVVRAD GSAAFLPVTG GGPLGSGTPL QAAEAALGPG
DLLLLYTDGL LVRPDAAAAS SLDDLLAAAT DACREAAPAA AGAVAEAACE LTLERLTRRT
GVADDVTVLA AQLLPSRTED LLLDLPALPD TVRVVRLELG GWLADLRLSS IDQLALQQAV
GELVGNAVEH AYPPGSQALH TSVRVRATHT DAGCIEVDVS DRGRWRPRPP GGTHRGLALA
SGFVDHLERV PADTGTHLRV RHRALRPAHL LTATPGGAGA ETRAAREQPG LSVRRSASGD
LALSGAVDPL SVDRLAAEVR RSTAGPDRPV VVDLSAVTLL CSGAVQVLSD AVPGAGSAHA
PVVLRTPAGS VAELVLELTR VPYETIEGPP G