Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1923 |
Symbol | |
ID | 4596370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2049787 |
End bp | 2051322 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 639776521 |
Product | stage II sporulation E family protein |
Protein accession | YP_923120 |
Protein GI | 119716155 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.963771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACCGGG AAGGAATCAG GCAGCGCGAC CGGTACGTCG TCACCGCCAT GCAAGACGCC CTCCTGCCGG TCGGCGTGGT CGCGCTGCCC AGGGTCGACG TGGCGGCCCG GTACCTGCTC TCCGAGGATG ACCGGTCGGC CGGTGGTGAC TGGTTCGACA CGGTCCCCCT GGGCGACGGC CGGGTCGGGC TGGTCGTCGG CGACGTCGTG GGCCAGGGCG TCACCGCGGC GTCGACCATG GGCCAGCTGC GAGCGGTGCT CACCGCCGCG CTGCTCGCTC GGGAGGACCC CGTCGAGGCG CTCGAGGAGG CGTCCCGGTT CGCCGCCGCC CTGCCCGAGG CGCGCGGTGC GACCGCGGTC GTCGCGCTCG TCGACCCGGC GGCCGGCGAG CGGGGGTCCC TGCGCTACTG CACCGCGGGC CACCCGCCGC CGCTGGTGGT GCGCGCCGAC GGGTCCGCCG CCTTCCTGCC GGTCACCGGG GGCGGCCCGC TGGGGAGCGG TACGCCGCTG CAGGCAGCGG AGGCCGCCCT CGGCCCGGGC GACCTGCTGC TGCTCTACAC CGATGGCCTG CTGGTCCGCC CGGACGCGGC GGCGGCGTCG AGCCTGGACG ACCTGCTCGC GGCGGCCACC GACGCCTGCC GGGAGGCCGC ACCGGCCGCG GCCGGGGCCG TCGCCGAGGC CGCCTGCGAG CTCACGCTGG AGCGGTTGAC GCGGCGCACC GGGGTCGCCG ACGACGTCAC CGTGCTGGCC GCGCAGCTGC TCCCCAGCCG CACCGAGGAC CTGTTGCTCG ACCTGCCCGC GCTGCCGGAC ACCGTCCGGG TGGTCCGCCT CGAGCTCGGC GGGTGGCTGG CCGACCTGCG GCTGTCCTCG ATCGACCAGC TCGCCCTCCA GCAGGCGGTC GGGGAGCTGG TGGGCAACGC GGTCGAGCAC GCCTACCCTC CCGGCTCGCA GGCGCTGCAC ACCTCGGTGC GGGTCCGAGC CACCCACACC GATGCGGGGT GCATCGAGGT CGACGTGTCG GACCGGGGCC GGTGGCGGCC GCGCCCACCC GGCGGGACGC ACCGCGGCCT GGCCCTGGCC AGCGGCTTCG TCGACCACTT GGAGCGGGTG CCGGCCGACA CGGGGACCCA TCTGCGGGTG CGGCACCGGG CCCTGCGCCC GGCGCACCTG CTCACCGCGA CGCCCGGCGG TGCGGGCGCG GAGACCCGGG CGGCGCGGGA GCAGCCGGGC CTCAGCGTCC GGCGCAGCGC GAGTGGCGAC CTGGCGCTGA GCGGCGCCGT CGACCCACTG AGCGTCGACC GCCTCGCGGC AGAGGTCCGC CGCTCGACGG CCGGGCCGGA CCGCCCGGTC GTGGTCGACC TCTCGGCGGT GACGCTGCTG TGCAGTGGCG CGGTGCAGGT GCTCAGCGAC GCCGTCCCCG GCGCCGGTTC CGCCCATGCC CCCGTGGTGC TGCGGACACC GGCCGGGAGC GTGGCGGAGC TGGTGCTCGA GCTGACCCGG GTGCCCTACG AGACGATCGA GGGCCCGCCC GGCTGA
|
Protein sequence | MDREGIRQRD RYVVTAMQDA LLPVGVVALP RVDVAARYLL SEDDRSAGGD WFDTVPLGDG RVGLVVGDVV GQGVTAASTM GQLRAVLTAA LLAREDPVEA LEEASRFAAA LPEARGATAV VALVDPAAGE RGSLRYCTAG HPPPLVVRAD GSAAFLPVTG GGPLGSGTPL QAAEAALGPG DLLLLYTDGL LVRPDAAAAS SLDDLLAAAT DACREAAPAA AGAVAEAACE LTLERLTRRT GVADDVTVLA AQLLPSRTED LLLDLPALPD TVRVVRLELG GWLADLRLSS IDQLALQQAV GELVGNAVEH AYPPGSQALH TSVRVRATHT DAGCIEVDVS DRGRWRPRPP GGTHRGLALA SGFVDHLERV PADTGTHLRV RHRALRPAHL LTATPGGAGA ETRAAREQPG LSVRRSASGD LALSGAVDPL SVDRLAAEVR RSTAGPDRPV VVDLSAVTLL CSGAVQVLSD AVPGAGSAHA PVVLRTPAGS VAELVLELTR VPYETIEGPP G
|
| |