Gene Information |
Locus tag | Noca_4479 |
Symbol | |
ID | 4596998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4735200 |
End bp | 4736228 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639779090 |
Product | LacI family transcription regulator |
Protein accession | YP_925663 |
Protein GI | 119718698 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
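The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are illustrative and not part of the record):

```python
# Values copied from the record above.
START_BP = 4735200
END_BP = 4736228


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1029 342
```

Both results agree with the Gene Length and Protein Length rows.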
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.277487 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGGTG TGACTCCCAC GCTCCGCGAC GTCGCCGACG CCGCGGGTGT GCATCCCGCG ACCGCCTCCC GCGCGCTGAA CCCCGCCACC CGCGGGCTGG TCAACGCCGA CACCGCGCGT CGGGTGATCA AGGTCGCCGA GTCCTTGGGC TACCGGCCGA ACCCGATCGC GCGGGGGCTG AAGACCGCGA AGTCCGGCAC CGTCGGCCTG GTCATCCCCG ACCTCACCAA CCCGCTGTTC CCCCCGATCG TGCGCGGCAT CGAGGACGTG CTCGAGCCGG CCGGGTACAG CGGCCTGATC GTCAACACCG ACAACGACCC GAGCCGCGAG CGGGCCCAGT TCGAGTCGCT GCGCTCGCGC CAGGTCGAGG GCTTCATCGT GGCCACGGCG CTGCTGGACC ACCCGCTGCT CGACCAGCTG CGCCGCGAGG GCGTGCTGAT GGTGATGGTC AACCGCCGGC CCGACGGCCT GGACGTCCCC TCGATCACCC CCGACGACGC TGCCGGCGTG GAGCTCGCCG TACGCCACCT GGCGGAGCTG GGGCACCGCC GGGTCGCGCA CCTGGCCGGC CCGTCGAACA CCTCGACCGG CGTGGTGCGG GCGCGGTCGT TCCGCAACAC GGTGCGCGAC CTCGGCCTCG ACGAGGACCC GGCGCTGACC GTGACCTGCC CGTACTGGAG CGAGACCGCC GGCGCCGAGG CGCTGCGCTC GCTGCTCGAC TCCGGCGCGG AGTTCACCGC CGTCGTCGCC GGCAACGACC TGATCGCGCT GGGCTGCTAC GACGTGTTCG CCGAGCGCTC GATCGAGTGC CCGCGCGACG TCAGCGTGGT CGGCTTCAAC GACATGCCGT TCCTGGACAA GCTGCGCCCG CCGCTGACGA CGGTCGCCGT GCCCCACCAG CAGATCGGCG CCGAGGCCGC ACGGATGCTG CTGGACGCGA TCCGGGACCC GTCCCGCCCG GCCCGGTCGG TGCTGCTGCC GCTCTCGCTC GTGGTGCGCG GCTCGACCGC GCCGCCGTAC TCCGGCTGA
|
Protein sequence | MRGVTPTLRD VADAAGVHPA TASRALNPAT RGLVNADTAR RVIKVAESLG YRPNPIARGL KTAKSGTVGL VIPDLTNPLF PPIVRGIEDV LEPAGYSGLI VNTDNDPSRE RAQFESLRSR QVEGFIVATA LLDHPLLDQL RREGVLMVMV NRRPDGLDVP SITPDDAAGV ELAVRHLAEL GHRRVAHLAG PSNTSTGVVR ARSFRNTVRD LGLDEDPALT VTCPYWSETA GAEALRSLLD SGAEFTAVVA GNDLIALGCY DVFAERSIEC PRDVSVVGFN DMPFLDKLRP PLTTVAVPHQ QIGAEAARML LDAIRDPSRP ARSVLLPLSL VVRGSTAPPY SG
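The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with that layout, assuming plain Python with no external libraries; the helper names are hypothetical, and the start-codon check is a simplification, since translation table 11 also permits alternative start codons such as GTG and TTG:

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough sanity check: whole codons, ATG start, stop codon at the end."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# First two blocks of the gene sequence above, used only as a short example.
fragment = clean_sequence("ATGCGGGGTG TGACTCCCAC")
print(len(fragment), gc_fraction(fragment))  # 20 0.65
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content row.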