Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0379 |
Symbol | |
ID | 4597765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 407542 |
End bp | 408570 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639774993 |
Product | LacI family transcription regulator |
Protein accession | YP_921609 |
Protein GI | 119714644 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.676966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGCCC ACCGCGACAA GCGCCATCCC ACGATGGCCG ACGTGGCGGC CCGGGCCGGG GTCTCCCACC AGACGGTGTC ACGGGTGATC AACAACGCGC CGAGCGTCCG GCCGGAGACG GCAGCCCGGG TGGTCCAGGC GATCAGCGAC CTCGGCTACC GCCCCAACCG CTCGGCGCGG CTCCTCGCCT CGCGGCACTC CCGGCTGATC GGCGTCGCGA CCTGGGGCAC CAGCCACCAC GGGCCGCAGC AGGTGCTGCT CGCGCTGGAC GCGGCCGCCC GCCGCGCCGA CTACCGGACC GCCGTCAGCA CCCTGCACGC GCTGACCGAG CAGGACACCC GGGACGGCGT CGAGGAGCTG CTCCAGCTCG GGGTCGAGGC GGTGGTGCTG ATCATCCCGC ACGAGTCGAT GCTGCGGTTC GCGACCGAGG CCGACCTCGG GGTGCCGACC GTGGTCGTCG AGGGGGACCT CTCGCGGATG CCACTGACCG TCGGGGTCGA CAACGTCCAG GGCGGCAGCC TCGCGACCCG GCACCTGCTC GAGCTGGGCC ACCGCACCGT CGTGCACGTC GCCGGCCCGC CGGGCTGGGC CGAGGCGGCC GCCCGCGTCG ACGGGTGGCG GCTGGAGCTC GAGACCTGGG GCCGGGTGGT GCCGCCCCTG CGGTGGGGCG GCGACTGGAG TGCCCGCAGC GGGTACGACG CCGGCGTCTC CCTGGCGCGC GATCCCGAGG TCACCGCGGT GTTCGCCGCG AACGACCAGA TGGCGATGGG GGTGATCGCC GCGTTGAGGG AGGCCGGCCG CCGGGTGCCC GACGACATCT CGGTCGTCGG GTTCGACGAC CTGCCGGAGT CGGCGTACCT CGATCCGCCG CTGACCTCGG TGCACCAGGA CTTCGGGGAG CTGGGGCGCC GCGCCATGGG GCTCCTCGAG CGGGTGCTGG CCGGCGAGAA GAAGCCGACC GCCGACCTGG TGCCGACCTC GCTGGTCGTC CGGGCGTCCA CCTCGTCGCC GCGGGTGCCC GCGAACTGA
|
Protein sequence | MPAHRDKRHP TMADVAARAG VSHQTVSRVI NNAPSVRPET AARVVQAISD LGYRPNRSAR LLASRHSRLI GVATWGTSHH GPQQVLLALD AAARRADYRT AVSTLHALTE QDTRDGVEEL LQLGVEAVVL IIPHESMLRF ATEADLGVPT VVVEGDLSRM PLTVGVDNVQ GGSLATRHLL ELGHRTVVHV AGPPGWAEAA ARVDGWRLEL ETWGRVVPPL RWGGDWSARS GYDAGVSLAR DPEVTAVFAA NDQMAMGVIA ALREAGRRVP DDISVVGFDD LPESAYLDPP LTSVHQDFGE LGRRAMGLLE RVLAGEKKPT ADLVPTSLVV RASTSSPRVP AN
|
| |