Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3984 |
Symbol | |
ID | 4598119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4203696 |
End bp | 4204715 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639778589 |
Product | LacI family transcription regulator |
Protein accession | YP_925168 |
Protein GI | 119718203 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.125023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGCC GCCCCCGCAC CCGCCCCACC CTGGAGGAGG TCGCCGCGCT CGCGGGCGTC GGCCGCGGCA CCGCCTCGCG CGTGATCAAC GGTGGGGCGA AGGTCTCCGA CCGCGCGCGC CAGGCGGTCG AGGACGCGAT CGCCGAGCTC GGCTACGTGC CGAACGCCGC CGCCCGCGCC CTGGTCACCC GGCGCACCGA CGCGGTCGCG CTGGTCATCG CCGAGTCCGA GGAGCGGGTG TTCGGCGAAC CGTTCTTCGC AGGGATCGTG CGCGGCATCG GGGCGGCGCT CGCGACTGCC GATCGCCAGC TCGTGCTGAT CCTGGCCGAT GCCCGCCGCC GCGGCGGCCT CGAGCACTAC CTCACCCGCC AGCACGTCGA CGGGGTGCTG CTGCTCTCGC TGCACGGCGA CGACACGCTG CCCGACCGGA TCCGCGGCCA CGGGCTGCCG GTGGTCGTGG GTGGCCGCCC GCGGCCCGAC GTCGCGACCG GCTTCGTCGA CGTCGACAAC GTGCAGGGCG CCGGCCTCGC CGTGGCCCAC CTCGTCGACC GCGGCCGCAC CCGGATCGCC ACGATCGCCG GCCCGGCGGA CATGGTCGCC GGCAGCTCCC GCTTCGAGGG GTACGTCGCC GGCCTCACCG CCGCCGACCG GCCCATCGAC GAGCGACTGG TGGCGCGCGG CGACTTCAGC CAGGAGAGCG GCACCCGCGC GATGCGGGCG CTCCTGGACC GGGAGCCGGG CGTGGACGGC GTCTTCTGCG CCAACGACCT GATGGCGGTC GGCGCGCTCC AGGCCCTGCG CGAGCACGGC CGCCGGGTCC CCGAGGACGT CTCCGTCGTG GGCTTCGAGG ACGCTCCGAT CGCCCGGGCG ACGGTGCCCC CGCTGACGAC CGTGCACCAG TCACCCGGGG CGATGGGGGG CGAGATGGTC GCGCTCCTGC TGGAGACGAT GGCCGGCACC GACCCCGCGC CGCCCGGCCG GATGCTGCCC ACCCGCCTGG TCGTGCGCCA GAGCAGCTGA
|
Protein sequence | MARRPRTRPT LEEVAALAGV GRGTASRVIN GGAKVSDRAR QAVEDAIAEL GYVPNAAARA LVTRRTDAVA LVIAESEERV FGEPFFAGIV RGIGAALATA DRQLVLILAD ARRRGGLEHY LTRQHVDGVL LLSLHGDDTL PDRIRGHGLP VVVGGRPRPD VATGFVDVDN VQGAGLAVAH LVDRGRTRIA TIAGPADMVA GSSRFEGYVA GLTAADRPID ERLVARGDFS QESGTRAMRA LLDREPGVDG VFCANDLMAV GALQALREHG RRVPEDVSVV GFEDAPIARA TVPPLTTVHQ SPGAMGGEMV ALLLETMAGT DPAPPGRMLP TRLVVRQSS
|
| |