Gene Information |
Locus tag | Noca_0471 |
Symbol | |
ID | 4597370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 503013 |
End bp | 504032 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639775085 |
Product | LacI family transcription regulator |
Protein accession | YP_921700 |
Protein GI | 119714735 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
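The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(503013, 504032)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1020 339
```

Both results agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields reported for Noca_0471.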
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0818961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCGGG TCTCGACGAC CCCGCCGACC CTGGCCGACG TCGCCGAGCG CGCCGGCGTC TCCCGGCAGA CGGTGTCGAA CGCCGTGAAC AACCCCGACC TGCTGCGCGC CGACACCCTG GCCCGGGTGC TGCACGCGAT CGACGAGCTC GGCTACTCGC CCAACCGGGC CGCCCGCAAC CTGCGCACCC GCGCCAGCCA CCTGGTCGGG ATGCGGATCG GGCCGGTCTA CGAGGGCACG GCCAACGCCA CGATGGACCG GTTCGTGCAC TCCCTGGTGG AGGCCTCGCG CGAGGCGGGC TACCACGTCC TGCTGTTCGC GGGTGACCCC GAGGACCCGG TGGCCGGGTA CGACGACCTG CTCCGCTCCA CCACCGTCGA CGCGTTCGTC GTCACCGACA CCTACCTCGG CAATCCGCAG GCCACCTGGC TCGAGGAGCG GCGGGCCCCG TTCGTCGCGT TCGGACGGCC GTGGGACAAC CCGGCGGCCG AGCACCCCTG GGTCGACGTG GACGGCGCGG CCGGCACCGA CCTGGCGACG TCGTACCTCC TCGGCCGCGG CCACGAGCGG ATCGCCTGGA TCGGCTGGCG CAAGGACTCC TGGATCGGTG AGGACCGCCG CTCGGGATGG AGCCGCGCCC TGCACGCCCG CGGGCTGCCC ACCACCGGCC TGGCGTCGCG GGTCGAGGAC ACCGTCTCGA GCGGCCGCGA GGCGAGCGCC GTCCTGCTGG ACGAGGCCCG GCCCACCGCG TTCGTGTGCG CGTCCGACAC CCTCGCCATG GGGGTGCTCG GCACCCTCGC CGACCGCGGC CTGACGCCCG GTCGGGACGT GTCGGTGATC GGCTTCGACG ACTCGCAGGT CGCGCAGGTC GTCTCGCCCG GCCTCACCTC GGTGCGCCAG CCGCTCGAGG AAGTGGCCGT GGAGATCGTC AAGGCGCTCG AGGGCCTGCT CGGCCACCCG CCCACCGTCG GCCCCGGCGT GATGCTGGTC CCGAGCCTCG CCCTCCGCGG CACCAGCTGA
|
Protein sequence | MARVSTTPPT LADVAERAGV SRQTVSNAVN NPDLLRADTL ARVLHAIDEL GYSPNRAARN LRTRASHLVG MRIGPVYEGT ANATMDRFVH SLVEASREAG YHVLLFAGDP EDPVAGYDDL LRSTTVDAFV VTDTYLGNPQ ATWLEERRAP FVAFGRPWDN PAAEHPWVDV DGAAGTDLAT SYLLGRGHER IAWIGWRKDS WIGEDRRSGW SRALHARGLP TTGLASRVED TVSSGREASA VLLDEARPTA FVCASDTLAM GVLGTLADRG LTPGRDVSVI GFDDSQVAQV VSPGLTSVRQ PLEEVAVEIV KALEGLLGHP PTVGPGVMLV PSLALRGTS