Gene Information |
Locus tag | Ndas_4931 |
Symbol | |
ID | 9248818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 66959 |
End bp | 67981 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003682820 |
Protein GI | 297563847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
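As a quick consistency check on the fields above, the gene length follows from the inclusive coordinates (End bp minus Start bp, plus 1), and the protein length from the codon count less the stop codon. The following is a minimal Python sketch of that arithmetic; the function names are illustrative only and not part of any database API, and it assumes an unspliced CDS that ends in a complete stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon is not translated, so subtract one codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(66959, 67981)
print(length, protein_length_aa(length))  # prints: 1023 340
```

Both results agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields listed in the record.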
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.699714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACCA TCAAGGACGT CGCCGCGGCG GCGGGCGTGG CCCCGAGCAC GGTGTCCTAC GTGCTCAGCG GCTCCCGCCG CGTCTCCGAA CGGACCCGGT CCGCGGTGCG GACGGCCATC GACGAACTGG GCTACCACCC CAACGCGGGT GCGCGCTCCC TGCGCAGCGC CCGCACCCGC GTGATCGCGC TGGCCCTGCC GCTGGCCTCG CCCGGCTACC TGCCGGTCGG CGGGCGGTTC ATGTACGGGC TGAGCCGCGC CGCCGGGGAG CTGGGCTACG ACCTGCTGCT CCTGACCGTG CGCGACGACG ACGCCGGGGC GTACGGGCTG GAGCGCGCCG CGCGCAGCAG GCTCGCCGAC GTCGCGGTGA TCATGGGCGT GGAGATGGAG GACCCCCGCA TCGGCGCGAT GGACGCGCTG GGGTTCCCCG TGGTGGTCCT CGGCCGCCCC TCCGACGAGG ACGCCGCGCC CTGGGCCGAC CTCGACTGGG AGGAGGCCGC GGTCGCCTCG CTGCGCCTGC TGCACGGGAC CGGGCACCGC GACCTGTGCT TCGTGTCGAC TGTGGAGGAG GACATCGCCT CGGGCCGCAG CTACTCCGTG CGCGGTCTGC GCGGGGCCGA ACGCGCCGCC GCCGAACTCG GCGTCCCCGT GCGCGTCCTG CCCTCGGCGA AGGACCCCGC CGAACTGTAC CGGCGGCTGG ACGCGCTCCT CGACGGGGAC CGCCCGCCCA CCGCGCTCGC CCTCCAGCAC CCCGCCGCCG TGCCCGGGGT CCTGCGCCAC CTCGCAGCGC GCGGCACCGA CGTGCCCGGG GACGTCTCCC TGGTGGCCAT CGGCAGCTTC CCCGAGGACC TCGCGGGACT GGACGTGACC CGGGTCGAGC TGCCCGTCGA GCGGATGTCG GCCGCCGTGA CCCGGCTGGC CGCCGAGGCC GCGCGCGGCA GCCCGCCGCT CCCGGGAGGA CGGCGCGAAC TCATCCCGCC CGAGATCACC CCCGGGGGGA CGGTAGCCGC TCCCCCGCCC TGA
|
Protein sequence | MVTIKDVAAA AGVAPSTVSY VLSGSRRVSE RTRSAVRTAI DELGYHPNAG ARSLRSARTR VIALALPLAS PGYLPVGGRF MYGLSRAAGE LGYDLLLLTV RDDDAGAYGL ERAARSRLAD VAVIMGVEME DPRIGAMDAL GFPVVVLGRP SDEDAAPWAD LDWEEAAVAS LRLLHGTGHR DLCFVSTVEE DIASGRSYSV RGLRGAERAA AELGVPVRVL PSAKDPAELY RRLDALLDGD RPPTALALQH PAAVPGVLRH LAARGTDVPG DVSLVAIGSF PEDLAGLDVT RVELPVERMS AAVTRLAAEA ARGSPPLPGG RRELIPPEIT PGGTVAAPPP
|
| |