Gene Information |
Locus tag | Ndas_3659 |
Symbol | |
ID | 9247528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4390279 |
End bp | 4391310 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003681563 |
Protein GI | 297562589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
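The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed gene length) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4390279
end_bp = 4391310
protein_length_aa = 343

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)       # 1032
print(gene_length_bp % 3)   # 0
print(expected_protein_aa)  # 343
```

Under these assumptions the computed values agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields.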
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.346093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGGTT TGATGCGGCG TCCGACCATC ATGGACATCG CCAAGGCCGC GGGGGTGTCC AAGGGGGCCG TGTCCTACGC GCTCAACGGG CGTCCGGGGG TCTCGGAGGA GACCCGAAGA CGCATCCTGG CGATCGCCAG CGACCTCGGC TGGGCGCCCA GCAGCCCGGC GCGCGCACTC GCCCCCGGCG GCCGGATCGG GGCGGTCGGC CTGGTGGTGG ACCGGCCCGC CCACTCCCTC GGTCTGGAAC CCTTCTTCAT GCAGCTGGTC TCGGGGATCG AGACCGAGCT GGCCACCTCG GGGGTCGACC TGTTGTTACA GGTCACCGAG GACATGGGCG CCGAGATCGC GGCCTACCGG CGCTGGTCGT CCGAACGCCG GGTGGACGGC GTGATCATGG TGGACCTGCG CGTGGGCGAC CCGCGCGTCC AGGTCGTCGA GGAGCTCCCG CTGCCCGCCG TCGTCCTCGG CGGACCCGAG GGCGTGGGCT CCCTGCCCTA CCTGTACACC GACGACGCCA CCGCCATGCG CGAGGTCGTC CACTACCTGG CCGCCCTCGG CCACCGGCGC ATCGTCCAGG TGGCGGGCCC GGAGAAGTTC GTGCACACCC GCGTGCGCAC CCAGGCCTTC CTGGACGCCG CGGCCCAGGC CGGGCTCAGC GAGGCCCGCT GGGTGCACGC CGACTACACG GGCGAGGGGG GCACCAGGAC CACGCGCAAG CTGCTGGCCG CCACCGACCG GCCGACGGCC CTGATCTACG ACAACGACCT CATGGCCGTG GCCGGACTGG GCGTGGCCCA CGAGATGGGC GTGGACGTGC CCTCGCAGCT GTCCATCGTG GCCTGGGACG ACTCGGTGCT GTGCCGCCTG GTCCGCCCCT CGCTGACCGC CATAGTGCGC GACATCGTGT CCTACGGCCG CCAGGCCGCG CTGATGCTGG CCAGGACGAT CGAGGGCAAG CCGGTGGCCA ACAGCGAGAC CTCGCGCGGG GAGCTGCTCC CGCGGGGCAG CACCGGCCCC CTGGGCGCGT GA
|
Protein sequence | MGGLMRRPTI MDIAKAAGVS KGAVSYALNG RPGVSEETRR RILAIASDLG WAPSSPARAL APGGRIGAVG LVVDRPAHSL GLEPFFMQLV SGIETELATS GVDLLLQVTE DMGAEIAAYR RWSSERRVDG VIMVDLRVGD PRVQVVEELP LPAVVLGGPE GVGSLPYLYT DDATAMREVV HYLAALGHRR IVQVAGPEKF VHTRVRTQAF LDAAAQAGLS EARWVHADYT GEGGTRTTRK LLAATDRPTA LIYDNDLMAV AGLGVAHEMG VDVPSQLSIV AWDDSVLCRL VRPSLTAIVR DIVSYGRQAA LMLARTIEGK PVANSETSRG ELLPRGSTGP LGA
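The following is a minimal sketch of how the Sequence fields relate to the GC content and Translation table fields, not the method used to generate this record. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table, differing only in permitted start codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are identical for the standard table and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGGGGGGTT TGATGCGGCG TCCGACCATC"
print(translate(first_blocks))  # MGGLMRRPTI
```

The translated residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be checked in the same way. Because this gene starts with ATG, the sketch does not handle the alternative start codons that table 11 permits.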