Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2790 |
Symbol | |
ID | 9246641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3332156 |
End bp | 3333148 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, LysR family |
Protein accession | YP_003680709 |
Protein GI | 297561735 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.298567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCG AACTCCGCCA CCTGCGCACC ATCTGCCTCC TGGCCGACAC CGGCAGTGTG ACCAAGGCGG CCGCCGCGCT CTCCACCTCA CAGCCCGCGC TGACCACCCA ACTCCAGCGG ATCGAGCGCG AGGTCGGCGG ACCGCTGTTC CACCGCGGGC GTTCGGGGGT CCGCCCCACC GAGCTGGGCG AGTTCGTGCT GGTGCGCGCA CGCTCGGTCC TGCTGTCCAT GGACGACCTG CTGCACGACA TCACCGCGCG CGGGGCCTCC CCCACCACCC TGCGCGTGGG CGGGGTCGGA CCGCTGACGC TGGAGCTGTC GGCCCGCATA CCCGAGGTCT TCCCGAACGT GCCGGTCCAC GTGCGCACGG AGTACTCGCC CAAGCTGGTC ACCGACCTGG TGCGCGCGGG CCGCCTGGAC CTGGGGACCA CGATCGACTA CGTGGACCGC GACCTGTCCA CCGACGGCCC CCTGTCCTGG GCGATGCTCT CGGTGGAGCC GCTGCACGTG GCGCTGTGGG CCGACCACCC CCAGGCGAAG CGGCCGCTGG TCAGGCTGGG CGACCTGGCG GGGACGCCGT GGGCGCTGAC CCCGCCGGAC GGCACCGGCT GGCCCGAGTG CTTCTACCTG GCCTGCCAGC GGGCCGGGTT CACCCCCGAT GTGCCCTACC GGCTCTACGA CCGCTCCGAG ATCCGCGACC TCATCGCCGA CCGCCGGGCG GTCGCGCCGT GCCAGCCGGA CTTCGACACC GGGCCGGACG TGGTGGTACG CCCCCTGGAG GGCGAGCCCA TACAGCTGCG CCACCTGCTG GTCTGGCGGC GCGACAGTCC GATCCAGCAG GCCTCCGACA CGATCGCGCG GCTGGCCCGC GAGATCCTGG GCGGCCCGTC CCGCACACCG GACCCCCCGG CCCCGGGCGA CGCCGCGTCC GCTCTCGACC GGGCGCCGGG CGAGCGGCGC AACGGACGCG CGCCGGGGGC GCTCAACGGC TGA
|
Protein sequence | MDLELRHLRT ICLLADTGSV TKAAAALSTS QPALTTQLQR IEREVGGPLF HRGRSGVRPT ELGEFVLVRA RSVLLSMDDL LHDITARGAS PTTLRVGGVG PLTLELSARI PEVFPNVPVH VRTEYSPKLV TDLVRAGRLD LGTTIDYVDR DLSTDGPLSW AMLSVEPLHV ALWADHPQAK RPLVRLGDLA GTPWALTPPD GTGWPECFYL ACQRAGFTPD VPYRLYDRSE IRDLIADRRA VAPCQPDFDT GPDVVVRPLE GEPIQLRHLL VWRRDSPIQQ ASDTIARLAR EILGGPSRTP DPPAPGDAAS ALDRAPGERR NGRAPGALNG
|
| |