Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1068 |
Symbol | |
ID | 9244914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1314554 |
End bp | 1315543 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, ArsR family |
Protein accession | YP_003679016 |
Protein GI | 297560042 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.380277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTCGAA ACAGGGTCCG CTTCCGGATC GGGGCGGCCG ACGTCTCGGC GATCCGGTTC GGCGTCTCAC CGGGCCACGA ACTGTGCCAC GCGGTCCGGG TCCTGCTCGA CCCCTCAAGC CACCCGCTCC AGTGGGGCTG GCTGCGCGCG GTGCGGGGGT CGCTGCCGAG GCGGTCGTTC GAGTACCTGG CCACGGTGAT CGGCCACGAG GGTTACTTCC CCGACTTCCT CACGGCCGCC CCGTCCTGGG ACATGACCCC CGAGGAGGAA GTCGAGCGCC TGCGGCGCGT GCCGCCCGAA GCCGTCCGGG CCGACCTGAC CAAGGTGCTC GCGCGCTCGC GGGGGCGCCG CCACACCCTG GTGGCGGACA TGTTCGCCCA CCCCGAGCGC ACCCGCGCGC TGGTCTCGGA CGCCTGGACG GCGGTCTGGG AAGCCGCACT GGCCCCGCAC TGGCCGCAGA TCCGGCGTCT GCTGGGCGCG GACATCGACC TGCGGGTGCG GCGCATGGGC GCGGGCGGCG TGGCCGCGAT GGTGGGCACG CTGCACGAGT CCGTGGGCTG GCACCGGGAC GCGATCGAGG TCGGGATGCG CAGGCACGAC GAGGACGTGG CCTGCGAGGG CAGCGGCGTG GTGCTCACGC CCTCCGTCAT GAGCACGCCG CGCTGCTCGG TGCTCACCGA GCCGCCCGTG CAGCCGACGC TGTTCTACCC GGTGCACGGA CTGTCGGCGT CGTGGACGCG TGAGGGCACG GCCGCCGCGC AGGCGCTGGA GGAACTGCTC GGGTCGGGGC GCGCCCGCGT CCTGCTGGCC CTGGACGGGC CCCGGTCCAC GTCGGAGGTG GCCGGGGACT GCGCGATGGC GGTCTCCACG GCCTCGCACC ACCTGTCGAT CCTGCGGCGG TCGGGGCTGG TGGATGGTCG CCGGGAGAGC ACACGGGTGA TGCACGCGCG CACGCCGCTG GGCGAGGCCC TGGTGAGCGG AGGGGCCTGA
|
Protein sequence | MRRNRVRFRI GAADVSAIRF GVSPGHELCH AVRVLLDPSS HPLQWGWLRA VRGSLPRRSF EYLATVIGHE GYFPDFLTAA PSWDMTPEEE VERLRRVPPE AVRADLTKVL ARSRGRRHTL VADMFAHPER TRALVSDAWT AVWEAALAPH WPQIRRLLGA DIDLRVRRMG AGGVAAMVGT LHESVGWHRD AIEVGMRRHD EDVACEGSGV VLTPSVMSTP RCSVLTEPPV QPTLFYPVHG LSASWTREGT AAAQALEELL GSGRARVLLA LDGPRSTSEV AGDCAMAVST ASHHLSILRR SGLVDGRRES TRVMHARTPL GEALVSGGA
|
| |