Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2643 |
Symbol | |
ID | 9246494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3152512 |
End bp | 3153882 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003680566 |
Protein GI | 297561592 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.351947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGGG AGACGGGGCC TCCCGGGGCC GGCAACGGCT CCGCCACCGA CGCGGCCCTC AGCGCGCTGC TGGAGGACAG CGCCGAGGAA CTCTACGAGT CCGCGCCGTG CGGCTACCTG TCCACGCTCA TGGACGGCAC GGTCGCCAGG ATCAACGCGA CGCTGCTGAG GTGGCTCGGC CTGGAGCGCG CCGCCGTGGT GGGCCGCATG CGCTTCACCG ACCTGCTCAC CGTGGGCGGC AGGCTCTACC ACGAGACGCA CTTCGCGCCC CTGCTGCACC TGCGGGGCGA GGTCAACGGC ATCGCCCTGG AGATGCGGGC CTCCGACGGC GGCCGCCTTC CCGTGCTGGT CTCCTCCACC GTCAAGCGCG ACGGCGGGGG CCAGCCGCTG CTGGTCCGCA CCACCGTCTT CGACGCCACC GACCGCCGCT CCTACGAGGA GGAGCTGCTG CGCCGCCGCA GGGAGGCGGA GCAGGCCCGC GCCGAGGCAG AACGGGCCCG CGAGGAGGCC GAACGGGCCC GCGCCGAGGC CGAGGAGGCG CACCGGCGGG CCGAGGCGGA CCGGGCGCGC CTGGGAGAGG CGCTCGTCAT CCTCCAGAGG GCCCTGCTGC CCGACACCCT GCCCGACGTT CCGGGCATGG AGGCCGCCGC CTACTACCAC ACCGCCTCCC CCTACCGGCT GGGCGGCGAC TTCTACGACC TCTTCCCGCT CGGTGACGGG TGCTGGGCGT TCTTCCTCGG CGACGTGAGC GGCAAGGGGC CCGAGGCCGC GACCCTGACC TCCCAGGCCC GTTACGTCCT GCGCACCACC GCCCTGCACT CGTCCGAACC CGCGGACGCC CTGGGCACGC TGAACACCGC CCTCCTGGAG CGCTACGCCG ACAACGGCGA CCCCCGCTAC TGCACCGCCG TCTTCGGTGT CCTCGAACCC GACGGCGACG CCGGGCACGT CCGTGCCCGC CTGGCCTTCG GCGGCCACCC CCCGGCGCTG GCCCTGCGCG GGGACGGCCG GGCCGAGTTC CTGTTCGCCC CCGGCGGGAT GCTCGTCGGC GTGCTGCCCG ACGCCCACTT CGCCACCGTC GAGACCGCCC TCGCCCCCGG CGACACCTTC GTGCTCTACA CCGACGGCCT GACCGAGGCC CGCACCGGCG CCGGACCCGA CACGATGTAC GGCGAGGAGG CCCTGCTCTC CTTCGTCGCG CGGCACGCCC CCTCCACGGC CCACGGCGTC GTCGACGCGC TGGCCGAACT GCTGGAGGGT TTCGGCGAGG GCTTGGAGGA CGACACCGCC CTCCTCGCGC TCGGCGTCCC CGCCCCGCCC TCCACCGCCC CGTCCGGAAC CGGACACCAC ACGATGAGCG GTCCACGATG A
|
Protein sequence | MSGETGPPGA GNGSATDAAL SALLEDSAEE LYESAPCGYL STLMDGTVAR INATLLRWLG LERAAVVGRM RFTDLLTVGG RLYHETHFAP LLHLRGEVNG IALEMRASDG GRLPVLVSST VKRDGGGQPL LVRTTVFDAT DRRSYEEELL RRRREAEQAR AEAERAREEA ERARAEAEEA HRRAEADRAR LGEALVILQR ALLPDTLPDV PGMEAAAYYH TASPYRLGGD FYDLFPLGDG CWAFFLGDVS GKGPEAATLT SQARYVLRTT ALHSSEPADA LGTLNTALLE RYADNGDPRY CTAVFGVLEP DGDAGHVRAR LAFGGHPPAL ALRGDGRAEF LFAPGGMLVG VLPDAHFATV ETALAPGDTF VLYTDGLTEA RTGAGPDTMY GEEALLSFVA RHAPSTAHGV VDALAELLEG FGEGLEDDTA LLALGVPAPP STAPSGTGHH TMSGPR
|
| |