Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5197 |
Symbol | |
ID | 9249090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 340507 |
End bp | 341982 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003683083 |
Protein GI | 297564110 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.449609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.98893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCCG CGCGCACCGC CGTCCCGAGG TGGCGCCGGA CACTGCTCCA ACCGGGCGAC TGGCGGCTGG GCACCCGCTT CGCGGTGATC TTCGCGCTGG TCGCCACCGT CGTCATCGCC CTGGTCGGCA CCCTCGCCTA CACCACCGCC GCCGCGCTCA TCCGCTCCGA CGCCCGCACC GAGTTCGAGA GCACCGTCAC CGCCCTGTCC GACCAGCTCG TCGACTACCA CCAGCAGGGG CGGGGCGGCT CCGCCGGTCC CGGACCGTTC CTGCCCAGCG ACAAGTTCCA GCTCCAGCTC CTCGAACCCG ACGGCAGCAG GACGGTCAAC ATCGCCGACC CGTCCGAGAT CATCCTGTTC GCACCGTCGC AGAGGGACCT GGAGGTCGCC GAGGAGGCCC GGCCCGGCAT CGTCGACATG CGCGAGCAGG CCATCGGCGG GCAGGAGTAC CGGCTGGCCA CGGTCTCCCT CGGCGACGGC GCCGGGGCCC TCCAGCTCCT CCAGCGCCTG TCCCCGACCG AGCTGATGAT CGACCGGCTG GCCACGCAGA TCCTGTGGGT GGGCCTGTTC GTCGCCCTGT GCGCGGCCGC CGCGGGCTGG CTGGTGGGCC ACCGCACCAC CGGCCGCCTG GTGCGCCTCA CCGAGGCGGC CGAGTACGTC AGCTCCACCG GGCGGCTCGA CCCCGTGGAC CCCGGCCGGA GCGGCGAGAG CCGCGAGGAG GACGTCGGCC GCGACGAGGT CGGCCGCCTC ACCAGCGCGT TCAACGCCAT GCTCGCCCGG CTGGCCCGTT CCAAGGACGA GCAGCGCCGC CTCGTCCAGG ACGCCGCGCA CGAACTCCGC ACCCCGCTGA CCAGCCTGTA CACCAACGTG CAGGTGCTCG ACAGGGTGGA CCGGCTCAGC CCGGAGGCGC GCGCCGGCCT CATCGAGGAC CTGCGCGGCG AGACCCGCGA ACTCACCGCC CTGGTCAACG AACTGGTCGG CCTGGCCACC GGCGACCACG AGGACGAGCA GATGAGCGCC GTCCCCCTCG CCGGGATCGC CGAGAAGGTC GCCAAGCGCA CCCGTCGCCG CACCGGCCGC GACATCGTCG TGGACGCCGA CGACAGCGTC GTGTGGGGAC GCCCCGGCTC CCTGGAGCGC GCCGTCTCCA ACCCGGTCGA GAACTCCGCC AAGTTCGACC CCGAGGGCAC CGCGCCCATC GAGATCCGCG TGCGCGCTGG GACGGTGGAG GTCCTGGACC GGGGGCCCGG CATCGACCCG GCCGAACTCG ACCACGTCTT CGAGCGCTTC TACAGGGCCG CCGTCGCCCG CGGCCTGCCC GGTTCGGGGC TCGGCCTGTC CATGGTCAGG GAGATCGCGC AGGCGCACGG GGGCAGGGTG TTCGCCCGCA ACAGGGAGGG CGGCGGCGCC GCCATCGGCT TCCACCTGCC GCTGTTCACA CCGCCGCGGG ACGGGGAGCA GAAGGCGCGC GGGTGA
|
Protein sequence | MSAARTAVPR WRRTLLQPGD WRLGTRFAVI FALVATVVIA LVGTLAYTTA AALIRSDART EFESTVTALS DQLVDYHQQG RGGSAGPGPF LPSDKFQLQL LEPDGSRTVN IADPSEIILF APSQRDLEVA EEARPGIVDM REQAIGGQEY RLATVSLGDG AGALQLLQRL SPTELMIDRL ATQILWVGLF VALCAAAAGW LVGHRTTGRL VRLTEAAEYV SSTGRLDPVD PGRSGESREE DVGRDEVGRL TSAFNAMLAR LARSKDEQRR LVQDAAHELR TPLTSLYTNV QVLDRVDRLS PEARAGLIED LRGETRELTA LVNELVGLAT GDHEDEQMSA VPLAGIAEKV AKRTRRRTGR DIVVDADDSV VWGRPGSLER AVSNPVENSA KFDPEGTAPI EIRVRAGTVE VLDRGPGIDP AELDHVFERF YRAAVARGLP GSGLGLSMVR EIAQAHGGRV FARNREGGGA AIGFHLPLFT PPRDGEQKAR G
|
| |