Gene Information |
Locus tag | Ndas_1895 |
Symbol | |
ID | 9245745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2312487 |
End bp | 2314211 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003679829 |
Protein GI | 297560855 |
COG category | |
COG ID | |
TIGRFAM ID | |
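The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon (consistent with the record stating the gene is not spliced). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2312487
END_BP = 2314211
GENE_LENGTH_BP = 1725
PROTEIN_LENGTH_AA = 574


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the span ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2314211 - 2312487 + 1 gives 1725 bp, and 1725 / 3 - 1 gives 574 aa, which agrees with the listed gene and protein lengths.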
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.757129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGAGG ATTCCGGTAC GTACAGCGAT TCGACCGGTG TGTCAGGTCA ACACCGGCTC CTCGCGGAGG CGCTGGACAG CCTGAGCGCG GGTGTGTACG CCGTCGACGA GAGGGAGCGG ATCGTCGCGG TCAACGCCAC CGCCCTGCGA CTGCTGGCCC GTTCGGCGGA CGAGACGCTC GGGCAGGACC TCCGGTCGTT GCACCGGGAC GCCCGGGGGC AGGCGGTCGT CAGGGAGGCA CGCGACGCCT CCCAGGGGGC GCCGTCCGGT TCCCTCTCCC GCGCGGGGGA GTCCTGGTTC CAGCGCGGCG ACGGCACGCT CCTCCCCGTG TCGTGGTCCG CCGTGCCCTG TAAGCCCGAC GGTTCCCAGG TCACGGAGCT GGTCTTCTTC CAGGCCGTCG AGCAGGACGA GGGGACCCCC GGGCGCTCCA CCCCTTCCCG ACGGACACTG TCGGAGACGG AGCGGCTGGC CCTGCTGGCC GACACCACCG CACACCTGAT CAACAACGTC GACGTGGAGA AGTCCCTGCT CCGGGTGGTG GAGCTGATGC TTCCACGACT CGCGGACTGG GCGATCATCG ACCTGATCAC CGAAGGCGAC GAGGTGTCCC GCTCCCTGGT GGTCCAGGCG GACCAGGGCA GGACCACGGT GCGCGAGGAC CTCCAGGGAC CGATGCCCCC GGTGCCCCCG ACCTCCAGCA TGCCCCTGTC CAAAGCCCTG CGAGGCGCCG CCTCCACCCT CGTCAAGCGT GAGATCTACT CCGGTCCGCC CGACACGGGT ATCGCGGTGG AGCAGCGCAG GCTGTTCGAG GCGACGGGCA TCAACACCGC GGCCATCGCC CCCATCCGCG GGCCCCGGGA GGTGCTGGGC GCCGTGACCC TGGGGCGCAC GGGCTCCCAG CACCCCTTCG CCCGCGACGA TCTCGCCCTT CTGGAGGACA TCGCCCGACG CATCGGCCTG GCCCTGGAGA ACGCGCGCCA CTACCAGCGC CAGCGCCAGG TCGCCGAGAC CATGCAGCGC TACCTGCTGC CCCAGCTCCC CCGGCTGGCG GGAGCGGAGA TGACCGCCCG GTACCTGCCC GCACCAGACG TCTCGCACGT GGGCGGTGAC TGGTACGACG CCTTCCCCCT GCCCCGCGGC GACACGGCCC TGGTCATCGG CGACGTCGTC GGCCACGACC TGGACGCGGC GGCCGGGATG GCCCAGCTCC GCAACATGCT CCGGGCCTAC ACCTGGGCAC AGGAGCAGTC CCCCCACCGC ACGCTGGAGC GCATGGACCA GGCCCTGGAG CACATCAGCG ACGTCTACAT GGCGACCCTC GTCCTGGCCC ACCTGACGGT CGACGAGGCC GGACGGTGGG AACTGCTGTG GTCGAGCGCC GGCCACCCCC CGCCCCTGCT CGTCCACCAC GACGGCATCG CCCACTACCT GGAGGAGGGG AGCGGGGTCC TGCTCGGCAC GGGGATGGCG CGGCCGCGCG CCGACGCGCG CATCGCCCTG CCGCCCGGGT CCACGCTCGT GTTCTACACG GACGGCCTGG TCGAAGCCCG GGGGCAGTCA CTGGACACGG GCCTCAGACG TATGCGCCAG CACGCGGCCT CCCTCGCGCA CCGCCCCCTG AACTCCTTCG CCGACCAGCT GCTGGAGCGC GCACGGCCGA GAAGCAACGA CGACGACGCC GCCCTGCTCG TCATCCGTAT TCCGGCGGAC GCGACCGACG GCTGA
|
Protein sequence | MDEDSGTYSD STGVSGQHRL LAEALDSLSA GVYAVDERER IVAVNATALR LLARSADETL GQDLRSLHRD ARGQAVVREA RDASQGAPSG SLSRAGESWF QRGDGTLLPV SWSAVPCKPD GSQVTELVFF QAVEQDEGTP GRSTPSRRTL SETERLALLA DTTAHLINNV DVEKSLLRVV ELMLPRLADW AIIDLITEGD EVSRSLVVQA DQGRTTVRED LQGPMPPVPP TSSMPLSKAL RGAASTLVKR EIYSGPPDTG IAVEQRRLFE ATGINTAAIA PIRGPREVLG AVTLGRTGSQ HPFARDDLAL LEDIARRIGL ALENARHYQR QRQVAETMQR YLLPQLPRLA GAEMTARYLP APDVSHVGGD WYDAFPLPRG DTALVIGDVV GHDLDAAAGM AQLRNMLRAY TWAQEQSPHR TLERMDQALE HISDVYMATL VLAHLTVDEA GRWELLWSSA GHPPPLLVHH DGIAHYLEEG SGVLLGTGMA RPRADARIAL PPGSTLVFYT DGLVEARGQS LDTGLRRMRQ HAASLAHRPL NSFADQLLER ARPRSNDDDA ALLVIRIPAD ATDG
|
| |
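The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial code) implies when GTG is used as an initiation codon. The following is a minimal sketch of how the blocked sequence could be cleaned, translated, and checked for GC content. It is not the tool used to produce this record: the helper names are hypothetical, and the set of start codons is deliberately limited to three that table 11 is known to allow.

```python
# Standard codon table layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; an alternative start codon in first position is read as M."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_blocks = "GTGGACGAGG ATTCCGGTAC GTACAGCGAT"
    print(translate_cds(first_blocks))
    print(gc_fraction(first_blocks))
```

Applied to the first three blocks of the gene sequence, `translate_cds` yields MDEDSGTYSD, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would be one way to compare against the GC content field.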