Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5077 |
Symbol | |
ID | 9248966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 222259 |
End bp | 223935 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | signal transduction histidine kinase regulating citrate/malate metabolism |
Protein accession | YP_003682964 |
Protein GI | 297563991 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.669392 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACGT ACCCCGCGCG CTTTCGACTG TCGCTCACCG GCTCCCTGTT CATCGGCTAC GTGCTCCTGC TGGCACTGGC CCTGGCCGCC GTGGGCGCGC TGTGGGCCGT CCACATGGAC CGGGCCACCG ACCGCCACTA CGCCGAGCGG GTCCTGAGCC TGGCCCGGTC CGTGGCCGTC ATGCCCGAGG TCGTCACGGG CCTGCAGTCC GCCGACCCGG CGGCCGAACT CGCCCCCCTG GCCGACCGGA TCGACTCCGC CACCAGCACC GAGTTCGTGG TCATCGCCTC CCCGGAGGGG ATCCGCTACT CGCACCCCGA CGACGAACTC ATCGGCCGCA CCGTGTCCAC CCCGCCCGGA CCCGCCGCGC AGGGGCGGGA GTGGGCCGGC GTGCAGGAGG GGACGCGGGG ACGCACCGTG CGCGCCAAGG TCCCGGTGTT CTCCGGCGGC GGCTCGGTCA ACGGCGGCGA CGCGCGCGGC GAGGTGGTCG GCTACGTCTC CCTGGGCATC CTCGCCTCCA GCGCCGCCAC CGAGGCCAGG GCCGCGGTCC CCGCCATACT GGGCACGGTG GCCGTGGTGC TGGTCCTGGG CGTGGCCGGC GCGTGGGCAC TCTCCCGCCA GGTCCGCACC AAGACCCACG GACTCGAACC CGCCGACATC ACCTCCCTGC TGGAGAGCCG CGAGGCCCTG CTGTACGCGG TCCGCGAGGG CGTGCTCGCC GTGGACGGCT CGGGCCGCCT CGTCCTGGCC AACCCGCCCG TCCGGGAGAT GCTCGGCCTG CCCGAGGACG CCGAGGGCCG GGGCCTGGAC GAACTCGGCC TGTCCGAGCG CGTCCGCGAT ATCGTCTCCG GCGCCGACCC CGGCGACGAC CGCCTCCTCC TGGCGGGGCA CCGCATCCTG GTCGCCAACC GGATGCCGGT CCACGTGCGC GGCCAGGACG CCGGGGCGGT CGTCACCTTC CGCGACCGCA CCGAACTGGA CCGGCTCACC GGCGAGCTCG ACGGCGCGCG CACGGTCACC CGCGGCCTGC GCGCCCAGAC CCACGAGTTC GCCAACCGGG TGCATACCAT CGCCGGAATG CTCGAACTCG GCGCCCACGA GGAGGCCCGC GCCTACCTCG CCGACCTGTC CGCGACGCAC AGCCGCACCA GCGCGGACAT CTCCCGGCAC GTCGGCGACT CCGCGCTGGC CGCGCTGACC ATCGCCAAGT CCGCGCAGGC CTCCGAGCTG GGTGTGGACC TGCGGCTGTC CCCCCTCACC AGCGTCCCCG CGCTGGACAG GGAGGTGCGC TCCGACGCGC TGCTCGTCCT CGGCAACCTG GTCGACAACG CGCTCGACGC GGTGGCCTCG GCCCCGCACG GCTGGGTGGA GCTGATGGTG CGGCTGCACC GGGCCGAGGG CACCGACCTG CCCCACGACC TGCTGGAGAT CCGGGTGACC GACTCCGGAC ACGGGGTGGC CGACGACGTG GCGGAGGAGA TCTTCCGGCT CGGGTTCACC ACCAAGGCGT CCCGGGACGG CGGCACGCGC GGGCTGGGCC TGGCGCTGGT CAAGCAGGTC TGCGAGGGAA GAGGGGGAAG CGTGGAGATG GAGGCGCCCG ACGCCGACGA GGGCGCGGTG TTCACCGCCT GCCTGCCCCT GCCGGGGGCG CGGGCGCCGC AGGGGGCGGC CCGATGA
|
Protein sequence | MITYPARFRL SLTGSLFIGY VLLLALALAA VGALWAVHMD RATDRHYAER VLSLARSVAV MPEVVTGLQS ADPAAELAPL ADRIDSATST EFVVIASPEG IRYSHPDDEL IGRTVSTPPG PAAQGREWAG VQEGTRGRTV RAKVPVFSGG GSVNGGDARG EVVGYVSLGI LASSAATEAR AAVPAILGTV AVVLVLGVAG AWALSRQVRT KTHGLEPADI TSLLESREAL LYAVREGVLA VDGSGRLVLA NPPVREMLGL PEDAEGRGLD ELGLSERVRD IVSGADPGDD RLLLAGHRIL VANRMPVHVR GQDAGAVVTF RDRTELDRLT GELDGARTVT RGLRAQTHEF ANRVHTIAGM LELGAHEEAR AYLADLSATH SRTSADISRH VGDSALAALT IAKSAQASEL GVDLRLSPLT SVPALDREVR SDALLVLGNL VDNALDAVAS APHGWVELMV RLHRAEGTDL PHDLLEIRVT DSGHGVADDV AEEIFRLGFT TKASRDGGTR GLGLALVKQV CEGRGGSVEM EAPDADEGAV FTACLPLPGA RAPQGAAR
|
| |