Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2189 |
Symbol | |
ID | 9246039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2616442 |
End bp | 2617932 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003680117 |
Protein GI | 297561143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000683339 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000292513 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGCCATA TGCAACAGTC ACCGACGGGC CCTCACCTCC CGCGAAACGC CCACCGCACC CAACCCGGGC CCCGCTCACC CCGCACGCGG GGGACACACG GGGGAGGGGC GCCCGTCAAC ACCACACCGG GCGACGCCCG GGCCGCGAAC ACACCCCACC CCCGAACACA CCCCCAGCCT CACCCCGCCG GGGGAGGCGC CACCGCCCCC GCCTGTGCCA CGATCACCCA CGTGCGCCAC CTCACCCCGC TCCTGCGCCC GCTCACCAGC GCCCACACCT ACCGGCGCTG GGCCTACCTC GTCATCGGCG GCGCGCTCCT GATGCCCTAC GCCATGGCCG CCCTCGTCCT GGCGACCCTC GTCAACCCCA CCCCCGCACC CGCCGCCTTC GCCCTGACCC TCACCGCGGC CCTGGCCGCC GTCGCCGCCA CCGCCTACCT GCCCGGCACC CGCAGCGCCC AGACCCACCT GGCCCGCGCC CTGCTCCGCG GCCCCCTCAC CTCCACACCC CGCCCCAACC CCACCTGGCG CAGCAACGCC CGCACCAGCC TGTGGCTGTG CCTGCACATG CTCACCGGCA TGGCCGTGTG CCTGCTCACC ATGGTCGCCC TGACCGAGGC CGCCCTCCTG GCCGTCGCAC CCCTGACCCG CGACGTCGCC ACCATCGCCC AGGGCCCCCT GACCTTCATG GGCCAGACCG CCCTGACCCC CGCCCAGCGC GCCCTGGGCC CCCTCATCGG CCTCGCCCTG CTCGCCGCCC TGGTCTACAC CACCGCCCTG GTCGGCCACC TGCTCACCCT GGCCGCACCC CGCCTGCTGG GCCCCTCACC CACCGAACGC CTGGCCGCCG CCCAGGCCCA CGCCCACACC CTCGCCGAAC GCAACCGTCT GGCCCGCGAA CTCCACGACT CCCTGGGCCA CGCCCTGTCC GTGGTCACCC TCCAAGCCGC CACCGCCGCC CGCCTGCTCG ACACCGACCC CGACTTCGCC CGACAGGCCC TCACCCACAT CGCCGACCAG GCCCGCACCG CCACCGCCGA CCTCGACCAC GCACTGGGCA TCCTGCGCGA GAACACCCCC ACCCCACGCA CCACACCCCC TGACCTGGCC CACCTGCCCC ACCTGGCCCG CGCCACCCAA CACACCGGCA CCGACCTGAC CCTGCACCTG AACGGCGACC CCGCACACGT CCCCGCCCTG CTCTCCCGCG AGACCTACCG CATCAGCCAA GAAGCCCTCA CCAACGCCCT GCGCCACGCC CCCGGCCAAC CCCTCACCCT CACCCTGGAC ATCACCCCCA CCGCCCTGAC CCTGACCCTC ACCAACCCCC TCCCGCCCAC CCGCGCCCAC CGCACCCCGC GAGGGCGCGG CCACCACGGC ATCACCGGCA TGCGCGAACG AGCCCACCTG CTCGGCGGCA CGCTCACCGC CGGCCCCCAC CACGGCACCT GGCGCCTGAC CTGCCGCCTG ACCTGGAAAG AACAACCGTG A
|
Protein sequence | MCHMQQSPTG PHLPRNAHRT QPGPRSPRTR GTHGGGAPVN TTPGDARAAN TPHPRTHPQP HPAGGGATAP ACATITHVRH LTPLLRPLTS AHTYRRWAYL VIGGALLMPY AMAALVLATL VNPTPAPAAF ALTLTAALAA VAATAYLPGT RSAQTHLARA LLRGPLTSTP RPNPTWRSNA RTSLWLCLHM LTGMAVCLLT MVALTEAALL AVAPLTRDVA TIAQGPLTFM GQTALTPAQR ALGPLIGLAL LAALVYTTAL VGHLLTLAAP RLLGPSPTER LAAAQAHAHT LAERNRLARE LHDSLGHALS VVTLQAATAA RLLDTDPDFA RQALTHIADQ ARTATADLDH ALGILRENTP TPRTTPPDLA HLPHLARATQ HTGTDLTLHL NGDPAHVPAL LSRETYRISQ EALTNALRHA PGQPLTLTLD ITPTALTLTL TNPLPPTRAH RTPRGRGHHG ITGMRERAHL LGGTLTAGPH HGTWRLTCRL TWKEQP
|
| |