Gene Information |
Locus tag | Ndas_3048 |
Symbol | |
ID | 9246904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3642226 |
End bp | 3645123 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | putative sensor with HAMP domain |
Protein accession | YP_003680964 |
Protein GI | 297561990 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
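The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3642226
end_bp = 3645123
reported_gene_length_bp = 2898
reported_protein_length_aa = 965

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length_aa = codon_count - 1

print(gene_length_bp == reported_gene_length_bp)                # True
print(gene_length_bp % 3 == 0)                                  # True
print(expected_protein_length_aa == reported_protein_length_aa)  # True
```

Under these assumptions the reported gene length (2898 bp) and protein length (965 aa) agree with the coordinates.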
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.138126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.807901 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGAA CCCGCGACAA CGCGCCCACG ATCCGGAGAC AGCTCACCCG CATCGTGCTG ATCCCGAGCC TGTGCTTCCT CGCGCTCTGG CTCGTCGTGG CCGCCATCGG CACCATCCGC GCCGTCCAGT TCATGGGAGC CGTCGTCCAG GCGCGCGAGG GAACACAGGT CTTCTCCGCC GCGGCCGACG AGGTGCGGGC CGAACGCCGC CTCTCCCTGG TCCACCTGGG CCGATTCGAG AGCACCGGAA CCCGCGACGA CGAGGTCGGC GCCGCCCTCG ACGAGCAGCG CGAGGCCACC GACACCGCCA TGGCCGAGGC CGTCGCGTTC GCCGAGCGAC TGCGCGGAAC CCGCGACGAC GAGGTCGAAC GCAGCGCGAC CGACCTGGTG GACAGCGCCG GGATCCTCGA CGAGGTCCGC GCGGACGTGG ACGCGCACGA CCTCGGGCAG GAGGGGACCA TCCTGCGCTA CGGCGAGGTC CTGGCGGGCA CCACCGGCGC CATCACCGCG CTCGTGCACA CCACCGACGG CGGGGAGAAC CTCACCGACG CCGTCCTGAC CAGCGAGCTC ATGGGCGCGT CCTCCGCCTA CTCCACCGCC GACGCGCTGC TCGCGGGAGC CATCGCCCGG GGCGAGATGA GCTACGAGGA GACGGCGCAC TTCACCTACC TCACCGCCGC CTACCGCGAC ACCCTGGAGC CGGCCAGCGC CGCGATGCAC CCGAGCGTGG CCGCGCGCTA CGAGGAGCTG GTGTCCCTGC CCGCCTGGTC CCGGGCGGAG GAGCTGAGCC GTCGGGTGGT CACCCGCCAG CCGCTCGCCG AACCGGACGA ACCCGGGCAG GGGACCACGG GCTCGGACTG GAACGCCGAC GTCGGCATCA GGGCGCAGTC GTGGGACGAC TCCTCCGCAG AGGCCGCCCT GGCGCTACAG GACCTGGCCG GGCTCCAGGC CCGGCGCACT ATCGACCTGG CCTGGAGCGC CGCACTGCTG CGCGTCTTCC TGGGCGTGGC GGCGGCCGCG CTCACCCTGG CGGGCGGCGC GGTGGCCATC GCCGTGGTCG GCCGCTCCTC GCGCCGCCTC ACCGATCGCC TGACCCACCT GCGCGAACAG ATCCTGGACC GCGACGGCGA CCTGCCCGAC ATCGTCGACC GCGCCCAGCG CGGCGAGAAG GTCGACGTCC AGGAGGAGCT GCCGCCGCTG GACGACTGGG GGGACGACGA GATCGGCCAG GTCGCCGAGG CCTTCGACGC CGCCCAGCTC ACCGCCGTGG AGTCGGCCGT GCTCCAGGCC GAGATCCGCC GGGGGGCCAA CCGCGCCTTC CTGGGCATCG CCTTCCGCAA CCAGGCCCTG GTCCAGCGCC AGCTCCGCCT GCTCGACGAG ATCGAGTACC ACGAACAGGA CCCCGAGGCG CTGCGCCGCC TGTTCCGCCT GGACCACCTG GCCACCCGCG CGCGCCGCTA CTCCGACAAC CTCATCATCC TCGGCGGGGG GCAGTCGGCC CGCCGCTGGC GCCAGCCCCG CCCGCTCGTG GACGTGCTGC GCGCCGCCAT CGCCGAGACC GAGGACTTCG AACGCGTCCG CCTGACCTCG GCGCCCCGCG TGCTCATGCA CGGGCAGGTG GTCGCCGACG TGGTGCACCT GCTCGCCGAG CTGGTCGAGA ACGCCACGCA GTTCTCCCCG GCGGGAACGC CCGTGGACGT CGGCTGCACC CCGGCCGCGG AGGGGCTGGT GGTGGAGATC GAGGACCGGG GCCTGGGCAT GTCCGAGCGC GGCTACGCGG AGGCCGAGCG CACGCTCACC 
CAGCCCCCCG AGTTCGACGT CATGGCCCTG CCCGAGGACC CCCGGCTGGG CCTGTTCGTG GTGTCCCGCC TGGCGGGGCG GCACGGGGTC CGTGTCTGGC TGCGGCCCTC TCCCTACGGG GGCACCCGGG CGACCGTGCT CATCCCCGCC TCGCTGCTGG AGCCGGTCGA CAACCTCGTG ACGGTCGCGG GCACGCCCAC GGGCCGCCCG GCGGCGAACG GACGGGAGGC CCGCGTTCCG GCCGGGTGGG GGGCGCCTCC CGCGAGGGCC GCCCTCACCA CGCGCGGCCA CGGCGCGGAC GGCACCGTCC ACGGCCGGAC CGGCCCGCAG GCGCCCGTCA CACCTTCGGG GCAGACAGGT CCGCAGCTCT TCGCGCCGGG GCCGGACCGA ACCGGTCCGC AGGTTCCCGT GCCACCTCTC GGCCAGAGCG GCCCCCAGCC CTCGGTACCG CGATCGGACC GGGCCGAACC ACGGGTTCCT GTGCCGCCTT CCGGGCAGAG CGGTCCACAG CCCCTGGCAC AGCGGTACCG GACCGGTCCG CAGGTTCCCG TGCCGCCTCT CGGCCAGAGC GGCCCCCAGC CCTCGGTACC GCGATCGGAC CGGGCCGAAC CACGGGTTCC TGTGCCGCCT TCCGGGCAGA GCGGTCCACA GCCCCTGGCA CAGCGGTACC GGACCGGTCC GCAGGTTCCC GTGCCGCCTC TCGGCCAGAG CGGCCCCCAG CCCTCGGTGC CACGGGCGGA CCGGGCCGAA CCACGGGTCC CCGTGCACGC GGGCCAGACA GGTCCGCAGC CCTCCGCGGT CTCCGCCGGG CAGACCGGCC CGCAGGTCCC CGTGCCGCCT TCCGGACAGA CCGGACCGCA GCCCTTCTCC CCGCAGCCGG ACCGGCCCGG CCCGCACGGC GTCGTCCTCG CGCACACCGG ACCGCAGAGG TCCGTTCCCG CCTCCGGCGG TCCCGGCGGC ACCGCTCCGC CGGGGGAGCA CCTCCTGCCC GACCCGCACG GCCACCCGCT CGCACAGGAA CCCGTCCCCG AACCGTGGCA CGACGGCGCG CTGCACGCAC CCGCCCCCAC ACAGCAGAGG GGGAACGGTG AGAGCTGA
|
Protein sequence | MRRTRDNAPT IRRQLTRIVL IPSLCFLALW LVVAAIGTIR AVQFMGAVVQ AREGTQVFSA AADEVRAERR LSLVHLGRFE STGTRDDEVG AALDEQREAT DTAMAEAVAF AERLRGTRDD EVERSATDLV DSAGILDEVR ADVDAHDLGQ EGTILRYGEV LAGTTGAITA LVHTTDGGEN LTDAVLTSEL MGASSAYSTA DALLAGAIAR GEMSYEETAH FTYLTAAYRD TLEPASAAMH PSVAARYEEL VSLPAWSRAE ELSRRVVTRQ PLAEPDEPGQ GTTGSDWNAD VGIRAQSWDD SSAEAALALQ DLAGLQARRT IDLAWSAALL RVFLGVAAAA LTLAGGAVAI AVVGRSSRRL TDRLTHLREQ ILDRDGDLPD IVDRAQRGEK VDVQEELPPL DDWGDDEIGQ VAEAFDAAQL TAVESAVLQA EIRRGANRAF LGIAFRNQAL VQRQLRLLDE IEYHEQDPEA LRRLFRLDHL ATRARRYSDN LIILGGGQSA RRWRQPRPLV DVLRAAIAET EDFERVRLTS APRVLMHGQV VADVVHLLAE LVENATQFSP AGTPVDVGCT PAAEGLVVEI EDRGLGMSER GYAEAERTLT QPPEFDVMAL PEDPRLGLFV VSRLAGRHGV RVWLRPSPYG GTRATVLIPA SLLEPVDNLV TVAGTPTGRP AANGREARVP AGWGAPPARA ALTTRGHGAD GTVHGRTGPQ APVTPSGQTG PQLFAPGPDR TGPQVPVPPL GQSGPQPSVP RSDRAEPRVP VPPSGQSGPQ PLAQRYRTGP QVPVPPLGQS GPQPSVPRSD RAEPRVPVPP SGQSGPQPLA QRYRTGPQVP VPPLGQSGPQ PSVPRADRAE PRVPVHAGQT GPQPSAVSAG QTGPQVPVPP SGQTGPQPFS PQPDRPGPHG VVLAHTGPQR SVPASGGPGG TAPPGEHLLP DPHGHPLAQE PVPEPWHDGA LHAPAPTQQR GNGES
|
| |
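The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not a reference implementation: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which is assumed here to give the same amino-acid assignments as translation table 11 (the two differ in which start codons are permitted). Only the first three blocks of the gene sequence are used as a demonstration.

```python
# Standard genetic code in the usual TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown in this record.
demo = clean_sequence("ATGCGCAGAA CCCGCGACAA CGCGCCCACG")
print(len(demo))          # 30
print(translate(demo))    # MRRTRDNAPT
print(gc_fraction(demo))  # 0.7
```

The translated fragment matches the first block of the protein sequence (MRRTRDNAPT). Applying the same helpers to the full gene sequence would allow the reported GC content and protein length to be checked in the same way.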