Gene Ndas_3054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3054 
Symbol 
ID9246910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3648031 
End bp3650520 
Gene Length2490 bp 
Protein Length829 aa 
Translation table11 
GC content75% 
IMG OID 
Productputative sensor with HAMP domain 
Protein accessionYP_003680970 
Protein GI297561996 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAACA CGGGAGGCAC CGGCCGCAGC GCCATCAGGG CTCAGATCAA CAGGATCGTA 
CTGATCCCCA GCATCACCTT CCTGGCGCTG TTCGTCGTCC TCAGCACGGC CACGCTCGTC
CAGGCCGTGT CGCTGCGCTC CTCGGTCGGC GACGGCCGCG CCGGGATCCG GCTCGCCGCG
GCGCTCACCC TTCTCCAGGA GGAGCGCCGC CTGAGCGCCG CCTACCTCGC CGACCCCGCC
GAGGACGGCC GCGCCGCCCT GTCCGACGCG GCCTCGCGCA CCGACGAGGC ACTCGTCCCC
GTCCACGACC TGCGCGGCAC CATCGGCGAC CGGGACGACC CGGCCACCGA ACCCCTCGCC
GAGGACTTCT TCACCTCCCT GACCGAGGCC ACGGAACTGC GCGCCCAGAA CCTCGCCGAG
CCCGGTCCAC CCGAGGAAGC CCTCACCGCC TACACCACCG CGATCGGACA GGGCATCCGC
CTCTACGCCG GCACCGCCCG CTCCCTCGAC AACGGGCCCG CCACCGCCGA GGCCGCCGCC
GTCACCGACC TGATGTGGGC CCAGGAGAGC TTCAGCCGGG CCGACGCCCT CATCGCGGCC
GTGCTCGCCG AGGACTCCCT CGACCGCTTC CAGCAGGCAC AGGTCGTCGC GCTCACCGCG
GACGCCCGCC ACCGCGTCGA CATGGCCCCT CCCGGTCCCG CCGCAGACGA CGGCGAGCCC
CCCGCCCTGA CCGCACTCGC CGAGAGCCGC CCCTGGCAGG ACGCCCTGGC CATCGCGGAC
ACCCTCGCCA CCCACGAGGC CCCGGTCATC GTGGACGTCC TCAGCGGCGA ACAGACCCGG
GACCCGACGC CGCCCGAGGG GCTCGGGGGC TGGCGCGAGG CCGCCGACCA GGTCAACGCC
GAACTGGCCG ACATCACCGC CACGCGCGCC GCCTCCGTCG TCACCGCCAC CGAGCAGGCC
AGTTCCTGGA TGCTCACCCT CGCGCTGGGC GGCAGCATCA CCTCCCTGTT CGCGGGCACC
CTCGCCTACG GTGTGGCCGC CCGCTCCGCC GGACGCCTCA CCCACCGGCT CGCCCAGCTG
CGCGCCGACA CCCTGGGCAG CGCGCGCAGC GACCTGCCCC GTATCGTGCG CCGCCTGGAG
TCGGGGGAGC AGGTCGACCT CGACACCGAG ATGAAGCAGC TCGACCACGG CGACGACGAG
GTCGGCCAGG TCGCCGACGC CTTCAACATC GCCCAGCGCA CCGCCGTCGC CGCCGCCGTC
AAACAGGCCG ACATCCGCGC GGGCGTCAAC CGGGTCTTCC TCGGCATCGC CCACCGCCAC
CAGTCCCTCC TCCAGCGCCA GCTCCAGCTG CTGGACCGCG TCGAGCGCGA GGAGGAGGAC
CCCGACCTGC TGGAGAGCCT CTTCCAGCTC GACCACCTGG CCACCCGGGG CCGCCGCCAC
GCCGAGAACC TCATCATCCT GGGCGGCGCC CAGCCCGGCC GCCGCTGGCG CCACCCCGTC
CCGCTCGTGG ACATCCTGCG CGGCGCGATC TCCGAGACCG AGGAGTACAC CAGGGTCCGA
CTGACCTCCG TCCCCGACCT GTCGCTGTCC GGCGCCGCGG TGGCCGACGT CATCCACATG
CTCGCCGAGC TGGTCGAGAA CGCCACCGCC TACTCGCCGC CGCACACCCA GGTCACCATC
GCCACCGAGT CCGTGCCCAA GGGCGTGGCC GTGGAGATCG AGGACCGGGG CCTGGGCATG
ACCGAGGAGG TCCTGACCAG CTCCAACCGC ACCCTCAGCG AGGCGCCCGA GTTCGACGTC
ATGACCTCCG GCGGGGACTC GCGCCTGGGC CTGTTCGTCG TCGCCCGCCT CGCCGCCAAA
CACGACATCC GGGTGCAGCT GCGGCACTCG CCCTACGGCG GGACCCGGGC CGTGGTGCTC
ATCCCCGGCG CCCTGGTCGC CTCCTCGTCC AGCGCCCCGG AACGGCCCCG CGCGGCCATC
CGCCAGGGCG CCCACTCGGT ACTGCGTGAG CCGTCGGCGC AGCACGGTGA CGCGCTCACC
ACGCAGGACG CCCCCCACCG GAGCCGGCCG CTCCTGCGTC CGGTCCCCCG CGACGGCGAC
GGCGCACGCC CGGCCCGGGG CACGCCCCTG CTGCGCCGGG TACCGGCGCC CGTGCCCGAC
TCCGAGACCG CGGACGCCGC TCCCGGGACC GAACCCGGGT CCGGCCGGGT CGGCGCCGCA
CCCGGTCGCC CCGGACTTCC CCGGCGCAGG CGCCAGGCCA GCCTCGCCCC CCAGCTCAGA
GCCGCGGACC CGCCGGAGTG GGACGACGCC GACCCCCCGG GCTCCTCACG GCGCACCCCC
GAACAGGCGC GGCAGATGAT GGACGCCTTC AGCGCCGGTA CCCGGCGCGG CCGGGCCGCG
GACATCGGGG TCGACGCCGA CGGACGTGAA CACAGCAGTG ACGGTCAGGT CGGTGCGAGC
GCCGACGATC ACACGGGAGA AAGCGACTGA
 
Protein sequence
MGNTGGTGRS AIRAQINRIV LIPSITFLAL FVVLSTATLV QAVSLRSSVG DGRAGIRLAA 
ALTLLQEERR LSAAYLADPA EDGRAALSDA ASRTDEALVP VHDLRGTIGD RDDPATEPLA
EDFFTSLTEA TELRAQNLAE PGPPEEALTA YTTAIGQGIR LYAGTARSLD NGPATAEAAA
VTDLMWAQES FSRADALIAA VLAEDSLDRF QQAQVVALTA DARHRVDMAP PGPAADDGEP
PALTALAESR PWQDALAIAD TLATHEAPVI VDVLSGEQTR DPTPPEGLGG WREAADQVNA
ELADITATRA ASVVTATEQA SSWMLTLALG GSITSLFAGT LAYGVAARSA GRLTHRLAQL
RADTLGSARS DLPRIVRRLE SGEQVDLDTE MKQLDHGDDE VGQVADAFNI AQRTAVAAAV
KQADIRAGVN RVFLGIAHRH QSLLQRQLQL LDRVEREEED PDLLESLFQL DHLATRGRRH
AENLIILGGA QPGRRWRHPV PLVDILRGAI SETEEYTRVR LTSVPDLSLS GAAVADVIHM
LAELVENATA YSPPHTQVTI ATESVPKGVA VEIEDRGLGM TEEVLTSSNR TLSEAPEFDV
MTSGGDSRLG LFVVARLAAK HDIRVQLRHS PYGGTRAVVL IPGALVASSS SAPERPRAAI
RQGAHSVLRE PSAQHGDALT TQDAPHRSRP LLRPVPRDGD GARPARGTPL LRRVPAPVPD
SETADAAPGT EPGSGRVGAA PGRPGLPRRR RQASLAPQLR AADPPEWDDA DPPGSSRRTP
EQARQMMDAF SAGTRRGRAA DIGVDADGRE HSSDGQVGAS ADDHTGESD