Gene Ndas_3048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3048 
Symbol 
ID9246904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3642226 
End bp3645123 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content75% 
IMG OID 
Productputative sensor with HAMP domain 
Protein accessionYP_003680964 
Protein GI297561990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.807901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGAA CCCGCGACAA CGCGCCCACG ATCCGGAGAC AGCTCACCCG CATCGTGCTG 
ATCCCGAGCC TGTGCTTCCT CGCGCTCTGG CTCGTCGTGG CCGCCATCGG CACCATCCGC
GCCGTCCAGT TCATGGGAGC CGTCGTCCAG GCGCGCGAGG GAACACAGGT CTTCTCCGCC
GCGGCCGACG AGGTGCGGGC CGAACGCCGC CTCTCCCTGG TCCACCTGGG CCGATTCGAG
AGCACCGGAA CCCGCGACGA CGAGGTCGGC GCCGCCCTCG ACGAGCAGCG CGAGGCCACC
GACACCGCCA TGGCCGAGGC CGTCGCGTTC GCCGAGCGAC TGCGCGGAAC CCGCGACGAC
GAGGTCGAAC GCAGCGCGAC CGACCTGGTG GACAGCGCCG GGATCCTCGA CGAGGTCCGC
GCGGACGTGG ACGCGCACGA CCTCGGGCAG GAGGGGACCA TCCTGCGCTA CGGCGAGGTC
CTGGCGGGCA CCACCGGCGC CATCACCGCG CTCGTGCACA CCACCGACGG CGGGGAGAAC
CTCACCGACG CCGTCCTGAC CAGCGAGCTC ATGGGCGCGT CCTCCGCCTA CTCCACCGCC
GACGCGCTGC TCGCGGGAGC CATCGCCCGG GGCGAGATGA GCTACGAGGA GACGGCGCAC
TTCACCTACC TCACCGCCGC CTACCGCGAC ACCCTGGAGC CGGCCAGCGC CGCGATGCAC
CCGAGCGTGG CCGCGCGCTA CGAGGAGCTG GTGTCCCTGC CCGCCTGGTC CCGGGCGGAG
GAGCTGAGCC GTCGGGTGGT CACCCGCCAG CCGCTCGCCG AACCGGACGA ACCCGGGCAG
GGGACCACGG GCTCGGACTG GAACGCCGAC GTCGGCATCA GGGCGCAGTC GTGGGACGAC
TCCTCCGCAG AGGCCGCCCT GGCGCTACAG GACCTGGCCG GGCTCCAGGC CCGGCGCACT
ATCGACCTGG CCTGGAGCGC CGCACTGCTG CGCGTCTTCC TGGGCGTGGC GGCGGCCGCG
CTCACCCTGG CGGGCGGCGC GGTGGCCATC GCCGTGGTCG GCCGCTCCTC GCGCCGCCTC
ACCGATCGCC TGACCCACCT GCGCGAACAG ATCCTGGACC GCGACGGCGA CCTGCCCGAC
ATCGTCGACC GCGCCCAGCG CGGCGAGAAG GTCGACGTCC AGGAGGAGCT GCCGCCGCTG
GACGACTGGG GGGACGACGA GATCGGCCAG GTCGCCGAGG CCTTCGACGC CGCCCAGCTC
ACCGCCGTGG AGTCGGCCGT GCTCCAGGCC GAGATCCGCC GGGGGGCCAA CCGCGCCTTC
CTGGGCATCG CCTTCCGCAA CCAGGCCCTG GTCCAGCGCC AGCTCCGCCT GCTCGACGAG
ATCGAGTACC ACGAACAGGA CCCCGAGGCG CTGCGCCGCC TGTTCCGCCT GGACCACCTG
GCCACCCGCG CGCGCCGCTA CTCCGACAAC CTCATCATCC TCGGCGGGGG GCAGTCGGCC
CGCCGCTGGC GCCAGCCCCG CCCGCTCGTG GACGTGCTGC GCGCCGCCAT CGCCGAGACC
GAGGACTTCG AACGCGTCCG CCTGACCTCG GCGCCCCGCG TGCTCATGCA CGGGCAGGTG
GTCGCCGACG TGGTGCACCT GCTCGCCGAG CTGGTCGAGA ACGCCACGCA GTTCTCCCCG
GCGGGAACGC CCGTGGACGT CGGCTGCACC CCGGCCGCGG AGGGGCTGGT GGTGGAGATC
GAGGACCGGG GCCTGGGCAT GTCCGAGCGC GGCTACGCGG AGGCCGAGCG CACGCTCACC
CAGCCCCCCG AGTTCGACGT CATGGCCCTG CCCGAGGACC CCCGGCTGGG CCTGTTCGTG
GTGTCCCGCC TGGCGGGGCG GCACGGGGTC CGTGTCTGGC TGCGGCCCTC TCCCTACGGG
GGCACCCGGG CGACCGTGCT CATCCCCGCC TCGCTGCTGG AGCCGGTCGA CAACCTCGTG
ACGGTCGCGG GCACGCCCAC GGGCCGCCCG GCGGCGAACG GACGGGAGGC CCGCGTTCCG
GCCGGGTGGG GGGCGCCTCC CGCGAGGGCC GCCCTCACCA CGCGCGGCCA CGGCGCGGAC
GGCACCGTCC ACGGCCGGAC CGGCCCGCAG GCGCCCGTCA CACCTTCGGG GCAGACAGGT
CCGCAGCTCT TCGCGCCGGG GCCGGACCGA ACCGGTCCGC AGGTTCCCGT GCCACCTCTC
GGCCAGAGCG GCCCCCAGCC CTCGGTACCG CGATCGGACC GGGCCGAACC ACGGGTTCCT
GTGCCGCCTT CCGGGCAGAG CGGTCCACAG CCCCTGGCAC AGCGGTACCG GACCGGTCCG
CAGGTTCCCG TGCCGCCTCT CGGCCAGAGC GGCCCCCAGC CCTCGGTACC GCGATCGGAC
CGGGCCGAAC CACGGGTTCC TGTGCCGCCT TCCGGGCAGA GCGGTCCACA GCCCCTGGCA
CAGCGGTACC GGACCGGTCC GCAGGTTCCC GTGCCGCCTC TCGGCCAGAG CGGCCCCCAG
CCCTCGGTGC CACGGGCGGA CCGGGCCGAA CCACGGGTCC CCGTGCACGC GGGCCAGACA
GGTCCGCAGC CCTCCGCGGT CTCCGCCGGG CAGACCGGCC CGCAGGTCCC CGTGCCGCCT
TCCGGACAGA CCGGACCGCA GCCCTTCTCC CCGCAGCCGG ACCGGCCCGG CCCGCACGGC
GTCGTCCTCG CGCACACCGG ACCGCAGAGG TCCGTTCCCG CCTCCGGCGG TCCCGGCGGC
ACCGCTCCGC CGGGGGAGCA CCTCCTGCCC GACCCGCACG GCCACCCGCT CGCACAGGAA
CCCGTCCCCG AACCGTGGCA CGACGGCGCG CTGCACGCAC CCGCCCCCAC ACAGCAGAGG
GGGAACGGTG AGAGCTGA
 
Protein sequence
MRRTRDNAPT IRRQLTRIVL IPSLCFLALW LVVAAIGTIR AVQFMGAVVQ AREGTQVFSA 
AADEVRAERR LSLVHLGRFE STGTRDDEVG AALDEQREAT DTAMAEAVAF AERLRGTRDD
EVERSATDLV DSAGILDEVR ADVDAHDLGQ EGTILRYGEV LAGTTGAITA LVHTTDGGEN
LTDAVLTSEL MGASSAYSTA DALLAGAIAR GEMSYEETAH FTYLTAAYRD TLEPASAAMH
PSVAARYEEL VSLPAWSRAE ELSRRVVTRQ PLAEPDEPGQ GTTGSDWNAD VGIRAQSWDD
SSAEAALALQ DLAGLQARRT IDLAWSAALL RVFLGVAAAA LTLAGGAVAI AVVGRSSRRL
TDRLTHLREQ ILDRDGDLPD IVDRAQRGEK VDVQEELPPL DDWGDDEIGQ VAEAFDAAQL
TAVESAVLQA EIRRGANRAF LGIAFRNQAL VQRQLRLLDE IEYHEQDPEA LRRLFRLDHL
ATRARRYSDN LIILGGGQSA RRWRQPRPLV DVLRAAIAET EDFERVRLTS APRVLMHGQV
VADVVHLLAE LVENATQFSP AGTPVDVGCT PAAEGLVVEI EDRGLGMSER GYAEAERTLT
QPPEFDVMAL PEDPRLGLFV VSRLAGRHGV RVWLRPSPYG GTRATVLIPA SLLEPVDNLV
TVAGTPTGRP AANGREARVP AGWGAPPARA ALTTRGHGAD GTVHGRTGPQ APVTPSGQTG
PQLFAPGPDR TGPQVPVPPL GQSGPQPSVP RSDRAEPRVP VPPSGQSGPQ PLAQRYRTGP
QVPVPPLGQS GPQPSVPRSD RAEPRVPVPP SGQSGPQPLA QRYRTGPQVP VPPLGQSGPQ
PSVPRADRAE PRVPVHAGQT GPQPSAVSAG QTGPQVPVPP SGQTGPQPFS PQPDRPGPHG
VVLAHTGPQR SVPASGGPGG TAPPGEHLLP DPHGHPLAQE PVPEPWHDGA LHAPAPTQQR
GNGES