Gene Ndas_4783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4783 
Symbol 
ID9248666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5670142 
End bp5671356 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content76% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003682673 
Protein GI297563699 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.714313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAGGG GGAGGGGCGG CGGGGCCAGG GCCGGGCTCA CCCTGCGCGC CAGGCTGACC 
CTGGTCTACA CCGCCGTCTT CGCCGCGGGC GGGGCGGTGC TGCTCGGGGC CAACTACGCG
ATGGTCTCCG CCAGTCTGGA GGCCCGGGGC GTCGACCTGG CCGCGACCAT CGTCGCCTCG
GGGGACGGAG GGGAGCCCTC CATCGCCGTC GGGCCCGCCG AGCTCGTGGA GTCGGCGGGG
CCGTTCGATG AGGTGGAGAG GGCGTCCGGG CCGACGAGTC CGCTGGTCGA CTACCAGGAC
GGTGTGCTCT CCGACCTGCT GACCACGTCG GTCGCCGTGC TGGTGGCCGT CGCCGCCCTG
GCGGCCCTGG CCGGGTGGCT GATCGCGGGC AGGCCGATGC GCAGGCTGCA CGCGGTCACC
GAGACGGCCC GCCGCATCTC CGAGCGCGAC CTGCACAGCA GGCTGGCGCT GACCGGTCCC
GACGACGAGT TCCGGGAGCT CGGCGACACC TTCGACGGGA TGCTCTCCCG GCTGGAGCGG
GCCTTCGACG CGCAGCGGCG GTTCGTCGCC AACGCCTCAC ACGAGCTGCG CACCCCGCTG
GCGGTCCAGA AGGCCGCCGT GGAGGTACCG CTGTCGCAGG GCCGGGTCCC CGAGGACCTC
AGGCCCGCGT TCACCCGCGT GCTGGACTCG GTGGACCGCA GCGAACTCCT CATCGCGGGG
CTGCTCCTGC TCGCCCGCTC CGACCGCGGC CTCGCCCGCA CCGAGCCCGT GGACCTGGCC
GGGGCGGCGG AGAACGCATC GGCGCTGCTC GAACGCGACG CGCGGGAGGC CGGGGTCGAC
CTGCGGACCG ACCTCGCGGC CGCGGAGGTG GAGGGGGATC CGGTCCTGCT GGAGCACCTC
GTGCGCAACC TGGTGGACAA CGCGGTCCGC TACAACGTCC CGGGCGGGTG GGTCGAGGCG
CGGGTGCGCA CGGCGGCCGG TCGCACCGTG CTGAGGGTGG CCAACACGGG CGGGAACGTC
ACCGACCCCG AGAGCCTGTT CGAGCCCTTC CACCGGGGCG GCGACGCCCG GTTGCGCACC
GCGCGGCCCG GCAGCGGCCT GGGCCTGGCC ATCGTCCGCT CGATCGCCTC CGCCCACGGC
GCGCGGGCCG AGGCCCGCGC GCGCCCGGGC GGGGGACTGG TGGTGACGGT CGGGTTCCCG
GCCGCCGGGG GCTGA
 
Protein sequence
MGRGRGGGAR AGLTLRARLT LVYTAVFAAG GAVLLGANYA MVSASLEARG VDLAATIVAS 
GDGGEPSIAV GPAELVESAG PFDEVERASG PTSPLVDYQD GVLSDLLTTS VAVLVAVAAL
AALAGWLIAG RPMRRLHAVT ETARRISERD LHSRLALTGP DDEFRELGDT FDGMLSRLER
AFDAQRRFVA NASHELRTPL AVQKAAVEVP LSQGRVPEDL RPAFTRVLDS VDRSELLIAG
LLLLARSDRG LARTEPVDLA GAAENASALL ERDAREAGVD LRTDLAAAEV EGDPVLLEHL
VRNLVDNAVR YNVPGGWVEA RVRTAAGRTV LRVANTGGNV TDPESLFEPF HRGGDARLRT
ARPGSGLGLA IVRSIASAHG ARAEARARPG GGLVVTVGFP AAGG