Gene Ndas_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3946 
Symbol 
ID9247817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4718774 
End bp4720060 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content71% 
IMG OID 
Productputative CheA signal transduction histidine kinase 
Protein accessionYP_003681849 
Protein GI297562875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGCC GCTTGGTAAC ACCCACAGTG CTGACCGCCG GACTCGCGGC GGCGATGGTG 
AGCGCCGCGG TGGCGCCCGC CTCGGCTGAC GCCCAACCCC AACCGCCCGA GGCCCAGGCG
ATCGCCGAGG AACTCGGGAA CGACAACGTC TTCATCGACC CGAGCATCGA CGCGATCCCC
GAGGCCGAGC AGGGAATCCT GGAGTCGACC GCCGCCGAGG CGGACGTCCC CGTCTACTAC
GTGATCCTGC CCACCGACAG CATCTCCTCC CAGGCCGGTC TCGACGCCCT GATGAACCCC
GTCATGGACG AGGTCGGGGA CGGGGTCTAC GGCGTGTTCG CGGGCAGCCA GACCTTCCAG
GTGCTGTCCC CCAACGTCGA GGACACCGAG GCCATCCAGC AGCTCGCCGT CCGGGAGGGC
GGCGGCAACC AGGTCGACAC CCTCGTCGCC ATCCCCGACG CCGCCACCCA GGTCGAGGAG
GCCGAGGCTG CCGGGGCCAC CTCCGGGTTC GTGCTCCTCG GACTCCTGCT GGCGGTCGTC
GCCGCGGGCG CGTGGTTCGT CCACCGCAGC CGCAAGAAGC GCGAGGCCGA GAAGGCCAAG
CAGCTGGAGG AGATCAGGCA GATGGCCACC GAGGACGTCG TGCGCCTCGG TGAGGACGTC
GCCCGGCTGG AGATCGACGT CTCCAAGGTC GACGACGCCA CCCGCAACGA CTACTCCCAG
GCCATGGACG CCTACGACCA GGCCAAGGCC CAGCTGGACA ACATCCGCGA GCCCGAGCAG
GTCAGGCTGG TCACCAGTGC CCTGGAGGAC GGCCGTTACT ACATGACCGC CACCCGGGCC
CGCCTCAACG GCGACCCGGT GCCCGAGCGG CGCGGCCCCT GCTTCTTCAA CCCGCAGCAC
GGCCCGTCCG TGGAGGACGT GACCTGGGCC CCGCCCGGCG GCGCGCCCCG CGAGGTCACC
GCCTGCGCCG ACTGCGCGCG TGCGGTGCGC ACCGGCGGCC AGCCCGACGT CCGCCTGGTC
GAGGTGGACG GCGAGCGCCG CCCGTACTAC GACGCCGGTC CGGCCTACTC GCCCTACGCG
AGCGGCTACT TCGGCATGAA CATGATGATG GGCATGTTCA CCGGCATGAT GATGGGCTCC
ATGATGGGGT CGATGATGGG CATGGGTATG GGCATGGGCG CCGGTGAGGT CGGCGCCGGA
GAGGACTTCG GAGGCGGGGA CTTCGGGGGC GGCGACTTCG GGGGCGGCGA CTTCGGCGGA
GGCGACTTCG GCGGCTTCGA CTTCTGA
 
Protein sequence
MLRRLVTPTV LTAGLAAAMV SAAVAPASAD AQPQPPEAQA IAEELGNDNV FIDPSIDAIP 
EAEQGILEST AAEADVPVYY VILPTDSISS QAGLDALMNP VMDEVGDGVY GVFAGSQTFQ
VLSPNVEDTE AIQQLAVREG GGNQVDTLVA IPDAATQVEE AEAAGATSGF VLLGLLLAVV
AAGAWFVHRS RKKREAEKAK QLEEIRQMAT EDVVRLGEDV ARLEIDVSKV DDATRNDYSQ
AMDAYDQAKA QLDNIREPEQ VRLVTSALED GRYYMTATRA RLNGDPVPER RGPCFFNPQH
GPSVEDVTWA PPGGAPREVT ACADCARAVR TGGQPDVRLV EVDGERRPYY DAGPAYSPYA
SGYFGMNMMM GMFTGMMMGS MMGSMMGMGM GMGAGEVGAG EDFGGGDFGG GDFGGGDFGG
GDFGGFDF