Gene Ndas_4253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4253 
Symbol 
ID9248127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5070763 
End bp5072457 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content76% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003682148 
Protein GI297563174 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.452469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.374196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAGA TCCGGAGAGA ACCGCGCGGG TGGGCCCGTC CGGCCTTCGA GGCCGCGCTG 
TGGGCCCTTC TGGGCGCGCT GATCGTGATC GAGCTCTCCG GGGTCCGCGA CCGGCCGCCG
GTCCTCGAAG CGTGCCTCCT GCTGGCCGCC CTGGCCGTCG CGGCGGCCCT GTCCCGTCCC
CTGCCGTTCG TCTCCCTGGG CGTCATGCTC AGCGGGGCGG TCCTGCAGAC GTCCGCCTCG
GGGTTCCTGG TGGTGGAGAC CACCGTGATC TCGCCCTGGC TGGCCGTGGC TGTGCTGGCG
CTGCTCACCG GCCTGCGCTC GGAACGGGTG CGCCCGGTGG TGACGATGGT GGTCGTGGCG
TGCTCGTTCA CCCTCGTCCT GCACCTGTCG GCGGGGCTCG TGTTCGGTGC GGGCGTCCGC
GCGTTCCTGT CCGACTCCGT GGACTGGCTG GGCGTCGTCC TCGTCATCGC GCTGAGCACG
CTCACCCCGT GGCTGTTCGG CCGCTACCAG CGCCTGCGCC GCCGGGTGTG GCGGGGCGGC
TGGGAGATCG CCGAACGCAT GGAGCGGACC CGCGCGGCCG AGGCCGACCG CGCCCGTCTG
CGGGAGCGCG CCCGGATCGC GACCCGGATG CACGACTCCC TCGGCCACGA CCTCGCGCTG
ATCGCGGTGC GCGCGGCGGC CCTGGAGATG ACCGCTCTTG AGGACTCCGA GCAGGGCCGG
GCAGCCGGTG AGCTGCGCAC GGCGGCGCAC GAGGCCAACC TGCGGCTGCG GGAGATCATC
GGGGTGCTGC GCGAGGACGA CGGTGGCGAC GGCCACGGTG AAGCCGCTTC CGGACCGGGC
GCGGGCGGTC CGAAGGCAGG CGCGCTGAAC ACGGACGATC CGGGCACGGG CGAGTCGGTG
GCGGCCCTGG TGCAGCGCGC CTCGGACGCC GGTATGCGGG TCCGGCTGCT GCGGGAGGGC
CCCGATCCGG ATCCCGCCGC CCCGGGTGGC GGCGCCGTGC ACCGGGTGGT CCAGGAGGCG
CTGACCAACG CGGCCAAGTA CGCGCCGGAG GCGGAGGTGA CCGTGCGGGT GGTCCGCGAG
TCCGACCGGA CCCGGGTGTC GGTCAGCGAC ACCGGACCGC CCGGGGCCGT GCGGACGGTC
CTGCCCGCGC GGAGCGGGGG CGGGTCCGGC CTGGCCGGGC TGCGTGCGCT GGTGGAGGAG
CTCGACGGGA GCTTCGCCGT GGGCCGGGGG GAGGGCGCGG GGTTCACCGT CCGGGCGACC
GTGCCCGACC CCGGCGCCGA GGGCGGGGCG GAGGAACTCG GGGAGTCGGA GACGCACCGC
GCTCACAGCG AGGCCCGCGC CCGCGCGCGG CGCCGACTGG TGGCCGCCGT GGCCGTTCCG
GCCGCGCTGG GCCTGGGGCT GGCGGCGGTC GGCCTGGGCC TGCTGTCCTG GGTGGGCGTC
AACACCGTCC TGTCTCCGGA GCGGTACGGG CGGTTGACCG TGGGGGACGA CCGGGAGCGG
GTGGAGGAGG TGCTCCCCCG GTTCTCCTAC CCCGAGCACT CGGTGTCGGC CCGGGCCTCC
GAACCGCCCG CGCCGCCCGG TGCCCGGTGC CTCTTCTACC TGTCCCGGTA CGAGAACGGG
CTGCCGCCGG TGTACCGGCT GTGCTTCGAG GACGGTGTCC TGGTCGCCAA GGACGAGCTC
CAGCGGACCG ACTGA
 
Protein sequence
MREIRREPRG WARPAFEAAL WALLGALIVI ELSGVRDRPP VLEACLLLAA LAVAAALSRP 
LPFVSLGVML SGAVLQTSAS GFLVVETTVI SPWLAVAVLA LLTGLRSERV RPVVTMVVVA
CSFTLVLHLS AGLVFGAGVR AFLSDSVDWL GVVLVIALST LTPWLFGRYQ RLRRRVWRGG
WEIAERMERT RAAEADRARL RERARIATRM HDSLGHDLAL IAVRAAALEM TALEDSEQGR
AAGELRTAAH EANLRLREII GVLREDDGGD GHGEAASGPG AGGPKAGALN TDDPGTGESV
AALVQRASDA GMRVRLLREG PDPDPAAPGG GAVHRVVQEA LTNAAKYAPE AEVTVRVVRE
SDRTRVSVSD TGPPGAVRTV LPARSGGGSG LAGLRALVEE LDGSFAVGRG EGAGFTVRAT
VPDPGAEGGA EELGESETHR AHSEARARAR RRLVAAVAVP AALGLGLAAV GLGLLSWVGV
NTVLSPERYG RLTVGDDRER VEEVLPRFSY PEHSVSARAS EPPAPPGARC LFYLSRYENG
LPPVYRLCFE DGVLVAKDEL QRTD