Gene Ndas_3847 details

Gene Information

Locus tag: Ndas_3847
Symbol:
ID: 9247718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4615200
End bp: 4616486
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003681750
Protein GI: 297562776
COG category:
COG ID:
TIGRFAM ID:
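
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length_bp = gene_length(4615200, 4616486)
print(length_bp, protein_length(length_bp))  # prints: 1287 428
```

Under these assumptions the result agrees with the listed Gene Length (1287 bp) and Protein Length (428 aa).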


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGGC GGATGGTTTT CTCCACCCTG GTGGTGACCG TCATCGCCGT CATGCTGCTC 
GGGCTGCCCC TGGGCGCGCT CACGTACAAG CTGGTGTACG ACGAGAGCAC CCGCCAGCTC
CAGGGCGAGG CGGAGCTGAT CGGCGCCGAG AGCGACACCA TGCTGGAGCT GCACGGCCAA
CTCGACCTGG GCGAGTTCGA CCGCGACCAT CCCCACCGCT TCATCCGGAT CACCCCGGCG
GAGGAGCCCA CCTCGGTGAC CGCGGGCGAT CCCGCGCTCG ATCCCGAGGC GCCCGACTCC
CCGAGCATGC TCAAGGCGAC CGCGACCACC GGCCGGGGCA CGCGCGTCGA GGTGTGGATG
AGCGCCGAGA GCGTGCAGCA GAGCGTGGTC CGCGCCTGGA TGGGGATCGC GTCGCTGTCC
CTGCTGGCCA TCGGCGTCGC CGTGGGCCTG TCGATGTTCC AGGCCCGCAG GCTGACCCTG
CCCCTGCTCG ACCTGGCGGC CACCGCGGAG CGCCTGGGCT CGGGCGTGAC CACGCCGTGG
GGCCACCGGT ACGGGATACC GGAGGCCGAC CGGGTGGCGG AGGTCCTGGA CCGCAGCGCC
GAGCGCATCG CCGGGCTGAT CGCCACCGAG CGCCACTTCG CGACCGACGC CTCGCACCAG
CTGCGCACGC CGCTGACCGC GCTGACGATG CGCCTGGAGG AGATCCTGGC CGAGGCGGAC
AACCCCGAGG TGGTCCGCGA GGAGGGCGAG GCCGCCCTGG CCCAGACCGA GCGCCTGGTG
GAGACCGTGG AGAGCCTGCT GGGACGGGCC CGCAAGAGCC AGAACCCCGA GGTGGAGGCG
GTGGAGATCG ACCCCGTCCT GCACCACCTC CAAGAGGAGT GGCAGCCGGT CTTCCAGTCC
GCGCAGCGCA GGCTGCTGGT CACCGGCGAC CCGGGGCTGA CCGCGATGAC CGTCTCCGCC
GACCTGGCGC AGATCGTCGC GACCCTGGTG GAGAACGCCT ACAAGCACGG CGCGGGGACG
GTCACCATCC GGCGGCTGGA CACCGGGCAG TCGGTGCGCA TCGAGGTGAG CGACGAGGGC
GAGGGCGTGC CCGAGCACCT GTCGGGCCGG ATCTTCGAGC GCGAGGTGAG CGGCGGGGGC
GGGACCGGGC TGGGCCTGGC CCTGGCACGG CACATCGCCG AGTCCGAGGG GGCCCGGATC
GAGCTGGTGC AGACCAAGCC GACGACCTTC GCGCTGTTCC TGCCCGCGGG CGCGGGGGGC
CTGTCCAAGA TGACGGGCCC GGTGTAG
 
Protein sequence
MRRRMVFSTL VVTVIAVMLL GLPLGALTYK LVYDESTRQL QGEAELIGAE SDTMLELHGQ 
LDLGEFDRDH PHRFIRITPA EEPTSVTAGD PALDPEAPDS PSMLKATATT GRGTRVEVWM
SAESVQQSVV RAWMGIASLS LLAIGVAVGL SMFQARRLTL PLLDLAATAE RLGSGVTTPW
GHRYGIPEAD RVAEVLDRSA ERIAGLIATE RHFATDASHQ LRTPLTALTM RLEEILAEAD
NPEVVREEGE AALAQTERLV ETVESLLGRA RKSQNPEVEA VEIDPVLHHL QEEWQPVFQS
AQRRLLVTGD PGLTAMTVSA DLAQIVATLV ENAYKHGAGT VTIRRLDTGQ SVRIEVSDEG
EGVPEHLSGR IFEREVSGGG GTGLGLALAR HIAESEGARI ELVQTKPTTF ALFLPAGAGG
LSKMTGPV
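
The gene and protein sequences can be checked against each other, and the GC content recomputed, with a few lines of standard-library Python. This is a minimal sketch, not the method the database itself uses. It relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard code (the two differ in which codons may act as start codons), and it does not handle alternative start codons. The names `translate`, `gc_percent`, and `clean` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 shares these
# assignments with the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as listed in this record.
first_row = "ATGCGCAGGC GGATGGTTTT CTCCACCCTG GTGGTGACCG TCATCGCCGT CATGCTGCTC"
print(translate(first_row))  # prints: MRRRMVFSTLVVTVIAVMLL
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the listed GC content.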