Gene Ndas_5197 details


Gene Information

Locus tag: Ndas_5197
Symbol:
ID: 9249090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 340507
End bp: 341982
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003683083
Protein GI: 297564110
COG category:
COG ID:
TIGRFAM ID:
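
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention explicitly). The function and variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 340507
end_bp = 341982
gene_length_bp = 1476
protein_length_aa = 491


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, both comparisons hold for this record: the span covers 1476 bp, which is 492 codons, one of which is the stop.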


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.449609
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.98893
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCCG CGCGCACCGC CGTCCCGAGG TGGCGCCGGA CACTGCTCCA ACCGGGCGAC 
TGGCGGCTGG GCACCCGCTT CGCGGTGATC TTCGCGCTGG TCGCCACCGT CGTCATCGCC
CTGGTCGGCA CCCTCGCCTA CACCACCGCC GCCGCGCTCA TCCGCTCCGA CGCCCGCACC
GAGTTCGAGA GCACCGTCAC CGCCCTGTCC GACCAGCTCG TCGACTACCA CCAGCAGGGG
CGGGGCGGCT CCGCCGGTCC CGGACCGTTC CTGCCCAGCG ACAAGTTCCA GCTCCAGCTC
CTCGAACCCG ACGGCAGCAG GACGGTCAAC ATCGCCGACC CGTCCGAGAT CATCCTGTTC
GCACCGTCGC AGAGGGACCT GGAGGTCGCC GAGGAGGCCC GGCCCGGCAT CGTCGACATG
CGCGAGCAGG CCATCGGCGG GCAGGAGTAC CGGCTGGCCA CGGTCTCCCT CGGCGACGGC
GCCGGGGCCC TCCAGCTCCT CCAGCGCCTG TCCCCGACCG AGCTGATGAT CGACCGGCTG
GCCACGCAGA TCCTGTGGGT GGGCCTGTTC GTCGCCCTGT GCGCGGCCGC CGCGGGCTGG
CTGGTGGGCC ACCGCACCAC CGGCCGCCTG GTGCGCCTCA CCGAGGCGGC CGAGTACGTC
AGCTCCACCG GGCGGCTCGA CCCCGTGGAC CCCGGCCGGA GCGGCGAGAG CCGCGAGGAG
GACGTCGGCC GCGACGAGGT CGGCCGCCTC ACCAGCGCGT TCAACGCCAT GCTCGCCCGG
CTGGCCCGTT CCAAGGACGA GCAGCGCCGC CTCGTCCAGG ACGCCGCGCA CGAACTCCGC
ACCCCGCTGA CCAGCCTGTA CACCAACGTG CAGGTGCTCG ACAGGGTGGA CCGGCTCAGC
CCGGAGGCGC GCGCCGGCCT CATCGAGGAC CTGCGCGGCG AGACCCGCGA ACTCACCGCC
CTGGTCAACG AACTGGTCGG CCTGGCCACC GGCGACCACG AGGACGAGCA GATGAGCGCC
GTCCCCCTCG CCGGGATCGC CGAGAAGGTC GCCAAGCGCA CCCGTCGCCG CACCGGCCGC
GACATCGTCG TGGACGCCGA CGACAGCGTC GTGTGGGGAC GCCCCGGCTC CCTGGAGCGC
GCCGTCTCCA ACCCGGTCGA GAACTCCGCC AAGTTCGACC CCGAGGGCAC CGCGCCCATC
GAGATCCGCG TGCGCGCTGG GACGGTGGAG GTCCTGGACC GGGGGCCCGG CATCGACCCG
GCCGAACTCG ACCACGTCTT CGAGCGCTTC TACAGGGCCG CCGTCGCCCG CGGCCTGCCC
GGTTCGGGGC TCGGCCTGTC CATGGTCAGG GAGATCGCGC AGGCGCACGG GGGCAGGGTG
TTCGCCCGCA ACAGGGAGGG CGGCGGCGCC GCCATCGGCT TCCACCTGCC GCTGTTCACA
CCGCCGCGGG ACGGGGAGCA GAAGGCGCGC GGGTGA
 
Protein sequence
MSAARTAVPR WRRTLLQPGD WRLGTRFAVI FALVATVVIA LVGTLAYTTA AALIRSDART 
EFESTVTALS DQLVDYHQQG RGGSAGPGPF LPSDKFQLQL LEPDGSRTVN IADPSEIILF
APSQRDLEVA EEARPGIVDM REQAIGGQEY RLATVSLGDG AGALQLLQRL SPTELMIDRL
ATQILWVGLF VALCAAAAGW LVGHRTTGRL VRLTEAAEYV SSTGRLDPVD PGRSGESREE
DVGRDEVGRL TSAFNAMLAR LARSKDEQRR LVQDAAHELR TPLTSLYTNV QVLDRVDRLS
PEARAGLIED LRGETRELTA LVNELVGLAT GDHEDEQMSA VPLAGIAEKV AKRTRRRTGR
DIVVDADDSV VWGRPGSLER AVSNPVENSA KFDPEGTAPI EIRVRAGTVE VLDRGPGIDP
AELDHVFERF YRAAVARGLP GSGLGLSMVR EIAQAHGGRV FARNREGGGA AIGFHLPLFT
PPRDGEQKAR G
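
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code) and treats an initial ATG, GTG, or TTG as methionine, which is only a subset of the start codons that table permits. The helper name `translate_cds` is hypothetical and not part of any library. The example translates only the first and last lines of the gene sequence rather than the whole record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 shares these
# amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 start codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGTCCGCCG CGCGCACCGC CGTCCCGAGG TGGCGCCGGA CACTGCTCCA ACCGGGCGAC"
last_line = "CCGCCGCGGG ACGGGGAGCA GAAGGCGCGC GGGTGA"

leading_peptide = translate_cds(first_line)
trailing_peptide = translate_cds(last_line)
print(leading_peptide)   # MSAARTAVPRWRRTLLQPGD
print(trailing_peptide)  # PPRDGEQKARG
```

The two peptides correspond to the first 20 and last 11 residues of the protein sequence listed above. The last line of the gene sequence happens to begin in frame, which is why it can be translated on its own.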