Gene Ndas_5480 details

Gene Information

Locus tag: Ndas_5480
Symbol:
ID: 9249383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 674686
End bp: 675900
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003683365
Protein GI: 297564392
COG category:
COG ID:
TIGRFAM ID:
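
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 674686
end_bp = 675900
gene_length_bp = 1215
protein_length_aa = 404

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1215 bp corresponds to 405 codons, one of which is the stop codon.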


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.500623
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCGATC ACGGCACGCC CTCCGCCGAG TACACGGCCT CCGACCGGGA GGGCGACCGC 
AGGGTCCGCT TCGCCCGGCG CATCGTCATC GTCTCGGTGG GCTTCACCGC GCTGCTCGTC
GTGGGGATGC CCCTGGTGGA CGTCGTCACG GCCGCGGCCT CGGACTGGCC CCTGTGGCGC
TCCCTGACCG GGGTGGTCGC CTCCGTGCCG ATCGCCGTGC TGCTGATCGT GATGCTCCGG
GCCAGGCTGG ACGGTCGGCG CAAGCCGGAC CCGCGCGCCT ACTGGACCTC GCTCGGCCTG
ATCGTCGTGG TCGTCCTGTG CCTCCAGGAG CCGATCAACA CCGTGTTCTT CCTGGCCGCG
TGGTGGGGGA CCGGGGTCTT CCTCGCCCCG CGCAGGCGCA GCGCGTACGT CACAGCCGCC
CTGCTGGCGC TGCCCTGGCT CGCGGTCCCG TTCTACGGGT TCGAGACGCA GTTCCAGCCG
CTCCTCTACC TGCTGGTGTG GCTGGTGATG GTCTTCAGCG GGCTGATGTT CGCCGGGGCC
TCGCTCTCCA TGATCTGGCT GTGGGACATC AGCCGCGAGG CCGTCGCGGG ACAGCGGGCC
CGCGCCCAGC TCGCGGTGAG CGAGGAGCGG CTCCGCTTCG CCCGCGACAT GCACGACCTG
CTCGGGCACA GCCTCTCCGC GCTCGCGGTC AAGGCCCAGC TGGCCGGACG GCTCGTGGAG
CGGGACCCCG AGCGGGCGGG CGCGGAGATG GCGGAGGTGC AGGTGCTCGC CAGACAGGCG
CTCCAACAGG TGAGGTCGGC GGTCAGCGGC TACCGGGAGG TCGACCTGGC GGGCGAGACG
GAGGCCGTCC GCGCGGTGCT CGACGCGGGG GGAACCAGGG CGGTCGCCAC CGGGCTGGAG
GGGCTGGACC TGCCGCCGCG GACCGCCGCG CTCGCCGCCT GGGTGGTGCG CGAGGGCGGC
ACGAACGTCC TGCGGCACAG CGACGCCAAC GAGTGCCAGA TCAGCTTCAC CCTCGCCCGC
GACAGCGCCG TGGGCCCCCG GACGCTCGTG GTCGAGGTGT TCAACGACCG CGCCCGGGGC
GGCGGCCAGG ACGGGAGGGA GTCGGGCAAC GGGCTCGCCG GTCTGTCCGA GCGGGTCGCC
ATGGGCGGCG GCACCCTCTC CGCGGCCCGC ACTCCGGAGG GGGGCTTCCT GCTCCGCGCC
GTCCTGCCGC TCTGA
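
A GC fraction like the one reported in this record can be computed directly from a nucleotide sequence. The following is a minimal sketch, not the database's own method. The helper name `gc_content` is hypothetical, and the example runs on only the first two 10-base blocks of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence listed in this record.
fragment = "TTGAGCGATC ACGGCACGCC"
print(f"GC fraction of fragment: {gc_content(fragment):.2f}")
```

Passing the full gene sequence, with or without the spaces between blocks, to the same function gives the whole-gene value for comparison with the GC content field.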
 
Protein sequence
MSDHGTPSAE YTASDREGDR RVRFARRIVI VSVGFTALLV VGMPLVDVVT AAASDWPLWR 
SLTGVVASVP IAVLLIVMLR ARLDGRRKPD PRAYWTSLGL IVVVVLCLQE PINTVFFLAA
WWGTGVFLAP RRRSAYVTAA LLALPWLAVP FYGFETQFQP LLYLLVWLVM VFSGLMFAGA
SLSMIWLWDI SREAVAGQRA RAQLAVSEER LRFARDMHDL LGHSLSALAV KAQLAGRLVE
RDPERAGAEM AEVQVLARQA LQQVRSAVSG YREVDLAGET EAVRAVLDAG GTRAVATGLE
GLDLPPRTAA LAAWVVREGG TNVLRHSDAN ECQISFTLAR DSAVGPRTLV VEVFNDRARG
GGQDGRESGN GLAGLSERVA MGGGTLSAAR TPEGGFLLRA VLPL
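
The gene begins with TTG while the protein begins with M, which is consistent with translation table 11 allowing alternative start codons that are read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified illustration under stated assumptions: the codon-to-amino-acid assignments are those of the standard code (which table 11 shares), the first codon is always emitted as M without checking that it is a valid start codon, and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[bases[index:index + 3]]
        if amino_acid == "*":
            break
        # Simplifying assumption: the initiator codon is always read as Met.
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGAGCGATC ACGGCACGCC CTCCGCCGAG"))  # MSDHGTPSAE
```

The output matches the first ten residues of the protein sequence listed in this record.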