Gene Ndas_5077 details

Gene Information

Locus tag: Ndas_5077
Symbol:
ID: 9248966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 222259
End bp: 223935
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: signal transduction histidine kinase regulating citrate/malate metabolism
Protein accession: YP_003682964
Protein GI: 297563991
COG category:
COG ID:
TIGRFAM ID:
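
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the gene record above.
START_BP = 222259
END_BP = 223935
GENE_LENGTH_BP = 1677
PROTEIN_LENGTH_AA = 558


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the record if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the two checks agree with the record: the span from 222259 to 223935 covers 1677 bp, and 1677 / 3 gives 559 codons, one of which is the stop codon.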


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.669392
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGT ACCCCGCGCG CTTTCGACTG TCGCTCACCG GCTCCCTGTT CATCGGCTAC 
GTGCTCCTGC TGGCACTGGC CCTGGCCGCC GTGGGCGCGC TGTGGGCCGT CCACATGGAC
CGGGCCACCG ACCGCCACTA CGCCGAGCGG GTCCTGAGCC TGGCCCGGTC CGTGGCCGTC
ATGCCCGAGG TCGTCACGGG CCTGCAGTCC GCCGACCCGG CGGCCGAACT CGCCCCCCTG
GCCGACCGGA TCGACTCCGC CACCAGCACC GAGTTCGTGG TCATCGCCTC CCCGGAGGGG
ATCCGCTACT CGCACCCCGA CGACGAACTC ATCGGCCGCA CCGTGTCCAC CCCGCCCGGA
CCCGCCGCGC AGGGGCGGGA GTGGGCCGGC GTGCAGGAGG GGACGCGGGG ACGCACCGTG
CGCGCCAAGG TCCCGGTGTT CTCCGGCGGC GGCTCGGTCA ACGGCGGCGA CGCGCGCGGC
GAGGTGGTCG GCTACGTCTC CCTGGGCATC CTCGCCTCCA GCGCCGCCAC CGAGGCCAGG
GCCGCGGTCC CCGCCATACT GGGCACGGTG GCCGTGGTGC TGGTCCTGGG CGTGGCCGGC
GCGTGGGCAC TCTCCCGCCA GGTCCGCACC AAGACCCACG GACTCGAACC CGCCGACATC
ACCTCCCTGC TGGAGAGCCG CGAGGCCCTG CTGTACGCGG TCCGCGAGGG CGTGCTCGCC
GTGGACGGCT CGGGCCGCCT CGTCCTGGCC AACCCGCCCG TCCGGGAGAT GCTCGGCCTG
CCCGAGGACG CCGAGGGCCG GGGCCTGGAC GAACTCGGCC TGTCCGAGCG CGTCCGCGAT
ATCGTCTCCG GCGCCGACCC CGGCGACGAC CGCCTCCTCC TGGCGGGGCA CCGCATCCTG
GTCGCCAACC GGATGCCGGT CCACGTGCGC GGCCAGGACG CCGGGGCGGT CGTCACCTTC
CGCGACCGCA CCGAACTGGA CCGGCTCACC GGCGAGCTCG ACGGCGCGCG CACGGTCACC
CGCGGCCTGC GCGCCCAGAC CCACGAGTTC GCCAACCGGG TGCATACCAT CGCCGGAATG
CTCGAACTCG GCGCCCACGA GGAGGCCCGC GCCTACCTCG CCGACCTGTC CGCGACGCAC
AGCCGCACCA GCGCGGACAT CTCCCGGCAC GTCGGCGACT CCGCGCTGGC CGCGCTGACC
ATCGCCAAGT CCGCGCAGGC CTCCGAGCTG GGTGTGGACC TGCGGCTGTC CCCCCTCACC
AGCGTCCCCG CGCTGGACAG GGAGGTGCGC TCCGACGCGC TGCTCGTCCT CGGCAACCTG
GTCGACAACG CGCTCGACGC GGTGGCCTCG GCCCCGCACG GCTGGGTGGA GCTGATGGTG
CGGCTGCACC GGGCCGAGGG CACCGACCTG CCCCACGACC TGCTGGAGAT CCGGGTGACC
GACTCCGGAC ACGGGGTGGC CGACGACGTG GCGGAGGAGA TCTTCCGGCT CGGGTTCACC
ACCAAGGCGT CCCGGGACGG CGGCACGCGC GGGCTGGGCC TGGCGCTGGT CAAGCAGGTC
TGCGAGGGAA GAGGGGGAAG CGTGGAGATG GAGGCGCCCG ACGCCGACGA GGGCGCGGTG
TTCACCGCCT GCCTGCCCCT GCCGGGGGCG CGGGCGCCGC AGGGGGCGGC CCGATGA
 
Protein sequence
MITYPARFRL SLTGSLFIGY VLLLALALAA VGALWAVHMD RATDRHYAER VLSLARSVAV 
MPEVVTGLQS ADPAAELAPL ADRIDSATST EFVVIASPEG IRYSHPDDEL IGRTVSTPPG
PAAQGREWAG VQEGTRGRTV RAKVPVFSGG GSVNGGDARG EVVGYVSLGI LASSAATEAR
AAVPAILGTV AVVLVLGVAG AWALSRQVRT KTHGLEPADI TSLLESREAL LYAVREGVLA
VDGSGRLVLA NPPVREMLGL PEDAEGRGLD ELGLSERVRD IVSGADPGDD RLLLAGHRIL
VANRMPVHVR GQDAGAVVTF RDRTELDRLT GELDGARTVT RGLRAQTHEF ANRVHTIAGM
LELGAHEEAR AYLADLSATH SRTSADISRH VGDSALAALT IAKSAQASEL GVDLRLSPLT
SVPALDREVR SDALLVLGNL VDNALDAVAS APHGWVELMV RLHRAEGTDL PHDLLEIRVT
DSGHGVADDV AEEIFRLGFT TKASRDGGTR GLGLALVKQV CEGRGGSVEM EAPDADEGAV
FTACLPLPGA RAPQGAAR