Gene Ndas_1895 details

Gene Information

Locus tag: Ndas_1895
Symbol:
ID: 9245745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2312487
End bp: 2314211
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: putative PAS/PAC sensor protein
Protein accession: YP_003679829
Protein GI: 297560855
COG category:
COG ID:
TIGRFAM ID:
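
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2312487, 2314211)
    print(length, "bp")                  # value listed as Gene Length
    print(protein_length(length), "aa")  # value listed as Protein Length
```

Under these assumptions the record is internally consistent: 2314211 - 2312487 + 1 gives 1725 bp, and 1725 / 3 - 1 gives 574 aa.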


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.757129
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGACGAGG ATTCCGGTAC GTACAGCGAT TCGACCGGTG TGTCAGGTCA ACACCGGCTC 
CTCGCGGAGG CGCTGGACAG CCTGAGCGCG GGTGTGTACG CCGTCGACGA GAGGGAGCGG
ATCGTCGCGG TCAACGCCAC CGCCCTGCGA CTGCTGGCCC GTTCGGCGGA CGAGACGCTC
GGGCAGGACC TCCGGTCGTT GCACCGGGAC GCCCGGGGGC AGGCGGTCGT CAGGGAGGCA
CGCGACGCCT CCCAGGGGGC GCCGTCCGGT TCCCTCTCCC GCGCGGGGGA GTCCTGGTTC
CAGCGCGGCG ACGGCACGCT CCTCCCCGTG TCGTGGTCCG CCGTGCCCTG TAAGCCCGAC
GGTTCCCAGG TCACGGAGCT GGTCTTCTTC CAGGCCGTCG AGCAGGACGA GGGGACCCCC
GGGCGCTCCA CCCCTTCCCG ACGGACACTG TCGGAGACGG AGCGGCTGGC CCTGCTGGCC
GACACCACCG CACACCTGAT CAACAACGTC GACGTGGAGA AGTCCCTGCT CCGGGTGGTG
GAGCTGATGC TTCCACGACT CGCGGACTGG GCGATCATCG ACCTGATCAC CGAAGGCGAC
GAGGTGTCCC GCTCCCTGGT GGTCCAGGCG GACCAGGGCA GGACCACGGT GCGCGAGGAC
CTCCAGGGAC CGATGCCCCC GGTGCCCCCG ACCTCCAGCA TGCCCCTGTC CAAAGCCCTG
CGAGGCGCCG CCTCCACCCT CGTCAAGCGT GAGATCTACT CCGGTCCGCC CGACACGGGT
ATCGCGGTGG AGCAGCGCAG GCTGTTCGAG GCGACGGGCA TCAACACCGC GGCCATCGCC
CCCATCCGCG GGCCCCGGGA GGTGCTGGGC GCCGTGACCC TGGGGCGCAC GGGCTCCCAG
CACCCCTTCG CCCGCGACGA TCTCGCCCTT CTGGAGGACA TCGCCCGACG CATCGGCCTG
GCCCTGGAGA ACGCGCGCCA CTACCAGCGC CAGCGCCAGG TCGCCGAGAC CATGCAGCGC
TACCTGCTGC CCCAGCTCCC CCGGCTGGCG GGAGCGGAGA TGACCGCCCG GTACCTGCCC
GCACCAGACG TCTCGCACGT GGGCGGTGAC TGGTACGACG CCTTCCCCCT GCCCCGCGGC
GACACGGCCC TGGTCATCGG CGACGTCGTC GGCCACGACC TGGACGCGGC GGCCGGGATG
GCCCAGCTCC GCAACATGCT CCGGGCCTAC ACCTGGGCAC AGGAGCAGTC CCCCCACCGC
ACGCTGGAGC GCATGGACCA GGCCCTGGAG CACATCAGCG ACGTCTACAT GGCGACCCTC
GTCCTGGCCC ACCTGACGGT CGACGAGGCC GGACGGTGGG AACTGCTGTG GTCGAGCGCC
GGCCACCCCC CGCCCCTGCT CGTCCACCAC GACGGCATCG CCCACTACCT GGAGGAGGGG
AGCGGGGTCC TGCTCGGCAC GGGGATGGCG CGGCCGCGCG CCGACGCGCG CATCGCCCTG
CCGCCCGGGT CCACGCTCGT GTTCTACACG GACGGCCTGG TCGAAGCCCG GGGGCAGTCA
CTGGACACGG GCCTCAGACG TATGCGCCAG CACGCGGCCT CCCTCGCGCA CCGCCCCCTG
AACTCCTTCG CCGACCAGCT GCTGGAGCGC GCACGGCCGA GAAGCAACGA CGACGACGCC
GCCCTGCTCG TCATCCGTAT TCCGGCGGAC GCGACCGACG GCTGA
 
Protein sequence
MDEDSGTYSD STGVSGQHRL LAEALDSLSA GVYAVDERER IVAVNATALR LLARSADETL 
GQDLRSLHRD ARGQAVVREA RDASQGAPSG SLSRAGESWF QRGDGTLLPV SWSAVPCKPD
GSQVTELVFF QAVEQDEGTP GRSTPSRRTL SETERLALLA DTTAHLINNV DVEKSLLRVV
ELMLPRLADW AIIDLITEGD EVSRSLVVQA DQGRTTVRED LQGPMPPVPP TSSMPLSKAL
RGAASTLVKR EIYSGPPDTG IAVEQRRLFE ATGINTAAIA PIRGPREVLG AVTLGRTGSQ
HPFARDDLAL LEDIARRIGL ALENARHYQR QRQVAETMQR YLLPQLPRLA GAEMTARYLP
APDVSHVGGD WYDAFPLPRG DTALVIGDVV GHDLDAAAGM AQLRNMLRAY TWAQEQSPHR
TLERMDQALE HISDVYMATL VLAHLTVDEA GRWELLWSSA GHPPPLLVHH DGIAHYLEEG
SGVLLGTGMA RPRADARIAL PPGSTLVFYT DGLVEARGQS LDTGLRRMRQ HAASLAHRPL
NSFADQLLER ARPRSNDDDA ALLVIRIPAD ATDG
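
The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that layout, compute a GC fraction, and translate a CDS. It assumes the codon assignments of NCBI translation table 11 (identical to the standard code for amino acids, with alternative start codons read as methionine at the first position). All function names are hypothetical, and the sketch is not the database's own method.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons of translation table 11 (assumed from the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An initiator codon is read as methionine regardless of its usual meaning.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    fragment = "GTGGACGAGG ATTCCGGTAC GTACAGCGAT"  # first 30 bases of the gene
    print(translate_cds(fragment))
    print(round(gc_fraction(fragment), 3))
```

Applied to the first 30 bases of the gene sequence, translate_cds returns MDEDSGTYSD, which matches the first ten residues of the protein sequence listed above. The GTG start codon is read as M only because it occupies the first position.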