Gene Ndas_4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4166 
Symbol 
ID9248040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4974455 
End bp4976752 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003682067 
Protein GI297563093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.299427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGGT GGGGTGAGCC CTTCCCCAAG GGTGACGCGC TCGCCGCGGC CGCCGTGGCG 
GACCAGGCGC AGATCGCCAT CGTCGTCATC GACCGGGCCA GCCTCCTGCG CTACTGGAAC
CCCTTCGCGC GCGAGCTGTT CGGGTTCGTG GGCGGTCGCG ACTACGTCGG CCTCTCCCTG
CTCGACATCG GTATCCACGA GTCCGACCGG GAGCACGCCT CCCAGCTGGC CCGGCGCGTC
CTGCGCGGCG AGCCGTGGGA GGGCACCTTC GCGGTGCTCC GGGGCGACTC GACCTGGATC
CACGTGCGCG CCCAGGCCGT GCCCATGCAC AACGAGGCCG GCGAGATCGA CGGCATCACC
CTGATCGCCA GGGAGGCCCT GCGCAGCGGA CGGGTGGAGG AGCAGTACGG CCTGCTGGAG
CGCATCGGCA GCCGCCTGAC CAGCTCCCTG GAGTTCGACT CCACGGTCAG GGGGGTCGCG
GGCATCCTCG TGCCGCAGTT CGCCGACCAC TGCTTCATCG ACCTCTACGA CCGCGACCGC
CTGGTCAGAC AGGTGTCGGT GCACGCCGAG GGGTGGACGC CCCCGCCGCG CACCTGGTTC
GAGGTCGGCG ACGAGGTCCG CTACCCCGAA CGCCACTTCG TCACCCAGGC CCTGCGCCGC
CTGGAGACGG TGGTCAGCAG CGAGTACCTG TTCGAGAACT CGCCCAGCAC GCGCTCCGAC
CAGGTCTCGC GCCAGGTCGG CGTCACCTCC GCCATCGCCG CCCCGCTGCG GGCCCGCGGC
GAGGTGCTGG GCGTGCTCAC CCTGGCGCTG TCCGGGCTCT CCCCGCGGCA GAAGAGCACC
TACGGGGGCT TCGACCGCGA CCTCGTCGGC GCCATCGCCT CCCGCGTGGC CCTGGCCATC
GACAACGCCC GCCTGTTCGA GCAGGAGCGC AGCACCGCGC TGGCCTTCCA GCGCAGCCTG
CTGCCCAGCA GCCTGCCCCG GCTGGACGGG CTGACCGCGG CCCACCGCTA CCTGCCCGCG
GGACCGCTGC GCTCCGACGG GTACGGGGTC CAGACCCAGA TCGGCGGGGA CTTCTACGAC
GCCATCCCGC TCTCCGCCGG ACGCGTGGGC CTGGTCATCG GCGACGTCGA GGGCCGCGGC
CCGCACGCCG CCGCCGTCAT GGGCCAGCTG CGCGCCGCGC TGCGCGCCTT CGCCCAGGCC
GACCGCGAGC CCGCCGACAT CCTGCGCGAA CTCGACGAGT GGGTCCGCCA GCTGGGCCAG
GAGGACGAGG AGGGCGGCAC CTGGATCCCC AGCGTCAGCT GCCTGTACAT GGTCTACGAC
GCCTGGTCGC GCGAGCTGTC CTACGCCAAC GCCGGGCACG CGCCCCCGCT GCTGGTCACC
TCCGAGTCGG TGGAGAAGAT CGACCTGGAG GTCACCGACC GCATGCTGGG CGTGCGGGCC
AAGGGCGGAT CGGGCGAGGA CGTGGTCTAC CACCAGGCCA ACCTGCGCCT GCCCATCGGC
GCCACCCTGG TCCTGTACAC CGACGGGCTC GTGGACCGGC GCCCGGCCGG GGGCAGGGCG
GACCCCGAGA GCGCCTTCGA GCTGCTGGCG GAGCGGGTGG CCGAGGTCGC CGACAAGGAC
GTGGACCAGA TCGCCGAGGC GGCCGTGCAC AGCGTCCCGG GCGAACACGA CGACGACACC
GCTCTGCTGG TCGTGCGCAC CCACTCCGAG GAGCTGGCGC TGCGCGAGGG CTGGTTCCCC
TCGGAGGCCT CCACCGTGGG CGAGGCCCGC CACATGGCGG CCCACACCTT CAGCGAGTGG
GGGGTGGACC GCGACCAGGC CGAGCTGGCC TGCCTGCTGG TCTCGGAGAT CGTCACCAAC
GTCGTCATCC ACGCCACCCC GCACCCGGTT CACCGCGAGT TCACCGGCGG AGGGGTGCTG
GACTCCGAGA CCCCGCTCGA CGCGGTGGCC GACGAGTTCG ACGAGGACTG GACCGACCTG
CTGGAGGCGG TGGCCGAGGA GGCCGACGAA CCCGCCGAGG AGGCCGGGGA GAAGGAGTTC
CTGCTGCGGC TGCGGCGGGG CGCGAACACG GTGTGGGTGG AGGTCTTCGA CAACGACCTG
CGCCTGCCCC GCATCCGCAG CGCCGCCGCC GACGACGAGG GCGGCCGCGG CCTGTACCTG
GTCGAGCAGC TGGCCAGCCG GTGGGGGGCG CGGCCGACCC CGGACGGCAA GGCGGTCTGG
TTCGAGATGC CCATGCACTC CGAGGAGACC GGGGAGCAGC ACAGGGCCGA CGACGGGGAG
AAGGCCGGGC GGGACTAG
 
Protein sequence
MTRWGEPFPK GDALAAAAVA DQAQIAIVVI DRASLLRYWN PFARELFGFV GGRDYVGLSL 
LDIGIHESDR EHASQLARRV LRGEPWEGTF AVLRGDSTWI HVRAQAVPMH NEAGEIDGIT
LIAREALRSG RVEEQYGLLE RIGSRLTSSL EFDSTVRGVA GILVPQFADH CFIDLYDRDR
LVRQVSVHAE GWTPPPRTWF EVGDEVRYPE RHFVTQALRR LETVVSSEYL FENSPSTRSD
QVSRQVGVTS AIAAPLRARG EVLGVLTLAL SGLSPRQKST YGGFDRDLVG AIASRVALAI
DNARLFEQER STALAFQRSL LPSSLPRLDG LTAAHRYLPA GPLRSDGYGV QTQIGGDFYD
AIPLSAGRVG LVIGDVEGRG PHAAAVMGQL RAALRAFAQA DREPADILRE LDEWVRQLGQ
EDEEGGTWIP SVSCLYMVYD AWSRELSYAN AGHAPPLLVT SESVEKIDLE VTDRMLGVRA
KGGSGEDVVY HQANLRLPIG ATLVLYTDGL VDRRPAGGRA DPESAFELLA ERVAEVADKD
VDQIAEAAVH SVPGEHDDDT ALLVVRTHSE ELALREGWFP SEASTVGEAR HMAAHTFSEW
GVDRDQAELA CLLVSEIVTN VVIHATPHPV HREFTGGGVL DSETPLDAVA DEFDEDWTDL
LEAVAEEADE PAEEAGEKEF LLRLRRGANT VWVEVFDNDL RLPRIRSAAA DDEGGRGLYL
VEQLASRWGA RPTPDGKAVW FEMPMHSEET GEQHRADDGE KAGRD