Gene Ndas_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3089 
Symbol 
ID9246945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3697262 
End bp3699292 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003681004 
Protein GI297562030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGAA AACGCGCTTC AGGCGATGGT TCCGAGAGCG CTCTCGCGGA GGCGTTCTCC 
CACGCCCCCG AGCCCATGGT GCTCACCCTG GCCGACGGCG CGATCGTGCA CGCCAACCTG
GCCTTCACCG AACTCTTCGG CCGCGACGCC TCCGACCTCG ACGGAGGGGC CTGGTTCGAC
CTCCTCGACG GTGACGACGC CCAGCGCGGC GCCCGCGCCC ACCACGAGGC CCTCGCCGGG
AACCGGGGTC GCCGGTCCCG CCTGCGCCTG AAGGTCGCCG ACAAGGCCGC CCACCTGGTG
GAGGCCGAGC TGCGCCCCCT GCCCGGGGAC GGCCCGGGGG ACGCCCGCGC GGTCGTCCTG
CTGCACCTGC TCTCCACCGA GGAGGCCGAC CTGCGCGTGA TCGGCGAGCT GCGCACCGAC
AACTCCGACT CCGTGCTGTG GAGCCTGGAC CCGGCCAGCG GACGCCTGCG CGAGATGTTC
GGCCCCACCC CGCTCGGCGC GCTGCTGGCC GGGAACGACC GCGAACTCGA ACGCATCCTG
GAGCGCGTCC ACCCCGACGA CATCGCCCGC GTGCGCGACG CCATCTCCGC CTCCTTCGCC
GGGCGCGACT ACGAACAGCG CTTCCGCGTC TTCGACCGCC TCGGCGACGA GCGCTCCCTG
CACGTGCGCG CCCGCTACGT GCCAGGCGAT CCCGACCGCC TCGTGGGCAT CGTCGACGAC
GTCACCGAGC ACGTCCAGCT CGTACGCCGC CTCGCCGACC GCCGCCGCAC CGAGGCCGAG
CACGGCCGCC TGGTCACCGA ACTGTCCTCC AAACTCGTCT CGGCCACCAC CGTCGACAAG
GTCATGGACC TGCTCAGCGA GGAGTTCCTG CCCATCTTCG GCGGAGTCCA GGCCGTCGCC
CTCTACGTCG AGAACGGACT GCTGCGCAGC TCGCCCAGCG CACACGCCAA GAGCGCCACC
AGCAACATCG AGCGCATAGA CGGCCGCCGC GCCGACGACA CCGACTTCCC CATGGGCGCG
GTCATCCAGG ACCGCCAGCC CCGCTTCTTC GAGAGCCGCG CCGAGATCAT CAGCCGCTTC
CCCGCCGCCG TCGAACTCAT GCGCCAGGTG CGCGGCCAGG CCTGGGCCAC CGTGCCCATC
TTCGGCGACG GCAAGGTCGC CCTGGGCGTG TGGCAGATGG TCTGGGACCG CCCCCACCAC
GCCAGCCGCG ACGAGCGGGC CCTCATGCTC ACCTTCGCCG GACTCGCCGG GCAGGCCCTC
CAACGGATCA AGGCGCAGCA GGCCGAGCTG GAGCTGGCCG ACGCGCTCCA GCGCCGCATG
CTGCCCCGCC AGGTGGCCAG CTTCCCGGAC ATGGACATCG CCACCAAGTA CCTGCCCTCC
CGCGCGGACT GGCGCATCTG CGGCGACTTC TACGACGTCA TCGAACTCCC CGAGAACAAG
GTCGGCCTCC TGGTCGGCGA CGTCCAGGGG CACGGCGTCG AGGCGGCGGC CGCCATGGGC
CAGATCCGCG TGGCCTTCCG CGCCTACGCC ACCAACCAGT CCGACCCCGG CGTCGTCCTG
GGCGAGACCA ACCGCCTGCT CACCGAGACC GGCGAGATCG TCTTCGCCAC CTGCGGCTAC
CTCGTCGTGG ACCGCGAGAG CGGCGTCATG CAGGCGGCCT GGGCCGGACA GCCGCCGGTC
GTCCTGGCCA CCCGGAGCGG CTACGAGCTC TGGGAGCCCG AGACCGGCCC GCCGCTGGGC
GTGCTCTCCG ACCCCGAGTA CGCCGTCACC ACCCGCATGC TCCCGCCCGG CACCGCCCTG
CTGCTGTGCT CCGACGGACT CGTGGAGAGC TCCGAGGTGC CCATGGGCGA GGGGCTGGCC
CGGGTCGGCG CGGCGCTCTC CGAGCACCAC GAGGACCCCG AGGCCGCCGC GCACGTCATC
GCCGAGATGG CCCCGGCCGG GCGCGGCGAC GACATCGCCC TGCTCATCAC CCGCATGATC
CCCTCCACCG AGCCGGCGCG GGACCGCGCC GCCGTGGCGA CCGCGACCTG A
 
Protein sequence
MNGKRASGDG SESALAEAFS HAPEPMVLTL ADGAIVHANL AFTELFGRDA SDLDGGAWFD 
LLDGDDAQRG ARAHHEALAG NRGRRSRLRL KVADKAAHLV EAELRPLPGD GPGDARAVVL
LHLLSTEEAD LRVIGELRTD NSDSVLWSLD PASGRLREMF GPTPLGALLA GNDRELERIL
ERVHPDDIAR VRDAISASFA GRDYEQRFRV FDRLGDERSL HVRARYVPGD PDRLVGIVDD
VTEHVQLVRR LADRRRTEAE HGRLVTELSS KLVSATTVDK VMDLLSEEFL PIFGGVQAVA
LYVENGLLRS SPSAHAKSAT SNIERIDGRR ADDTDFPMGA VIQDRQPRFF ESRAEIISRF
PAAVELMRQV RGQAWATVPI FGDGKVALGV WQMVWDRPHH ASRDERALML TFAGLAGQAL
QRIKAQQAEL ELADALQRRM LPRQVASFPD MDIATKYLPS RADWRICGDF YDVIELPENK
VGLLVGDVQG HGVEAAAAMG QIRVAFRAYA TNQSDPGVVL GETNRLLTET GEIVFATCGY
LVVDRESGVM QAAWAGQPPV VLATRSGYEL WEPETGPPLG VLSDPEYAVT TRMLPPGTAL
LLCSDGLVES SEVPMGEGLA RVGAALSEHH EDPEAAAHVI AEMAPAGRGD DIALLITRMI
PSTEPARDRA AVATAT