Gene Ndas_1805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1805 
Symbol 
ID9245655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2206100 
End bp2208226 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content75% 
IMG OID 
Productputative regulator protein 
Protein accessionYP_003679739 
Protein GI297560765 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.569489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTCCC CCGGTCACAC CTCCCCGACC AACGACCCAA GCGCCCGGAC CAACACCGTG 
CGCGGCGACA TCAGCGGAAC CGCCGTCCAG GTCGGCTCGC TCCGCGGGGA CGTGAACCTG
CACCTGGCAT CCGCGGCCGA CCGCGTCCCC CGCCAACTCC CCGCCGCGCC CGCCGTCTTC
GTGGACCGGC CCCATCCGAT GCGGGCGCTG GAGGCGGCCG TGTCGCGCGC CCACGCCCGG
GGCCGCCACG CACTGGTGGT GATCACGGGC TCGGCGGGGG TGGGCAAGTC CGCGCTGGCG
GCGGAGTTCC TGCGCGCGCA CCCGGAACTG GGCGGAGGCG GACAGCTGTT CGTGGACCTG
CACGGCTTCT CCCCCGGACA GCCCGCCGAC CCCCACGACG TGCTGGACGT GCTGCTCCGC
GCCGTGGGCG TGCAGATCGG GACGGCACCG GCGGACCTCG CCACACGCGC GGCCTGGTGG
AGGTCGGCCA CCGCGCACGC GCCCGTGTCG CTGCTGTTGG ACGACGCCCT CTCCGCCGCC
CAGGTCCGCG CCCTGCTGCC CGGGGGCGAC CGCGGCACGG TGGTGGTCAC CACGCGTCTG
CGGCTGGCCG GGCTGCGCCT GGACGGCGGG CAGTTCGTGG ACCTGCATCC GATGACGGGG
GCCGAGGGGG TGGCCCTGCT CTCCGAACTC AGCGGCCGCT CCCCCGCCGG AGAGGCCGAC
CGGAAGGCCA TGCACGGCGT CGTCACCCTG TGTTCGGCAC TGCCGGTGGC CGTGTGCACG
GTCGCCGTCG ACCACGGCAC CCGCAGCCGG ACCTGGCGGG ACACCGAGCG GCGCCTGCGC
TCCGCCCCCC GTCGGCTCGA CCACCTGGAC ACCGCGAGCA GGGAAATGGG TATGGACGTG
TCCGTGCGCA ACGCCTTCGA CCTGTCCTAC GCCTCCCTCC CGGTCTTTCC CGCCCTGCTC
TACCGGCGTC TGGCCTGGCA CCCGGGCCCC GACGTCACCG GTGAGCTGGC CGCCCACCTC
ACCGGCCGCC CGGCGGCCGA GTGCGAGGCG GGGCTGGCCG CCCTCACCCG CCACCACCTG
CTGACCGAGC ACAGCGCCGA ACGCTTCTCC TTCCACGACC TGCTGCGCCT GCACGCCGCG
GAGAAGGCCG AGGCCGAGGA GGACTCCGGC GCCCGCCACC GGTCGCTGAC GCGGCTGGTG
TCCGGGTTCG CCGACCTGGC GGTGTCGGCC GACGCCGTGC TGCGGCCCTA CGCGGGCAAC
CGGGCGGTGC CGGGCTCACC CTTCACCGAC GCCGCGCAGG CCACGGCCTG GCTCAACCGG
GAGCGCGACA ACCTCGCCGC CCTGGCCGAA CTGGCGCCGC GGCTGGGAGC GCACGAGCAC
GTGCCGCGCC TGCTGGACGG CCTGTGGTCG CTGTTCCTGC ACCACGGTCA GGCGCGGTTG
TGGCTGCGTG CCGCCGCGCC CGTGCTCGGG GAGTCCTCGG CCGACCTGGA GGAGACGACG
GTCGCACGGC TGCTGAACAA CCGCGCGCTG GTGCACAGCC ACCTCGCCGG TGTCGAGGAG
GCGATGGCCG ACCTGGACGC CGCCGAACGG GTCTGGCGGC GCCACCAGGA CCTGGAACGC
CTGGCCCAGA CCCAGCAGCG CCGGGGCATC GTGGCCTTCC AGAACCACCT CCCCGGGCAG
GCCGCCGACC ACCTCGCCCG GGCCGTGGCG ACGGACGAGC GGACCGGGGT CGCCCACAAC
CTGGCCATCA GCCTGTTCAT GCTCGGCCGG GCCCGCCACG CCCTCGGGGA CCTCGGCCCG
GCCCGCCTGG CCCTGGAGCG CGCCCTCCCC CTCCTGGACG GGGACGCCTA CAACCAGGCC
CGGACCAGGA TCGTCCTGGG CACCGTCCTC GCCGCCATGG AGGAGTTCGC GTCCGCATCG
GCGGAACTGG ACCGGGCGCT GGAGGTGATG CGCGAGCGCG GCTCGGCCTC CGGCCAGGGC
AAGGCCCTGG AGGCGATCGG GGAACTCGCC GAACGCCGCG GGGAGCCGGG CCGGGCCCGC
GAGGCCTACG AGCGGGCGGT GGAGCTGCTG CTGCCGACGG ACCCGGCCAG ACTGCGTGTG
GAGGAACGCC TGGAGGCGTT GGGATGA
 
Protein sequence
MDSPGHTSPT NDPSARTNTV RGDISGTAVQ VGSLRGDVNL HLASAADRVP RQLPAAPAVF 
VDRPHPMRAL EAAVSRAHAR GRHALVVITG SAGVGKSALA AEFLRAHPEL GGGGQLFVDL
HGFSPGQPAD PHDVLDVLLR AVGVQIGTAP ADLATRAAWW RSATAHAPVS LLLDDALSAA
QVRALLPGGD RGTVVVTTRL RLAGLRLDGG QFVDLHPMTG AEGVALLSEL SGRSPAGEAD
RKAMHGVVTL CSALPVAVCT VAVDHGTRSR TWRDTERRLR SAPRRLDHLD TASREMGMDV
SVRNAFDLSY ASLPVFPALL YRRLAWHPGP DVTGELAAHL TGRPAAECEA GLAALTRHHL
LTEHSAERFS FHDLLRLHAA EKAEAEEDSG ARHRSLTRLV SGFADLAVSA DAVLRPYAGN
RAVPGSPFTD AAQATAWLNR ERDNLAALAE LAPRLGAHEH VPRLLDGLWS LFLHHGQARL
WLRAAAPVLG ESSADLEETT VARLLNNRAL VHSHLAGVEE AMADLDAAER VWRRHQDLER
LAQTQQRRGI VAFQNHLPGQ AADHLARAVA TDERTGVAHN LAISLFMLGR ARHALGDLGP
ARLALERALP LLDGDAYNQA RTRIVLGTVL AAMEEFASAS AELDRALEVM RERGSASGQG
KALEAIGELA ERRGEPGRAR EAYERAVELL LPTDPARLRV EERLEALG