Gene Ndas_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1501 
Symbol 
ID9245351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1841008 
End bp1843059 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content74% 
IMG OID 
Producttranscriptional regulator 
Protein accessionYP_003679437 
Protein GI297560463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.634725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000677289 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGCGCA GGAAGACGGA ACGACAGCTG AACCTGGTCA TCTGTCTGTT GTCCACGCGC 
CGCCACCTGA CCGCTCAGGA GATCAGGAAG ACGGTGCACG GCTACGCCGA GGCCGAGACC
GAGTCCGCCT TCAAGCGCAT GTTCGAGCGC GACAAGCGCG AACTGCGCGA CAGCGGCATC
CCCATCCAGG TGGGCACCGG CGACGCCATC AGCGGCGAGG AGGGCTACCG CATCTCCCGC
TCCGACTACG AGCTGCCCGA GATCGAACTC CTGCCCGACG AGGCCCTCGT CCTGGGCCTG
GCCGCCCGCG CCTGGCGGCA CGCCTCCCTC GGCGAGGCCG CCGCCAACGC CCTGTTCAAG
CTGCGCTCCG CCGGGGTCCC CGTCGACAGC GAGGCCGCCC CCGGACTCAC GCCCGCCATG
CGCACCGACG AGCCCTCCTT CGGCCCGGTC TGGCAGGCCG TGCGCGACCG GCGCCCCGTC
TCCTTCGCCT ACCGCAAGCC CGGCCAGACC GAGGCCGAGA CCCGCGAGGT CGAACCCTGG
GGCATCGTCA ACCTGCGCGG ACACTGGTAC GTGGTCGGTC ACGACCGCCT GCGCGACGCC
CGCCGCGTCT TCCGTCTGGG CCGCATCCGC GGTGAGGTGA CCATCCTGCG CGGCGGCCCC
GACGTCGTCG TCCCCGAGGG TGTGGACCTG CGCTCGGTGG TCGCGGGCCA CAGCGCCGAG
TCCGAGCGCA CCGCCGTCCT GCGGGTGCGC ACCGACTCCG CGCACGCCCT GCGCCGCAGC
GCCCTGCGCG TGGTGCCCGG CGAGCGCGAC AGCTCCGGCG TCGGCTGGGA CACGCTCACC
CTGCCCTACA CCGACACCCC CGACCTGGTG CGGCGCGTGG CCTACTTCGG CTCCAACGCG
GTCGTCGTCG AACCCGTGGG CGCCCGTGAG GCCATGGCCG CCCACCTGAC GGCCACCGCC
GAGACGGCCC CCGCCGCCGA GGCCGCGGAC GGTCCCGGCA CGGTGGAGGA GACCGAGGCC
CCCGTCCGCG GTCCCCGCTC CGCCGACCAG CTGCGCCGCC TGCTGATGCT CGTCCCCTAC
GCCCTGGGTC GCGACGTGCG CGTGCCCGAG GTGGCCCGCC ACTTCGGGCT CAGCGAGAAG
CAGGTGGTCG CCGACCTCAG CCTGCTGTGG ATGTGCGGCC TGCCCGGCTA CACCCCCGGC
GACCTCATCG ACGTCGACCT CGACGCGGCC AGGGAGACCG GCGAGATCAT CATCGCCAAC
GCCGACACCC TCGCCCAGCC GCTGCGCCTG ACCGCCGACG AGGCGGCCAG CCTGGTCGTG
GGCCTGTCCC TGCTGGAGGC GCTGCCCGAG ACCGAGGGCC TGGAGACCGG CGCCCTCAAG
CGGGTGGGCG AGAAGCTGCG CGCCGCCGCG GGGGCCGCCG TGGGCTCGCT GGCCGACTCG
GTCCAGGTGC GCGTGGAGGG CGACGAGCAG GTCGCCCAGA CCCAGCGCCG CTGCGCCGAC
GCGCTGGAGG CGGGCCAGCG CCTGCACCTG CGCTACCTGT CGGGCTACCT GGACCAGGTC
ACCGAGCGCG ACGTGGACCC CATGGGGCTG GTGGTCCAGG ACGGCTACCC CTTCCTGGAG
GGCTACTGCC ACCTGCGCCG GGACGTGCGC CTGTTCCGCC TGGACCGGGT CCTGGAACTG
ACGGTGCTGC CCTTCGCCGC CGAGGTGCCC GCGGGGGTGG GCCGCCGCGA CCTGTCCGGG
GGCGTCCTCC AGCGCTCGGA GCGCGACGCC CTGGTCGTCC TGGACCTGGA GCCCGCGGCC
CGCTGGGTGA CCGAGGACTA CGTGTGCGAG TCGGTGACCG AACTGCCCGG GGGAGGGGTC
CGCGCCACTC TGCGCACTCC CGCTCCGGCC TGGGTGGTGC GGCTGGCGCT GCGTCTGGGG
CAGCAGGGGC GCGTGGTGTC TCCTGTGGCG CTCGCCGAGG AGGCCCGGGC CGAGGCCCGC
CGTGCGCTGG AGCACTACCG GACGTCCCAA ACCTCCGCGA AATTGGTCGA GACCACTTCG
TCGTCCGAGT GA
 
Protein sequence
MSRRKTERQL NLVICLLSTR RHLTAQEIRK TVHGYAEAET ESAFKRMFER DKRELRDSGI 
PIQVGTGDAI SGEEGYRISR SDYELPEIEL LPDEALVLGL AARAWRHASL GEAAANALFK
LRSAGVPVDS EAAPGLTPAM RTDEPSFGPV WQAVRDRRPV SFAYRKPGQT EAETREVEPW
GIVNLRGHWY VVGHDRLRDA RRVFRLGRIR GEVTILRGGP DVVVPEGVDL RSVVAGHSAE
SERTAVLRVR TDSAHALRRS ALRVVPGERD SSGVGWDTLT LPYTDTPDLV RRVAYFGSNA
VVVEPVGARE AMAAHLTATA ETAPAAEAAD GPGTVEETEA PVRGPRSADQ LRRLLMLVPY
ALGRDVRVPE VARHFGLSEK QVVADLSLLW MCGLPGYTPG DLIDVDLDAA RETGEIIIAN
ADTLAQPLRL TADEAASLVV GLSLLEALPE TEGLETGALK RVGEKLRAAA GAAVGSLADS
VQVRVEGDEQ VAQTQRRCAD ALEAGQRLHL RYLSGYLDQV TERDVDPMGL VVQDGYPFLE
GYCHLRRDVR LFRLDRVLEL TVLPFAAEVP AGVGRRDLSG GVLQRSERDA LVVLDLEPAA
RWVTEDYVCE SVTELPGGGV RATLRTPAPA WVVRLALRLG QQGRVVSPVA LAEEARAEAR
RALEHYRTSQ TSAKLVETTS SSE