Gene Ndas_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1016 
Symbol 
ID9244862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1243584 
End bp1244732 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content76% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003678965 
Protein GI297559991 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.13413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.31968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACC CCGCGGAGCA GGCCCCGCCC GCCGACGAGC CCCGGTCCTG GGCGGGCATG 
GTCGAACGGC TCGCCGAGGA CCGGGAGAAC CTCGTCGAGG ACTTCCTCCA GCGCCTGGCG
GCCCTGGGCA ACTACACCGA CGGCATGGTC CCCGACACCG ACCTGCGCCA GAGCGCCACG
GAGACCTTCG ACATGCTCAC CCGGCGGATC GCCGGGGTGC CCCTGCCCGA CCACCTGCGC
GACCTCTCCA CCCGCCTGGG CGTGCGCCGG GCCCGCCAGG GTGTGGCGCG CGAGCACCTG
CTGGAGGCGG TCCGGCTCGA CTTCCGCGTG CTCTGGGCGG GTCTGGTGCG CGCGGGCGGG
CCGGGATCCT CGCAGATCCT CGTGCTGCAC GCCGAGGAGA TCCTCACCAC GGTCGAGCAG
TACATCAGCG ACGTGCAGGC CGCCTTCCTG GAGGAGAGGG CCGCCCTGGA ACGCGACTCC
CGGGCGGCGG CGGCCCAGGC GTTCTCCCGG CTGCTCAACT CCGGGAGCCG GGCCGCCGCC
GTGGCCTCCG AGGTGGCGGG CACCCTGGGG CTCCCCGAGC ACGGGACCTT CGACGTGGCC
TTCGTCGTGC CGTCCCCGGA GCGGGAGCAG CGGCCCGCGG CCCGGGTCCG CGAACGCGGC
GGGGGCCTGG CGTGGGAGTT CGACGACGGC GTCGCCCTGG TGCGCGAGAC CGCCCGCACG
CCGTGGCCGG ACCTGCCCCC GGGGACTTCG GGGGGACTGG TCGCGCGGGT GCGGGGGCTG
GCCGCGGTCC CCGCCTCCGT CGAGGCGGCC CGCGTCCTGT CCGGCTACGC CGGCACCGGC
GGCGGGTTCG TCCGCGAGGA GGACGTCTGG TCCGCCATCG CCCACGACCA GCTGCGCGGA
CTGATCCCCG GTTTCGGACG CGACCGGGTG GAGCGCTTCG AGCGGCTCGA CGGGGACAGC
CGGGCGCGGC TGCTGGAGAC GCTGACCCAC TACGCCGCGA CGGGATCGGT CAAGGCCACC
GCCGAGGCCC TGTACTGCCA CCGCAACACG GTGGTCAACC GGCTCCAGGC GTTCCGCGAG
ACCACGGGCC TGGACCTGAC CGTCCCGGCC GAGGCGGCCC AGGCCCTGGT GCTGTTCTCG
GGCCGCTAG
 
Protein sequence
MTDPAEQAPP ADEPRSWAGM VERLAEDREN LVEDFLQRLA ALGNYTDGMV PDTDLRQSAT 
ETFDMLTRRI AGVPLPDHLR DLSTRLGVRR ARQGVAREHL LEAVRLDFRV LWAGLVRAGG
PGSSQILVLH AEEILTTVEQ YISDVQAAFL EERAALERDS RAAAAQAFSR LLNSGSRAAA
VASEVAGTLG LPEHGTFDVA FVVPSPEREQ RPAARVRERG GGLAWEFDDG VALVRETART
PWPDLPPGTS GGLVARVRGL AAVPASVEAA RVLSGYAGTG GGFVREEDVW SAIAHDQLRG
LIPGFGRDRV ERFERLDGDS RARLLETLTH YAATGSVKAT AEALYCHRNT VVNRLQAFRE
TTGLDLTVPA EAAQALVLFS GR